Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 History  





2 Overview  



2.1  Component collections  





2.2  Custom scripts  







3 References  














Pipeline Pilot







 

Edit links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


Developer(s)Accelrys
Initial release1999 (1999)
Stable release

18.1 / May 2018 (2018-05)

Written inC++
Operating systemWindows and Linux
TypeVisual and dataflow programming language
LicenseProprietary
Websiteaccelrys.com/products/collaborative-science/biovia-pipeline-pilot/ Edit this on Wikidata

Pipeline Pilot is a desktop software program sold by Dassault Systèmes for processing and analyzing data. It was originally used for its basic ETL (extract, transform, and load) and analytics capabilities, which have broadened over time.

The program has the ability to design data workflows using a graphical user interface. It is a visual and dataflow programming software and has been used in cheminformatics, QSAR,[1][2][3] Next Generation Sequencing,[4] image analysis,[5][6] and text analytics.

History[edit]

Pipeline Pilot was created by SciTegic. BIOVIA subsequently acquired SciTegic and Pipeline Pilot in 2004. BIOVIA was itself purchased by Dassault Systèmes in 2014.

The product expanded from an initial focus on chemistry to include general extract, transform, and load (ETL), analytical, and data processing collection capabilities.

Overview[edit]

Pipeline Pilot is part of a class of software products that provide user interfaces for manipulating and analyzing data. Like other graphical ETL products, it enables users to pull from different data sources, such as CSV files, text files, and databases.

The graphical user interface, called the Pipeline Pilot Professional Client, allows users to drag and drop discrete data processing units called "components". Components can load, filter, join, or manipulate data. Components can also build regression models, train neural networks, or process datasets into PDF reports. Pipeline Pilot implements a component paradigm. Components are represented as nodes in a workflow. In a mathematical sense, components are modeled as nodes in a directed graph: "pipes" (graph edges) connect components and move data along from node to node, where operations are performed on the data.

Users can choose from components that come pre-installed or create their own components in workflows called "protocols". Protocols are sets of linked components. Protocols can be saved, reused, and shared. Users can mix and match components that are provided with the software from BIOVIA with their own custom components. Connections between two components are called "pipes", and are visualized in the software as two components connected by a pipe. Data flows from left to right along the pipes. Pipeline Pilot can visually condense a series of data manipulations that involve many components.

Component collections[edit]

Pipeline Pilot features a number of add-ons called "collections". Collections are groups of specialized functions like processing genetic information or analyzing polymers offered to end users for an additional licensing fee. Currently, there are a number of these collections.[7]

Group Domain Component collection
Science specific Chemistry Chemistry
ADMET
Cheminformatics
Biology Gene Expression
Sequence Analysis
Mass Spectrometry for Proteomics
Next Generation Sequencing
Materials Modeling & Simulation Materials Studio
Polymer Properties (Synthia)
Generic Reporting & Visualization Reporting
Database & Application Integration Integration
Imaging Imaging
Analysis & Statistics Data Modeling
Advanced Data Modeling
R Statistics
Document Search & Analysis Chemical Text Mining
Text Analytics
Laboratory Plate Data Analytics
Analytical Instrumentation

Custom scripts[edit]

Pipeline Pilot is often used when one or more large (1TB+) and/or complex datasets are processed. Early in its development, Pipeline Pilot created a scripting language called "PilotScript" that enabled end users to write basic programming scripts that could be incorporated into a Pipeline Pilot protocol. Later releases extended support for a variety of programming languages, including Python, .NET, Matlab, Perl, SQL, Java, VBScript and R.[8] The product supports a number of APIs for different programming languages that can be executed without the program's graphical user interface.

The syntax for PilotScript is based on PLSQL. It can be used in components such as the Custom Manipulator (PilotScript) or the Custom Filter (PilotScript). As an example, the following script can be used to add a property named "Hello" to each record passing through a custom scripting component in a Pipeline Pilot protocol. The value of the property is the string "Hello World!".

Hello := "Hello World!";

References[edit]

  1. ^ Hassan, Moises; Brown, Robert D.; Varma-O'Brien, Shikha; Rogers, David (2007). "Cheminformatics Analysis and Learning in a Data Pipelining Environment". ChemInform. 38 (12). doi:10.1002/chin.200712278. ISSN 0931-7597.
  • ^ Hu, Ye; Lounkine, Eugen; Bajorath, Jürgen (2009). "Improving the Search Performance of Extended Connectivity Fingerprints through Activity-Oriented Feature Filtering and Application of a Bit-Density-Dependent Similarity Function". ChemMedChem. 4 (4): 540–548. doi:10.1002/cmdc.200800408. ISSN 1860-7179. PMID 19263458. S2CID 35868099.
  • ^ Warr, Wendy A. (2012). "Scientific workflow systems: Pipeline Pilot and KNIME". Journal of Computer-Aided Molecular Design. 26 (7): 801–804. Bibcode:2012JCAMD..26..801W. doi:10.1007/s10822-012-9577-7. ISSN 0920-654X. PMC 3414708. PMID 22644661.
  • ^ "Accelrys Enters Next Generation Sequencing Market with NGS Collection for Pipeline Pilot". Business Wire. 2011-02-23. Retrieved 15 February 2013.
  • ^ Rabal, Obdulia; Link, Wolfgang; G. Serelde, Beatriz; Bischoff, James R.; Oyarzabal, Julen (2010). "An integrated one-step system to extract, analyze and annotate all relevant information from image-based cell screening of chemical libraries". Molecular BioSystems. 6 (4): 711–20. doi:10.1039/b919830j. ISSN 1742-206X. PMID 20237649.
  • ^ Paveley, Ross A.; Mansour, Nuha R.; Hallyburton, Irene; Bleicher, Leo S.; Benn, Alex E.; Mikic, Ivana; Guidi, Alessandra; Gilbert, Ian H.; Hopkins, Andrew L.; Bickle, Quentin D. (2012). "Whole Organism High-Content Screening by Label-Free, Image-Based Bayesian Classification for Parasitic Diseases". PLOS Neglected Tropical Diseases. 6 (7): e1762. doi:10.1371/journal.pntd.0001762. ISSN 1935-2735. PMC 3409125. PMID 22860151.
  • ^ "Pipeline Pilot Component Collections". Accelrys. Archived from the original on January 15, 2013. Retrieved 26 January 2013.
  • ^ "Pipeline Pilot Integration Component Collection Datasheet" (PDF). Accelrys. Retrieved 8 February 2013.

  • Retrieved from "https://en.wikipedia.org/w/index.php?title=Pipeline_Pilot&oldid=1232920304"

    Categories: 
    Science software
    Enterprise application integration
    Extract, transform, load tools
    Bioinformatics software
    Computational chemistry software
    Computer vision software
    Data analysis software
    Data mining and machine learning software
    Data visualization software
    Laboratory software
    Mass spectrometry software
    Natural language processing software
    Numerical software
    Plotting software
    Proprietary software
    Visual programming languages
    Hidden categories: 
    Wikipedia articles with possible conflicts of interest from February 2013
    Articles with a promotional tone from February 2013
    All articles with a promotional tone
    Articles that may contain original research from February 2013
    All articles that may contain original research
    Articles with multiple maintenance issues
     



    This page was last edited on 6 July 2024, at 09:43 (UTC).

    Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Mobile view



    Wikimedia Foundation
    Powered by MediaWiki