Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 What is RapidMiner ?  



1.1  What is it?  





1.2  What is it for?  





1.3  How does it work?  





1.4  Where to get it?  







2 Software Versions  





3 Who uses RapidMiner?  





4 Properties  





5 Extensions  





6 See also  





7 References  





8 External links  














RapidMiner: Difference between revisions






العربية
Deutsch
Español
فارسی

Bahasa Indonesia

Português
Türkçe
 

Edit links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 




Print/export  







In other projects  



Wikimedia Commons
 
















Appearance
   

 





Help
 

From Wikipedia, the free encyclopedia
 


Browse history interactively
 Previous editNext edit 
Content deleted Content added
m Notability is okay (see KDNuggets), but the main article part is advertisement not encyclopedic.
Line 74: Line 74:

== See also ==

== See also ==

* [[Weka (machine learning)|Weka]] - machine learning algorithms that can be integrated into RapidMiner

* [[Weka (machine learning)|Weka]] - machine learning algorithms that can be integrated into RapidMiner

* [[R-Project]] - statistical framework that can be integrared into RapidMiner

* [[R-Project]] - statistical framework that can be integrated into RapidMiner



== References ==

== References ==


Revision as of 13:20, 15 February 2011

RapidMiner (formerly YALE (Yet Another Learning Environment)) is an open source environment for machine learning, data mining, text mining, predictive analytics, and business analytics. Data mining processes in RapidMiner can be made up of a large number of arbitrarily nestable operators, described in XML files which are created with RapidMiner's graphical user interface. RapidMiner is used for research, education, training, rapid prototyping, application development, and industrial deployments. In a poll by the data-mining newspaper KDnuggets, RapidMiner ranked second between data mining/analytic tools used for real projects in 2009[1] and first in 2010.[2]

The RapidMiner open source project, formely named YALE, was initiated by Ralf Klinkenberg, Ingo Mierswa, and Simon Fischer. The initial version of RapidMiner was developed by the Artificial Intelligence Unit of University of Dortmund since 2001. It is distributed under the AGPL license, and has been hosted by SourceForge since 2004. In 2006, Ingo Mierswa and Ralf Klinkenberg founded the company Rapid-I that now supports the further development of RapidMiner as main contributor. In addition, more than 30 developers word-wide contribute improvements and extensions to the software.

RapidMiner provides more than 600 operators for all main data mining and machine learning procedures, including import, export, data loading and transformation (ETL), data preprocessing and visualization, modelling, evaluation, and deployment. RapidMiner is written in the Java programming language and therefore runs on all popular operating systems. It also integrates learning schemes and attribute evaluators of the Weka machine learning environment and statistical modelling schemes of the R-Project. According to SourceForge, RapidMiner is used in more than 60 countries world-wide.

What is RapidMiner ?

What is it?

The Community Edition of RapidMiner (formerly "Yale") is an open source toolkit for data mining. Its strengths reside in part in its ability to easily define analytical steps (especially when compared with R), and in generating graphs more easily[citation needed] than e.g., R, or more effectively[citation needed] than MS Excel.

What is it for?

RapidMiner is well suited[citation needed] for analyzing data generated by high-throughput instruments, e.g., genotyping, proteomics, and mass spectrometry.

Example applications:

Notable selected features of RapidMiner:

How does it work?

RapidMiner provides a GUI to design an analytical pipeline (the "operator tree" in RapidMiner parlance). The GUI generates an XML (eXtensible Markup Language) file that defines the analytical processes the user wishes to apply to the data. This file is then read by RapidMiner to run the analyses automatically.

While these are running, the GUI can also be used to interactively control and inspect running processes.

Other ways of using RapidMiner involve calling RapidMiner from e.g., a Perl program. The Java application programming interface ("API") provides clear interfaces for applying operators individually (i.e., no need to create an operator tree), providing the ability to bypass the GUI and controlling analytical processes directly.

Last, one can also call individual RapidMiner functions directly from the command line.

Where to get it?

RapidMiner runs on Java, it can be installed on any computer on which Java runs. I can run as a GUI application or also as a command-line tool on a server.

Software Versions

Although the core of RapidMiner is open-source and is offered free of charge as a "Community Edition", there is also "Enterprise Edition", that is, according to the site, "Community Edition + More Features + Services + Guarantees"[1] RapidMiner source is also offered under proprietary commercial license, to allow integration in closed-source solutions.

Who uses RapidMiner?

RapidMiner flexibility allows it use with text mining, multimedia mining, feature engineering, data stream mining and tracking drifting concepts, development of ensemble methods, and distributed data mining. RapidMiner is found in the Electronic Industry, Energy Industry, Automobile Industry, Commerce, Aviation, Telecommunications, Banking and Insurance, Production, IT Industry, Market Research, Pharmaceutical Industry, Universities and other Miscellaneous businesses (i.e. sports teams, train station, police station). For specific examples of each business area can be referenced # Reference: [2]

Properties

Some properties of RapidMiner are:

Extensions

The Rapdiminer can be easily extended by additional plugins.

Today the Rapidminer contain more than 15 extensions, which advances scope of its aplicability to: text mining, image processing, time series processing, web mining, statistics, visualization, semantics, paralelization of computation process, automatic process design (PaREn Automatic System Construction Wizard) and others).

Several of the extensions can be found directly in the application in so-called Extension manager. The other extensions can be downloaded from webs of their respective developers.

See also

References

  1. ^ "Data Mining Tools Used Poll (May 2009)".
  • ^ "Data Mining / Analytic Tools Used Poll (May 2010)".
  • External links


    Retrieved from "https://en.wikipedia.org/w/index.php?title=RapidMiner&oldid=414058556"

    Categories: 
    Free software programmed in Java
    Machine learning
    Artificial intelligence
    Data mining
    Free data analysis software
    Image processing
    Computer vision software
    Java platform
    Research and development
    Neuroimaging software
    Neurobiology
    Data analysis software
    Hidden categories: 
    Articles with multiple maintenance issues
    Pages using multiple issues with unknown parameters
    Wikipedia articles with style issues from February 2011
    All articles with style issues
    All articles with unsourced statements
    Articles with unsourced statements from November 2010
     



    This page was last edited on 15 February 2011, at 13:20 (UTC).

    This version of the page has been revised. Besides normal editing, the reason for revision may have been that this version contains factual inaccuracies, vandalism, or material not compatible with the Creative Commons Attribution-ShareAlike License.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Mobile view



    Wikimedia Foundation
    Powered by MediaWiki