Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 Overview  





2 Managed providers  





3 References  





4 External links  














Apache Airflow






Español
Français


Русский
Shqip
Українська
 

Edit links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 




In other projects  



Wikimedia Commons
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


Apache Airflow
Original author(s)Maxime Beauchemin / Airbnb
Developer(s)Apache Software Foundation
Initial releaseJune 3, 2015; 9 years ago (2015-06-03)
Stable release2.8.2[1] Edit this on Wikidata (26 February 2024; 4 months ago (26 February 2024)) [±]
Repository
Written inPython
Operating systemWindows, macOS, Linux
TypeWorkflow management platform
LicenseApache License 2.0
Websiteairflow.apache.org

Apache Airflow is an open-source workflow management platform for data engineering pipelines. It started at Airbnb in October 2014[2] as a solution to manage the company's increasingly complex workflows. Creating Airflow allowed Airbnb to programmatically author and schedule their workflows and monitor them via the built-in Airflow user interface.[3][4] From the beginning, the project was made open source, becoming an Apache Incubator project in March 2016 and a top-level Apache Software Foundation project in January 2019.

Airflow is written in Python, and workflows are created via Python scripts. Airflow is designed under the principle of "configuration as code". While other "configuration as code" workflow platforms exist using markup languages like XML, using Python allows developers to import libraries and classes to help them create their workflows.

Overview

[edit]

Airflow uses directed acyclic graphs (DAGs) to manage workflow orchestration. Tasks and dependencies are defined in Python and then Airflow manages the scheduling and execution. DAGs can be run either on a defined schedule (e.g. hourly or daily) or based on external event triggers (e.g. a file appearing in Hive[5]). Previous DAG-based schedulers like Oozie and Azkaban tended to rely on multiple configuration files and file system trees to create a DAG, whereas in Airflow, DAGs can often be written in one Python file.[6]

Managed providers

[edit]

Three notable providers offer ancillary services around the core open source project. Astronomer has built a SaaS tool and Kubernetes-deployable Airflow stack that assists with monitoring, alerting, devops, and cluster management.[7] Cloud Composer is a managed version of Airflow that runs on Google Cloud Platform (GCP) and integrates well with other GCP services.[8] Starting from November 2020, Amazon Web Services offers Managed Workflows for Apache Airflow.[9]

References

[edit]
  1. ^ https://airflow.apache.org/docs/apache-airflow/stable/release_notes.html#airflow-2-8-2-2024-02-26. {{cite web}}: Missing or empty |title= (help)
  • ^ "Apache Airflow". Apache Airflow. Archived from the original on August 12, 2019. Retrieved September 30, 2019.
  • ^ Beauchemin, Maxime (June 2, 2015). "Airflow: a workflow management platform". Medium. Archived from the original on August 13, 2019. Retrieved September 30, 2019.
  • ^ "Airflow". Archived from the original on July 6, 2019. Retrieved September 30, 2019.
  • ^ Trencseni, Marton (January 16, 2016). "Airflow review". BytePawn. Archived from the original on February 28, 2019. Retrieved October 1, 2019.
  • ^ "AirflowProposal". Apache Software Foundation. March 28, 2019. Retrieved October 1, 2019.
  • ^ Lipp, Cassie (July 13, 2018). "Astronomer is Now the Apache Airflow Company". americaninno. Retrieved September 18, 2019.
  • ^ "Google launches Cloud Composer, a new workflow automation tool for developers". TechCrunch. Retrieved 2019-09-18.[permanent dead link]
  • ^ "Introducing Amazon Managed Workflows for Apache Airflow (MWAA)". Amazon Web Services. 2020-11-24. Retrieved 2020-12-17.
  • [edit]
    Retrieved from "https://en.wikipedia.org/w/index.php?title=Apache_Airflow&oldid=1230723042"

    Categories: 
    Apache Software Foundation projects
    Big data products
    Hidden categories: 
    CS1 errors: missing title
    CS1 errors: bare URL
    All articles with dead external links
    Articles with dead external links from September 2023
    Articles with permanently dead external links
    Articles with short description
    Short description is different from Wikidata
     



    This page was last edited on 24 June 2024, at 09:55 (UTC).

    Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Mobile view



    Wikimedia Foundation
    Powered by MediaWiki