Google Cloud Dataflow







Google Cloud Dataflow is a fully managed service for executing Apache Beam pipelines within the Google Cloud Platform ecosystem. Its features include autoscaling, dynamic work rebalancing, and a managed execution environment.[1]

Dataflow is suited to large-scale, continuous data processing jobs and is one of the major components of Google's big data architecture on Google Cloud Platform.[2]

History

Google Cloud Dataflow was announced in June 2014[3] and released to the general public as an open beta in April 2015.[4] In January 2016, Google donated the underlying SDK, the implementation of a local runner, and a set of I/O connectors for accessing Google Cloud Platform data services to the Apache Software Foundation.[5] The donated code formed the original basis for Apache Beam.

In August 2022, an incident broke user timers for certain Dataflow streaming pipelines in multiple regions; it was later resolved.[6] Further updates and incidents affecting Dataflow during 2023 and 2024 are documented in the release notes and the service health history.[7]

References

  1. "Cloud Dataflow Runner". beam.apache.org. Retrieved 2024-07-03.
  2. "GCP Dataflow and Apache Beam for ETL Data Pipeline". EPAM Anywhere. Retrieved 2024-07-03.
  3. "Sneak peek: Google Cloud Dataflow, a Cloud-native data processing service". Google Cloud Platform Blog. Retrieved 2018-09-08.
  4. "Google Opens Cloud Dataflow To All Developers, Launches European Zone For BigQuery". TechCrunch. Retrieved 2018-09-08.
  5. "Google wants to donate its Dataflow technology to Apache". VentureBeat. Retrieved 2019-02-21.
  6. "Google Cloud Service Health". status.cloud.google.com. Retrieved 2024-07-03.
  7. "Dataflow enhancements in 2023". Google Cloud Blog. Retrieved 2024-07-03.

    This page was last edited on 3 July 2024, at 08:28 (UTC).
