data-pipeline
Here are 9 public repositories matching this topic...
Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline
-
Updated
Jun 18, 2020 - HTML
-
Updated
Mar 15, 2018 - HTML
As a Data Engineer for a fictional E-commerce startup, this project addresses the task of analyzing the web server logs to find the number of product pages visited and the number of items in the cart.
-
Updated
Mar 28, 2021 - HTML
UC Davis Distributed Computing with Spark SQL (with Databricks) and Databricks Apache Spark SQL for Data Analysts
-
Updated
Jul 10, 2021 - HTML
A data-pipeline for high-resolution power meter data
-
Updated
Aug 10, 2021 - HTML
Bringing you the posts that matter.
-
Updated
Feb 2, 2018 - HTML
-
Updated
Sep 16, 2021 - HTML
A data-pipeline for high-resolution power meter data
-
Updated
Jul 3, 2019 - HTML
Improve this page
Add a description, image, and links to the data-pipeline topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-pipeline topic, visit your repo's landing page and select "manage topics."


We're using marshmallow to parse whylogs config from YAML
However, Pydantic is much more powerful as it allows users to set config via various mechanims, from YAML, JSON to Environment settings.
We should consider moving to pydantic