dataengineering
Here are 218 public repositories matching this topic...
Describe the bug
When updating anomaly_params for a KPI, passing null as anomaly_params causes an HTTP 500 response.
Explain the environment
- Chaos Genius version: chaos-genius/chaos_genius@9e2ac69 reproduced on test builds from Develop branch
Current behavior
HTTP 500
Expected behavior
should cause valida
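The report is cut off before it states the expected behavior in full. As a minimal sketch only, assuming the intent is to reject a null value before it reaches the update logic, a check of this kind would return a client error instead of an HTTP 500. The function name `validate_anomaly_params`, the status codes, and the error messages are hypothetical and are not taken from the Chaos Genius codebase.

```python
from typing import Any, Dict, Optional, Tuple


def validate_anomaly_params(
    payload: Optional[Dict[str, Any]],
) -> Tuple[int, Dict[str, str]]:
    """Hypothetical pre-check: return (status_code, body) for an update request.

    A missing or null anomaly_params yields a 400 response instead of letting
    the null value propagate and fail later as an unhandled server error.
    """
    # The request body itself may be absent or not a JSON object.
    if not isinstance(payload, dict):
        return 400, {"error": "request body must be a JSON object"}

    params = payload.get("anomaly_params")

    # Covers both an explicit null and a missing key.
    if params is None:
        return 400, {"error": "anomaly_params must not be null"}

    # Reject values that are present but of the wrong shape.
    if not isinstance(params, dict):
        return 400, {"error": "anomaly_params must be an object"}

    return 200, {"status": "ok"}
```

Calling `validate_anomaly_params({"anomaly_params": None})` returns a 400 status with an explanatory message, so the caller sees a validation failure rather than a server error.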
A free-to-use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
Updated Jun 4, 2022
Your open source DataOps Infrastructure to manage and centralize all the data tools in your stack, and turn them into your ideal end-to-end data platform
Updated Jun 6, 2022 - Python
A Data Platform built for AWS, powered by Kubernetes.
Updated May 5, 2022 - Python
A No-code workflow executor that runs DAGs defined in a simple YAML format
Updated Jun 6, 2022 - Go
An open source development framework to help you build data workflows and modern data architecture on AWS.
Updated May 31, 2022 - Python
Build, test, deploy, iterate - Dev and prod tool for data science pipelines
Updated Jul 15, 2019 - Python
Predict stock price based on financial news feeds
Updated Apr 6, 2018 - Jupyter Notebook
Apply for a job at Olist's Data Team: https://olist.gupy.io/
Updated Mar 4, 2022
Data engineering interviews Q&A for data community by data community
Updated Jun 7, 2020 - Python
Instant search for and access to many datasets in PySpark.
Updated Nov 5, 2021 - Python
Updated Apr 21, 2022 - Python
Forecasting Solar Power: Analysis of using an LSTM Neural Network
Updated Feb 7, 2020 - Jupyter Notebook
Kedro CLI plugin for generating a static Kedro-Viz site (HTML, CSS, JS) that can be deployed on many serverless tools.
Updated Jun 3, 2022 - Python
A GitHub Action to lint, test, build-docs, package, and run your kedro pipelines. Supports any Python version you'll give it (that is also supported by pyenv).
Updated Jun 16, 2021 - Shell
Dockerizing an Apache Spark Standalone Cluster
Updated Aug 7, 2021 - VBA
Contains the basics (data structures, algorithms, coding interview Q&A, etc.) for data engineers.
Updated Aug 23, 2019 - Jupyter Notebook
The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.
Updated Sep 24, 2021 - Python
Courses and projects on Data Camp
Updated Jun 28, 2020 - Python
Materials for the course Introduction to Data Engineering: data pipelines
Updated Oct 19, 2021 - Python
Provide an easy way with Python to protect your data sources by searching its metadata.
Updated Jun 1, 2022 - Python
Quantum Black Hackathon organised by Analytics Vidya
Updated Jul 23, 2019 - Jupyter Notebook
Project by the 3GTeam group, presented at A3Data's Data Engineering Hackathon in June 2021.
Updated Jun 26, 2021 - Python
Build & Learn Data Engineering, Machine Learning over Kubernetes. No Shortcut approach.
Updated May 15, 2022 - Python
Pipeline validation using Great Expectations library
Updated Jul 25, 2019 - Python
an elegant datasets factory
Updated Mar 10, 2022 - Python
Duke MIDS: Data Engineering and DataOps Course
Updated Apr 8, 2022 - HTML


Let's prepare a mixin for interacting with Roles and Policies through the Python client, in case users want to use the API directly.
Beyond the basic operations (list, get, etc.), it should also provide utility methods, such as updating a default role. It should wrap the following logic: