Here are
253 public repositories
matching this topic...
Efficiently diff rows across two different databases.
Updated
Sep 23, 2022
Python
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
Updated
Sep 25, 2022
TypeScript
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Updated
Sep 22, 2022
Java
ML powered analytics engine for outlier detection and root cause analysis.
Updated
Sep 25, 2022
Python
Your open source DataOps Platform Infrastructure to let you manage all the data tools in your stack in one place, and turn them into your ideal end-to-end data platform
Updated
Sep 23, 2022
Python
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
A Data Platform built for AWS, powered by Kubernetes.
Updated
Sep 19, 2022
Python
An open source development framework to help you build data workflows and modern data architecture on AWS.
Updated
Sep 1, 2022
Python
Build, test, deploy, iterate - Dev and prod tool for data science pipelines
Updated
Jun 21, 2022
Python
Data engineering interviews Q&A for data community by data community
Updated
Jun 7, 2020
Python
Predict stock price based on financial news feeds
Updated
Apr 6, 2018
Jupyter Notebook
Instant search for and access to many datasets in Pyspark.
Updated
Nov 5, 2021
Python
Dockerizing an Apache Spark Standalone Cluster
kedro cli plugin for generating a static kedro viz site (html, css, js) that can be deployed on many serverless tools.
Updated
Jul 20, 2022
Python
Updated
Apr 21, 2022
Python
Forecasting Solar Power: Analysis of using a LSTM Neural Network
Updated
Feb 7, 2020
Jupyter Notebook
Build & Learn Data Engineering,Machine Learning over Kubernetes. No Shortcut approach.
Updated
May 15, 2022
Python
Duke MIDS: Data Engineering and DataOps Course
Updated
Sep 15, 2022
HTML
Updated
Jun 13, 2022
Python
Improve this page
Add a description, image, and links to the
dataengineering
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
dataengineering
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.