A curated list of awesome big data frameworks, ressources and other awesomeness.
-
Updated
May 30, 2023
A curated list of awesome big data frameworks, ressources and other awesomeness.
Greenplum Database - Massively Parallel PostgreSQL for Analytics. An open-source massively parallel data platform for analytics, machine learning and AI.
Privacy and Security focused Segment-alternative, in Golang and React
2019新型冠状病毒疫情时间序列数据仓库 | COVID-19/2019-nCoV Infection Time Series Data Warehouse
Light-weight Python OLAP framework for multi-dimensional data analysis
TensorBase is a new big data warehousing with modern efforts.
Open-source data observability for analytics engineers.
A modern, open source replacement for enterprise data warehouses
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Open source SQL Query Assistant service for Databases/Warehouses
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Personal Data Engineering Projects
Supercharge BigQuery with BigFunctions
DomainMOD is an open source application written in PHP & MySQL used to manage your domains and other internet assets in a central location. DomainMOD also includes a Data Warehouse framework that allows you to import your web server data so that you can view, export, and report on your live data.
Build, run and manage your data pipelines with Python or SQL on any cloud
A powerful open source data warehouse system
Configurable Extract, Transform, and Load
Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
Add a description, image, and links to the data-warehouse topic page so that developers can more easily learn about it.
To associate your repository with the data-warehouse topic, visit your repo's landing page and select "manage topics."