A curated list of awesome big data frameworks, ressources and other awesomeness.
-
Updated
Dec 7, 2022
A curated list of awesome big data frameworks, ressources and other awesomeness.
Greenplum Database - Massively Parallel PostgreSQL for Analytics. An open-source massively parallel data platform for analytics, machine learning and AI.
Privacy and Security focused Segment-alternative, in Golang and React
2019新型冠状病毒疫情时间序列数据仓库 | COVID-19/2019-nCoV Infection Time Series Data Warehouse
Light-weight Python OLAP framework for multi-dimensional data analysis
TensorBase is a new big data warehousing with modern efforts.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Open-source data observability for analytics engineers.
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Open source SQL Query Assistant service for Databases/Warehouses
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
DomainMOD is an open source application written in PHP & MySQL used to manage your domains and other internet assets in a central location. DomainMOD also includes a Data Warehouse framework that allows you to import your web server data so that you can view, export, and report on your live data.
Personal Data Engineering Projects
A powerful open source data warehouse system
Build, run and manage your data pipelines with Python or SQL on any cloud
The open source Snowflake alternative. OLAP Postgres
Supercharge BigQuery with BigFunctions
Configurable Extract, Transform, and Load
A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.
Add a description, image, and links to the data-warehouse topic page so that developers can more easily learn about it.
To associate your repository with the data-warehouse topic, visit your repo's landing page and select "manage topics."