A curated list of awesome big data frameworks, ressources and other awesomeness.
-
Updated
Oct 4, 2023
A curated list of awesome big data frameworks, ressources and other awesomeness.
Greenplum Database - Massively Parallel PostgreSQL for Analytics. An open-source massively parallel data platform for analytics, machine learning and AI.
The data warehouse for operational workloads.
Privacy and Security focused Segment-alternative, in Golang and React
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
2019新型冠状病毒疫情时间序列数据仓库 | COVID-19/2019-nCoV Infection Time Series Data Warehouse
Open-source data observability for analytics engineers.
[NOT MAINTAINED] Light-weight Python OLAP framework for multi-dimensional data analysis
TensorBase is a new big data warehousing with modern efforts.
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Open source SQL Query Assistant service for Databases/Warehouses
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Scratch is an open-source alternative to BigQuery, Redshift, and Snowflake. Runs on Clickhouse.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Personal Data Engineering Projects
Open-source Analytical Data API Framework for data apps. It turns SQL queries into RESTful APIs in no time!
Supercharge BigQuery with BigFunctions
DomainMOD is an open source application written in PHP & MySQL used to manage your domains and other internet assets in a central location. DomainMOD also includes a Data Warehouse framework that allows you to import your web server data so that you can view, export, and report on your live data.
One framework to develop, deploy and operate data workflows with Python and SQL.
Add a description, image, and links to the data-warehouse topic page so that developers can more easily learn about it.
To associate your repository with the data-warehouse topic, visit your repo's landing page and select "manage topics."