#
data-warehousing
Here are 64 public repositories matching this topic...
Working with relational data models in R
-
Updated
Dec 31, 2021 - R
A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.
python
docker
airflow
sql
database
s3
s3-bucket
data-visualization
python3
data-warehouse
metabase
data-engineering
data-analytics
data-analysis
redshift
data-processing
data-cleaning
data-warehousing
data-orchestration
-
Updated
Apr 18, 2020 - Python
This is a top level repository for code examples related to Data Warehousing and Very Large Databases.
-
Updated
Jul 25, 2017 - PLSQL
Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
elasticsearch
sql
kafka
spark
hoodie
data-warehouse
delta
flink
cdc
change-data-capture
iceberg
datalake
debezium
spark-sql
data-warehousing
hudi
delta-lake
deltalake
flink-sql
real-time-data-warehouse
-
Updated
Nov 11, 2021 - Dockerfile
This repository holds the python files and notebooks associated with the Udacity Data Engineering Nanodegree.
aws
airflow
cassandra
aws-s3
postgresql
data-engineering
data-lake
data-modeling
udacity-nanodegree
data-pipeline
data-warehousing
aws-redshift
-
Updated
Dec 9, 2021 - PLpgSQL
is an hypercube of data
table
pivot-tables
data-warehouse
business-intelligence
report
olap
cube
dwh
pivot
data-warehousing
olap-cube
-
Updated
Sep 29, 2021 - JavaScript
An open source enterprise data warehousing and analysis platform.
-
Updated
Nov 8, 2021 - Jinja
-
Updated
May 14, 2020 - Vue
Open-source Twitter collection and archiving tool for tracking specific topics and collecting bulk data.
json
data-mining
uuid
tweets
disk
twitter-api
keyword
data-collection
streaming-api
data-warehousing
arx
collect-tweets
store-tweets
-
Updated
Nov 12, 2018 - Python
Make sense of it all. https://totalhack.github.io/zillion/
-
Updated
Dec 16, 2021 - Python
The CEDS Collaborative Exchange is a repository of code developed by the community that interacts with the CEDS Integration Data Store and the CEDS Elements repositories. All resources provided in this community are considered free and open source.
sql-server
data-warehouse
datawarehousing
relational-databases
relational-database
datawarehouse
data-standards
data-warehousing
education-data
education-database
ceds
education-data-standards
-
Updated
Oct 20, 2021 - TSQL
Modeled for longitudinal storage and reporting of P-20W data, the Common Education Data Standards (CEDS) Data Warehouse implements star schema data warehouse normalization techniques for improved query performance.
sql-server
data-warehouse
data-warehousing
education-data
education-database
ceds
education-data-standards
data-warehouses
-
Updated
Oct 14, 2021 - TSQL
Various Projects on Python related to Data Engineering
-
Updated
Jun 30, 2020 - Jupyter Notebook
Save data from Instagram takeout to a SQLite database
-
Updated
Sep 18, 2021 - Python
Programs for various subjects of Computer Engineering
data-mining
cryptography
algorithms
linkedin
artificial-intelligence
data-structures
computer-engineering
operating-systems
compute-engine
data-warehousing
object-oriented-programming
compiler-construction
big-data-analytics
-
Updated
Aug 5, 2020 - C
Business Intelligence and Data Warehousing Project
etl
business-intelligence
pentaho
data-warehousing
tableau-desktop
extract-transform-load
dimensional-model
entity-relationship-diagram
etl-pipeline
-
Updated
Dec 4, 2019 - TSQL
Starter project for building an ETL pipeline using SSIS in Visual Studio 2019
-
Updated
Oct 13, 2020
-
Updated
Apr 14, 2020 - Python
A data warehouse and business intelligence project on Stock market dataset to answer non-trivial BI queries.
data-visualization
data-warehouse
datascience
business-intelligence
datawarehousing
data-warehousing
tableau-desktop
dataanalytics
stock-datawarehouse
-
Updated
Oct 8, 2020 - R
Zillion Web: A Demo UI and Web API for Zillion
typescript
vue
analytics
warehouse
docker-swarm-mode
data-warehousing
dockerswarm
demo-ui
fastapi
zillion
-
Updated
Dec 22, 2021 - Vue
This course introduces the concept of data warehousing and data integration architecture and explains the role they play in overall business intelligence and analytics strategy of an organization. The course covers predominate architecture design strategies as well as hybrid designs that combine best practices from multiple areas. A key component of instruction is an emphasis on following industry best practices such as adhering to the requirements of an integrated data platform, selecting an appropriate design strategy and the tools to support it, selecting metrics for monitoring performance and data quality, and planning for future enhancements. The course provides hands-on experience in combining structural and design elements with best practices for data governance and coding standards to build a high-level plan for implementing a data warehouse and data integration system for organizations.
-
Updated
Oct 21, 2021
-
Updated
Dec 24, 2017 - Python
This is a flask application that converts an informational model of a decision problem to a snow-flaked star schema
-
Updated
May 5, 2019 - Python
Web Application provides several services for HR(s) and Managers to help them manage all aspects of the workforce in an efficient manner. Provide interfaces for employees to be more involved with Hr and Managers in a transparent mechanism to guarantee integrity environment. Use data warehousing to provide better performance and data analytics.
-
Updated
Aug 18, 2020 - JavaScript
Data Warehousing project | Outsourced for Autumn Group | Retail Outlet DW | Melbourne based | Sep - Feb 2018
-
Updated
Jun 5, 2018 - Batchfile
Data Warehousing for OFBiz and derivatives
-
Updated
Dec 22, 2021 - Groovy
-
Updated
Mar 5, 2020 - HTML
This repository includes the demos and codes I use to play around with Azure Synapse Anayltics
microsoft
python
machine-learning
scala
spark
analytics
azure
data-engineering
powerbi
datawarehouse
synapse
spark-sql
data-warehousing
azure-sql-datawarehouse
sql-data-warehouse
synapse-analytics
azure-synapse-analytics
spark-dotnet
azure-synapse-dwh
mdwh
-
Updated
Dec 30, 2021
A simple pipeline infrastructure with ETL pipeline contained in a Docker environment on Apache Airflow for orchestration and Postgres for data warehousing
python
docker
postgres
airflow
sql
database
etl
data-engineering
data-warehousing
etl-pipeline
data-orchestration
-
Updated
Sep 15, 2021 - Python
Improve this page
Add a description, image, and links to the data-warehousing topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-warehousing topic, visit your repo's landing page and select "manage topics."
Proposed changes
Close related #7513 (replace it with issue number if it exists).
Describe the overview of changes, and introduce why we need it.
Types of changes
What types of changes does your code introduce to Doris?
Put an
xin the boxes that apply