lakehouse
Here are 39 public repositories matching this topic...
Data Lake for Deep Learning. Multi-modal Vector Database for LLMs/LangChain. Store, query, version, & visualize datasets. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
-
Updated
Apr 27, 2023 - Python
StarRocks is a next-gen sub-second MPP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics and ad-hoc query.
-
Updated
Apr 27, 2023 - Java
YTsaurus is a scalable and fault-tolerant open-source big data platform.
-
Updated
Apr 27, 2023 - C++
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
-
Updated
Apr 27, 2023 - Scala
Use SQL to build ELT pipelines on a data lakehouse.
-
Updated
May 25, 2022 - JavaScript
A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
-
Updated
Apr 26, 2023 - Python
Examples of using Terraform to deploy Databricks resources
-
Updated
Apr 26, 2023 - HCL
Lakehouse storage system benchmark
-
Updated
Feb 22, 2023 - Scala
DeltaOMS is a solution that help build a centralized repository of Delta Transaction logs and associated operational metrics/statistics for your Delta Lakehouse
-
Updated
Oct 5, 2022 - Scala
Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data
-
Updated
Dec 7, 2022 - Python
Open source stack lakehouse
-
Updated
Mar 27, 2023 - Python
Repositório dedicado a Workshop de Data Lakehouse com Delta Lake
-
Updated
Dec 6, 2021 - Jupyter Notebook
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
-
Updated
Apr 4, 2023 - SQL
Automated provisioning of an industry Lakehouse with enterprise data model
-
Updated
Jan 4, 2023 - Python
Genomic BigData Warehousing with Apache Spark and LakeHouse Architecture
-
Updated
Jan 19, 2023 - Jupyter Notebook
Improve this page
Add a description, image, and links to the lakehouse topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the lakehouse topic, visit your repo's landing page and select "manage topics."