Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
-
Updated
Aug 4, 2023 - Python
Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
An orchestration platform for the development, production, and observation of data assets.
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
The open source high performance data integration platform built for developers.
Upserts, Deletes And Incremental Processing on Big Data.
Kestra is an infinitely scalable orchestration and scheduling platform, creating, running, scheduling, and monitoring millions of complex pipelines.
Privacy and Security focused Segment-alternative, in Golang and React
Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days
Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Composable unified data streaming platform powered by Rust and Web Assembly
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
汇总Apache Hudi相关资料
Fast, sensitive and accurate integration of single-cell data with Harmony
NicheNet: predict active ligand-target links between interacting cells
Conduit streams data between data stores. Kafka Connect replacement. No JVM required.
Add a description, image, and links to the data-integration topic page so that developers can more easily learn about it.
To associate your repository with the data-integration topic, visit your repo's landing page and select "manage topics."