Here are 87 public repositories matching this topic...
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
Updated Sep 27, 2023 · Java
Replace Splunk in your small company with this one weird trick!
Updated Sep 29, 2023 · Python
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Updated Jan 24, 2023 · Scala
Data Prepper is a component of the OpenSearch project that accepts, filters, transforms, enriches, and routes data at scale.
Updated Sep 29, 2023 · Java
Apache Spark examples exclusively in Java
Updated Apr 21, 2023 · Java
Use any public GitHub repository as a source and ask questions through ChatGPT about it
Updated Sep 28, 2023 · TypeScript
A free, open-source, web-based self-service BI tool tailor-made for ClickHouse, Google BigQuery, MySQL, PostgreSQL, and Vertica
Updated Sep 27, 2023 · Scala
Service for bulk-loading data to databases with automatic schema management (Redshift, Snowflake, BigQuery, ClickHouse, Postgres, MySQL)
IBIS is a workflow creation-engine that abstracts the Hadoop internals of ingesting RDBMS data.
Updated Apr 13, 2022 · Python
Extensible streaming ingestion pipeline on top of Apache Spark
Updated Jun 20, 2023 · Scala
Media Management System: ingestion, processing, encoding, delivery, ...
Updated Aug 24, 2020 · Haskell
💰 A bot for maximizing the borrow subreddit
Updated Feb 13, 2017 · JavaScript
A simple demo application for uploading, ingesting, and embedding videos and converting them to MP4s. From api.video (https://api.video)
Updated Dec 20, 2022 · JavaScript
Spark in Action, 2e - chapter 9 - Advanced ingestion: finding data sources and building your own
Updated Apr 21, 2023 · Java
Parallel Streaming Transformation Loader
Updated Apr 23, 2019 · Java
Periodically ingest incremental updates (inserts / deletes) into BigQuery using Cloud Composer / Airflow orchestration workflow
Updated Dec 12, 2019 · Python
👥 [WIP] An experimental Highly Available Reverse Proxy for Massive Asynchronous Message Consumption
Updated Jan 23, 2023 · Python
tagbase-server is a data management web service for working with eTUFF and nc-eTAG files.
Updated Sep 28, 2023 · Python