#
data-integration
Here are 227 public repositories matching this topic...
Privacy and Security focused Segment-alternative, in Golang and React
react
go
golang
security
privacy
data-warehouse
data-integration
hacktoberfest
warehouse-management
data-synchronization
hybrid-cloud
customer-data
rudder
customer-data-platform
rudder-labs
segment-alternative
rudderstack
customer-data-pipeline
customer-data-lake
warehouse-first
-
Updated
Oct 10, 2021 - Go
Upserts, Deletes And Incremental Processing on Big Data.
bigdata
stream-processing
data-integration
datalake
apachespark
hudi
apachehudi
incremental-processing
apacheflink
-
Updated
Oct 10, 2021 - Java
Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
python
bioinformatics
analysis
clustering
gene-expression
data-visualization
dimensionality-reduction
awesome-list
data-integration
atac-seq
single-cell
rna-seq-data
scrna-seq-data
cell-cycle
cell-differentiation
gene-expression-profiles
analysis-pipeline
cell-populations
rna-seq-experiments
cell-clusters
-
Updated
Sep 13, 2021
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
-
Updated
Oct 1, 2021 - Python
Jitsu is an open-source data collection platform
golang
bigquery
postgres
clickhouse
snowflake
data-integration
data-collection
redshift
data-connectors
-
Updated
Oct 10, 2021 - TypeScript
Fast, sensitive and accurate integration of single-cell data with Harmony
-
Updated
Sep 6, 2021 - R
Use SQL to build ELT pipelines on a data lakehouse.
sql
apache-spark
etl
pipelines
data-engineering
data-lake
data-transfer
delta
data-integration
upsert
elt
data-pipeline
datalake
data-ingestion
spark-sql
zeppelin-notebook
apache-iceberg
lakehouse
incremental-updates
-
Updated
Sep 16, 2021 - JavaScript
wisnesky
commented
Sep 11, 2019
Complete algorithm: http://web.cecs.pdx.edu/~mpj/pubs/polyrec.html
NicheNet: predict active ligand-target links between interacting cells
rna-seq
gene-expression
network-inference
data-integration
single-cell-rna-seq
single-cell-omics
intercellular-communication
ligand-receptor
ligand-target
-
Updated
Oct 5, 2021 - R
汇总Apache Hudi相关资料
bigdata
apache
stream-processing
data-integration
datalake
hudi
apachehudi
incremental-processing
hudi-resources
-
Updated
Oct 9, 2021
An example mini data warehouse for python project stats, template for new projects
-
Updated
Jul 21, 2020 - Python
mattigrthr
commented
Aug 17, 2021
We should ensure consistent naming of variables/features across the pipelines and modules. There are sometimes inconsistencies between camel case and snake case due to Python, JS, and JSON standards. We can achieve this by using a central key-value store for example.
Hetionet: an integrative network of disease
-
Updated
Dec 10, 2020 - HTML
Query to reference mapping for single-cell genomics
-
Updated
Sep 21, 2021 - Jupyter Notebook
scikit-fusion: Data fusion via collective latent factor models
-
Updated
May 27, 2021 - Python
WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing, schema matching, identity resolution, data fusion, and result evaluation.
-
Updated
Apr 28, 2021 - Java
The Common Core Ontology Repository holds the current released version of the Common Core Ontology suite.
semantics
interoperability
data-integration
ontologies
owl-ontology
bfo
cco
applied-ontology
semantic-consistency
ontology-suite
-
Updated
Oct 6, 2021
-
Updated
Feb 10, 2021 - Python
An Efficient RML-Compliant Engine for Knowledge Graph Construction
-
Updated
Oct 5, 2021 - Python
an data-centric integration platform
-
Updated
Aug 9, 2021 - Java
A .NET class library that allows you to import data from different sources into a unified destination
mysql
html
json
csv
sql-server
csharp
xml
sqlite
excel
tabular-data
data-import
oracle
databases
powerpoint
vcard
data-integration
schema-matching
sqlce
schema-mapping
msaccess
-
Updated
Dec 8, 2020 - C#
First Party data integration solution built for marketing teams to enable audience and conversion onboarding into Google Marketing products (Google Ads, Campaign Manager, Google Analytics).
python
bigquery
google
conversions
dataflow
data-integration
googleanalytics
googleads
audiences
customermatch
audience-targeting
-
Updated
Sep 27, 2021 - Python
Utilities for creating ETL pipelines with mara
-
Updated
Jul 7, 2020 - PLpgSQL
Toolbox for including enzyme constraints on a genome-scale model.
-
Updated
Aug 31, 2021 - MATLAB
Installer for Thymeflow, a personal knowledge management system.
-
Updated
Apr 17, 2018
Repo for Data Warehouse Concepts, Design, and Data Integration by University of Colorado System (coursera)(Notes,Assignments, quiz and research papers)
-
Updated
Jun 2, 2018
simulation
systems-biology
computational-biology
data-integration
curation
biological-expression-language
biocuration
knowledge-graph-embeddings
networks-biology
knowledge-graphs
-
Updated
Dec 16, 2019 - TeX
Research data management in biomedical and machine learning applications
python
workflow
machine-learning
automation
datastructures
neuroscience
pandas
datascience
data-structures
data-integration
neuroimaging
neuroinformatics
biomedical
userfriendly
medical-data
machine-learning-workflows
-
Updated
Aug 12, 2021 - Python
Improve this page
Add a description, image, and links to the data-integration topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-integration topic, visit your repo's landing page and select "manage topics."
Tell us about the problem you're trying to solve
currently we only support slack as a channel for reporting failed jobs. another reasonable one might be datadog.