#
data-integration
Here are 209 public repositories matching this topic...
Upserts, Deletes And Incremental Processing on Big Data.
bigdata
stream-processing
data-integration
datalake
apachespark
hudi
apachehudi
incremental-processing
apacheflink
-
Updated
Apr 12, 2021 - Java
Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
python
bioinformatics
analysis
clustering
gene-expression
data-visualization
dimensionality-reduction
awesome-list
data-integration
atac-seq
single-cell
rna-seq-data
scrna-seq-data
cell-cycle
cell-differentiation
gene-expression-profiles
analysis-pipeline
cell-populations
rna-seq-experiments
cell-clusters
-
Updated
Apr 9, 2021
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
-
Updated
Mar 8, 2021 - Python
xtreding
commented
Apr 6, 2021
Problem
At present Jitsu tracks, the only server starts/stops events and events from JS/API. We should improve telemetry for understanding how users use our product. There should be additional telemetry in the jitsucom/server as well as in jitsucom/configurarator frontend. We should understand:
- What types of sources/destinations are used;
- From what source to what destination event
Fast, sensitive and accurate integration of single-cell data with Harmony
-
Updated
Mar 28, 2021 - R
wisnesky
commented
Sep 11, 2019
Complete algorithm: http://web.cecs.pdx.edu/~mpj/pubs/polyrec.html
An example mini data warehouse for python project stats, template for new projects
-
Updated
Jul 21, 2020 - Python
NicheNet: predict active ligand-target links between interacting cells
rna-seq
gene-expression
network-inference
data-integration
single-cell-rna-seq
single-cell-omics
intercellular-communication
ligand-receptor
ligand-target
-
Updated
Mar 17, 2021 - R
Hetionet: an integrative network of disease
-
Updated
Dec 10, 2020 - HTML
scikit-fusion: Data fusion via collective latent factor models
-
Updated
Feb 14, 2020 - Python
汇总Apache Hudi相关资料
bigdata
apache
stream-processing
data-integration
datalake
hudi
apachehudi
incremental-processing
hudi-resources
-
Updated
Apr 11, 2021
WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing, schema matching, identity resolution, data fusion, and result evaluation.
-
Updated
Mar 26, 2021 - Java
The Common Core Ontology Repository holds the current released version of the Common Core Ontology suite.
semantics
interoperability
data-integration
ontologies
owl-ontology
bfo
cco
applied-ontology
semantic-consistency
ontology-suite
-
Updated
Mar 15, 2021
Query to reference mapping for single-cell genomics
-
Updated
Apr 7, 2021 - Jupyter Notebook
-
Updated
Feb 10, 2021 - Python
an data-centric integration platform
-
Updated
Mar 18, 2021 - Java
An Efficient RML-Compliant Engine for Knowledge Graph Construction
-
Updated
Mar 30, 2021 - Python
A .NET class library that allows you to import data from different sources into a unified destination
mysql
html
json
csv
sql-server
csharp
xml
sqlite
excel
tabular-data
data-import
oracle
databases
powerpoint
vcard
data-integration
schema-matching
sqlce
schema-mapping
msaccess
-
Updated
Dec 8, 2020 - C#
Utilities for creating ETL pipelines with mara
-
Updated
Jul 7, 2020 - PLpgSQL
Toolbox for including enzyme constraints on a genome-scale model.
-
Updated
Mar 31, 2021 - MATLAB
Installer for Thymeflow, a personal knowledge management system.
-
Updated
Apr 17, 2018
Research data management in biomedical and machine learning applications
python
workflow
machine-learning
automation
datastructures
neuroscience
pandas
datascience
data-structures
data-integration
neuroimaging
neuroinformatics
biomedical
userfriendly
medical-data
machine-learning-workflows
-
Updated
Apr 2, 2021 - Python
simulation
systems-biology
computational-biology
data-integration
curation
biological-expression-language
biocuration
knowledge-graph-embeddings
networks-biology
knowledge-graphs
-
Updated
Dec 16, 2019 - TeX
Repo for Data Warehouse Concepts, Design, and Data Integration by University of Colorado System (coursera)(Notes,Assignments, quiz and research papers)
-
Updated
Jun 2, 2018
Scripts and resources to create Hetionet v1.0, a heterogeneous network for drug repurposing
-
Updated
Sep 22, 2017 - Jupyter Notebook
Some of the projects i made when starting to learn R for Data Science at the university
-
Updated
Jul 9, 2019 - R
Development of the Gellish Communicator reference application and tools for universal data exchange and data integration supporting Formal English and other Gellish formalized natural languages.
nlp
taxonomies
natural-language-processing
databases
ontology
interoperability
classification
data-integration
family
data-management
reference-implementation
query-language
data-modeling
formal-languages
data-exchange
knowledge-representation
universal-interface
knowledge-modeling
interoperability-of-systems
gellish
formalized-natural-language
-
Updated
Nov 12, 2018 - Python
First Party data integration solution built for marketing teams to enable audience and conversion onboarding into Google Marketing products (Google Ads, Campaign Manager, Google Analytics).
python
bigquery
google
conversions
dataflow
data-integration
googleanalytics
googleads
audiences
customermatch
audience-targeting
-
Updated
Apr 8, 2021 - Python
Improve this page
Add a description, image, and links to the data-integration topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-integration topic, visit your repo's landing page and select "manage topics."
We have a lot of tap/target naming conventions left from early on. This would move us closer to using uniform naming schemes.