Collect, aggregate, and visualize a data ecosystem's metadata
Updated Mar 11, 2023 - Java
Open-source data observability for analytics engineers.
First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
Egeria core
SQL Lineage Analysis Tool powered by Python
Generate and Visualize Data Lineage from query history
This Apache Atlas build is compiled from the latest release source tarball and patched to run in a Docker container.
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
Egeria's Guidance on Governance as well as large media files such as presentations and movies
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
A boilerplate solution for processing image and PDF documents for regulated industries, with lineage and pipeline operations metadata services.
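Data-Export supports exporting on-chain data to storage media such as MySQL and ES that are convenient for big data processing, addressing the problems of complex querying, analysis, visualization, and processing of blockchain data.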
Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow
Data catalog for everything in your company
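Data-Stash is a data warehouse component based on FISCO-BCOS. By parsing a node's binlog, it generates a full backup of that node's state, allowing the node to separate hot and cold data and to prune data.
Data-Reconcile is a blockchain-based reconciliation component. It provides a general-purpose data reconciliation solution built on blockchain smart contract ledgers, along with a dynamically extensible reconciliation framework that supports custom development.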
Guide to data platforms and tools
An end-to-end data lineage tool that detects table dependencies from SQL statements.
Open-source metadata collector based on ODD Specification
An open-source dataworks platform