# data-centric

Here are 32 public repositories matching this topic...

Data Lake for Deep Learning. Multi-modal Vector Database for LLMs/LangChain. Store, query, version, & visualize datasets. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

  • Updated Apr 15, 2023
  • Python

Modern columnar data format for ML implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, and PyArrow, with more integrations coming.

  • Updated Apr 15, 2023
  • Rust

The open source active learning toolkit to find failure modes in your computer vision models, prioritize data to label next, and drive data curation to improve model performance.

  • Updated Apr 14, 2023
  • Python
