Skip to content
#

data-centric

Here are 32 public repositories matching this topic...

AI Vector Database for LLMs/LangChain. Doubles as a Data Lake for Deep Learning. Store, query, version, & visualize any data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

  • Updated May 2, 2023
  • Python

Modern columnar data format for ML implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..

  • Updated May 2, 2023
  • Rust

The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling to supercharge model performance.

  • Updated May 2, 2023
  • Python

Improve this page

Add a description, image, and links to the data-centric topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-centric topic, visit your repo's landing page and select "manage topics."

Learn more