A system for quickly generating training data with weak supervision
-
Updated
Nov 11, 2022 - Python
A system for quickly generating training data with weak supervision
Training Data (Annotation, Catalog, Workflow) for all Data Types (Image, Video, 3D, Text, Geo, Audio, more) at scale.
skweak: A software toolkit for weak supervision applied to NLP tasks
Synthetic structured data generators
Computer vision based ML training data generation tool
A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.
Web application for image labeling and segmentation
Pure Python, lightweight, Pillow-based solver for Amazon's text captcha.
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
A lightweight web application for brushing labels onto time series data; useful for building training sets.
Augmenty is an augmentation library based on spaCy for augmenting texts.
Natural Language Data Augmentation Tool for Conversational Systems
Collection of casual conversations that can be used with the Rasa Stack
Generating training data from the Carla driving simulator in the KITTI dataset format
Aubo i5 Dual Arm Collaborative Robot - RealSense D435 - 3D Object Pose Estimation - ROS
COVID-19 Coughs files for training AI models
Data Programming by Demonstration (DPBD) for Document Classification
A repository of NSFW images to be used for machine learning/image classification purposes
Add a description, image, and links to the training-data topic page so that developers can more easily learn about it.
To associate your repository with the training-data topic, visit your repo's landing page and select "manage topics."