A system for quickly generating training data with weak supervision
python
data-science
machine-learning
ai
weak-supervision
snorkel
labeling
data-augmentation
training-data
data-slicing
-
Updated
Aug 9, 2020 - Python
We should allow data augmentation using masked-token prediction models through
WordSwapMaskedLM. This would leverage the power of transformers like BERT to generate augmented inputs.