FastDup is a tool for gaining insights from a large image collection. It can find anomalies, duplicate and near duplicate images, clusters of similaritity, learn the normal behavior and temporal interactions between images. It can be used for smart subsampling of a higher quality dataset, outlier removal, novelty detection of new information to be sent for tagging. FastDup scales to millions of images running on CPU only.
python
machine-learning
image
deep-learning
image-processing
similarity
kaggle
dataset
image-classification
outlier-detection
object-detection
visual-search
data-augmentation
data-curation
visualization-tools
image-duplicate-detection
novelty-detection
image-classfication
-
Updated
Nov 2, 2022 - Python