Here are 276 public repositories matching this topic...
Fast, secure, efficient backup program
Deduplicating archiver with compression and authenticated encryption.
Updated Aug 28, 2022 · Python
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Updated Aug 7, 2022 · Python
Cross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.
Extremely fast tool to remove duplicates and other lint from your filesystem
Simple, configuration-driven backup software for servers and workstations
Updated Aug 25, 2022 · Python
A powerful duplicate file finder and an enhanced fork of 'fdupes'.
A fast high compression read-only file system
Data deduplication engine, supporting optional compression and public key encryption.
Updated Aug 25, 2022 · Rust
A powerful and modular toolkit for record linkage and duplicate detection in Python
Updated Apr 19, 2022 · Python
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Updated May 5, 2021 · JavaScript
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Updated Aug 25, 2022 · Java
Config-driven, easy backup CLI for restic.
A list of free data matching and record linkage software.
Fast, accurate and scalable probabilistic data linkage using your choice of SQL backend
Updated Aug 27, 2022 · Python
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
Updated Jun 7, 2020 · Python
A pair of kernel modules which provide pools of deduplicated and/or compressed block storage.
A collection of ready-made SQL queries for PostgreSQL covering common tasks (retrieving and modifying data, speeding up queries, database maintenance)
Updated Aug 23, 2022 · PLpgSQL