Skip to content
@dedupeio

Dedupe.io

De-duplicate and find matches in your Excel spreadsheet or database

Pinned

  1. dedupe Public

    🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

    Python 3.8k 528

  2. csvdedupe Public

    🆔 Command line tool for deduplicating CSV files

    Python 394 82

  3. 🆔 Examples for using the dedupe library

    Python 385 216

  4. affinegap Public

    📐 A Cython implementation of the affine gap string distance

    Cython 57 9

  5. pyhacrf Public

    Forked from dirko/pyhacrf

    📐 Hidden alignment conditional random field for classifying string pairs.

    Python 24 11

  6. 🔉 Python wrapper for a C++ Double Metaphone

    C++ 12 6

Repositories

Showing 10 of 31 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…