Skip to content
@dedupeio

Dedupe.io

De-duplicate and find matches in your Excel spreadsheet or database

Pinned

  1. dedupe Public

    🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

    Python 3.4k 475

  2. csvdedupe Public

    🆔 Command line tool for deduplicating CSV files

    Python 349 78

  3. 🆔 Examples for using the dedupe library

    Python 317 200

  4. affinegap Public

    📐 A Cython implementation of the affine gap string distance

    Cython 52 7

  5. pyhacrf Public

    Forked from dirko/pyhacrf

    📐 Hidden alignment conditional random field for classifying string pairs.

    Python 24 9

  6. 🔉 Python wrapper for a C++ Double Metaphone

    C++ 10 6

Repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…