Dedupe.io
- Chicago. IL
- https://dedupe.io/
- dedupe@datamade.us
Grow your team on GitHub
GitHub is home to over 50 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Sign up
Pinned repositories
Repositories
-
dedupe
🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution. -
-
dedupe-variable-ilcs
Dedupe variable for Illinois Compiled Statute (ILCS) codes
-
dedupeio-web-api-docs
Dedupe.io web API allows for matching and training against projects using a standard RESTful framework.
-
-
pyhacrf
Forked from dirko/pyhacrf📐 Hidden alignment conditional random field for classifying string pairs. -
dedupe-examples
🆔 Examples for using the dedupe library -
-
dedupe-variable-number
Try to cast strings to numbers, then compare
-
-
-
-
categorical-distance
📐 Compare categorical variables -
datetime-distance
📐 Compare dates and times -
fuzzycategory
📐 Fuzzy Categorical Distances -
-
-
soft-tfidf
Mispelling tolerant tf-idf similarity metric
-
learned-string-alignments
Forked from iesl/learned-string-alignmentsLearning String Alignments for Entity Aliases
-
dedupe-variable-datetime
DateTime variable for dedupe
-
dedupe-geocoder
📍 Demonstration of how dedupe might be used as geocoder -
dedupe-vowpal
Vowpal Wabbit Active Labeler for Dedupe
-
dedupe-variable-person
Dedupe variable for person names. just people. no companies.
-
address-matching
Python script for matching a list of messy addresses against a gazetteer using dedupe.
-
-
-
affinegap
📐 A Cython implementation of the affine gap string distance -
csvdedupe
🆔 Command line tool for deduplicating CSV files -
Levenshtein_search
Forked from mattandahalfew/Levenshtein_searchPython search module for fast approximate string matching
Most used topics
People
This organization has no public members. You must be a member to see who’s a part of this organization.