Here are
100 public repositories
matching this topic...
🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Updated
Dec 20, 2021
Python
A toolkit for record linkage and duplicate detection in Python
Updated
Dec 10, 2021
Python
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
Updated
Dec 22, 2021
Python
Scalable data mastering, deduplication and entity resolution.
Updated
Dec 23, 2021
Java
🆔 Command line tool for deduplicating CSV files
Updated
Mar 31, 2020
Python
🆔 Examples for using the dedupe library
Updated
Jun 17, 2021
Python
Recent trends of Entity Linking, Disambiguation, and Representation.
A list of free data matching and record linkage software.
An open source, high scalability toolkit in Java for Entity Resolution.
Updated
Dec 13, 2021
Java
Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
Updated
Dec 23, 2021
Roff
Entity resolution for Elasticsearch.
Updated
Dec 14, 2021
Java
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).
Updated
Oct 14, 2021
Python
OpenRefine reconciliation services for VIAF, ORCID, and Open Library + framework for creating more.
Updated
Nov 18, 2020
Java
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
Updated
Jul 16, 2021
Jupyter Notebook
Record Linkage ToolKit (Find and link entities)
Updated
Dec 13, 2021
Python
Link Wikidata items to large catalogs
Updated
Dec 10, 2021
Python
Resources for tackling record linkage / deduplication / data matching problems
Python implementation of anonymous linkage using cryptographic linkage keys
Updated
Dec 23, 2021
Python
SparkER: an Entity Resolution framework for Apache Spark
Updated
Dec 9, 2021
Scala
Distributed Bayesian Entity Resolution in Apache Spark
Updated
Jun 10, 2021
Scala
Learning String Alignments for Entity Aliases
Updated
Mar 21, 2019
Python
Merge Dirty Data with Clean Reference Tables
Updated
Aug 3, 2021
Python
ReCiter: an enterprise open source author disambiguation system for academic institutions
Updated
Dec 22, 2021
Java
Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.
Updated
Dec 21, 2021
Python
A browser user interface for manual labeling of record pairs.
Updated
Oct 6, 2021
JavaScript
Learned string similarity for entity names using optimal transport.
Updated
Nov 17, 2020
Python
Fork of the Freely Extensible Biomedical Record Linkage program
Updated
Nov 4, 2016
Python
Welcome to Snowman App – a Data Matching Benchmark Platform.
Updated
Nov 14, 2021
TypeScript
WhatIs.this: simple entity resolution through Wikipedia
A maximum-strength name parser for record linkage.
Updated
Oct 19, 2021
Python
Improve this page
Add a description, image, and links to the
entity-resolution
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
entity-resolution
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.