Here are
105 public repositories
matching this topic...
🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Updated
Jun 6, 2022
Python
A powerful and modular toolkit for record linkage and duplicate detection in Python
Updated
Apr 19, 2022
Python
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
Updated
Jun 6, 2022
Python
🆔 Command line tool for deduplicating CSV files
Updated
Mar 31, 2020
Python
🆔 Examples for using the dedupe library
Updated
Jan 19, 2022
Python
Recent trends of Entity Linking, Disambiguation, and Representation.
A list of free data matching and record linkage software.
Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
An open source, high scalability toolkit in Java for Entity Resolution.
Updated
May 25, 2022
Java
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).
Updated
May 27, 2022
Python
Entity resolution for Elasticsearch.
Updated
May 16, 2022
Java
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
Updated
Apr 26, 2022
Jupyter Notebook
OpenRefine reconciliation services for VIAF, ORCID, and Open Library + framework for creating more.
Updated
Nov 18, 2020
Java
Record Linkage ToolKit (Find and link entities)
Updated
Dec 13, 2021
Python
Link Wikidata items to large catalogs
Updated
Dec 10, 2021
Python
Resources for tackling record linkage / deduplication / data matching problems
SparkER: an Entity Resolution framework for Apache Spark
Updated
May 18, 2022
Scala
Python implementation of anonymous linkage using cryptographic linkage keys
Updated
Jun 3, 2022
Python
Distributed Bayesian Entity Resolution in Apache Spark
Updated
Jun 10, 2021
Scala
Learning String Alignments for Entity Aliases
Updated
Mar 21, 2019
Python
Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.
Updated
Jun 3, 2022
Python
Merge Dirty Data with Clean Reference Tables
Updated
Aug 3, 2021
Python
ReCiter: an enterprise open source author disambiguation system for academic institutions
Learned string similarity for entity names using optimal transport.
Updated
Nov 17, 2020
Python
A browser user interface for manual labeling of record pairs.
Updated
Jun 1, 2022
JavaScript
Welcome to Snowman App – a Data Matching Benchmark Platform.
Updated
Jun 1, 2022
TypeScript
Fork of the Freely Extensible Biomedical Record Linkage program
Updated
Nov 4, 2016
Python
WhatIs.this: simple entity resolution through Wikipedia
A maximum-strength name parser for record linkage.
Updated
Oct 19, 2021
Python
Improve this page
Add a description, image, and links to the
entity-resolution
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
entity-resolution
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.