-
Updated
Dec 30, 2021 - Python
entity-extraction
Here are 137 public repositories matching this topic...
-
Updated
Dec 31, 2021 - JavaScript
-
Updated
Jan 1, 2022 - Scala
-
Updated
Dec 19, 2021 - Python
-
Updated
Jan 2, 2022 - C#
-
Updated
Jun 1, 2020 - Python
-
Updated
Mar 8, 2018 - Scala
-
Updated
Jul 25, 2019 - Python
-
Updated
Jun 3, 2020 - Python
-
Updated
May 14, 2019 - Python
-
Updated
Dec 24, 2021 - Java
-
Updated
Dec 26, 2021 - Python
-
Updated
Oct 14, 2021 - Python
-
Updated
Jul 7, 2017 - Python
-
Updated
Dec 7, 2021 - Python
-
Updated
Oct 21, 2020 - Python
Users misspell things. Having spell-check and synonyms helps a lot, but doesn't catch everything.
One solution would be to use the python metaphone package's implementation of the Double Metaphone algorithm.
At component train time, it could look at the normal entity lists, find the DM representation of all the synonyms, and store them.
-
Updated
Aug 30, 2016 - JavaScript
-
Updated
Oct 26, 2021 - HTML
-
Updated
Aug 15, 2021 - Python
[Updated after reading sotera.github.io/newman/features].
In the "Top Addresses" screenshot below, jeb@jeb.org shows 79% in the donut plot and 0.988 in the bar plot.

I thought the 0.988 was a proportion -- w
-
Updated
Oct 7, 2021 - Python
-
Updated
Oct 25, 2021 - Python
-
Updated
Sep 19, 2020 - PHP
I have mostly tested htmldate on a set of English, German and French web pages I had run into by surfing or during web crawls. There are definitely further web pages and cases in other languages for which the extraction of a date doesn't work so far.
Please install the dateparser library beforehand as it significantly extends linguistic coverage: pipor pip3 install -U dateparser or `pi
-
Updated
Sep 2, 2019 - Python
-
Updated
Dec 15, 2021 - Python
Improve this page
Add a description, image, and links to the entity-extraction topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the entity-extraction topic, visit your repo's landing page and select "manage topics."