text-mining

Here are 1,223 public repositories matching this topic...

texthero
henrifroese commented Sep 3, 2020

Lemmatization can be thought of as a more advanced form of stemming, which we already have in the preprocessing module. You can read about it e.g. here. The implementation should be done with spaCy.

ToDo

Implement a function hero.lemmatize(s: TokenSeries) (or maybe rather TextSeries?). Using spaCy, this should be fairly straightforward. It should go
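A minimal sketch of what such a function could look like, not texthero's actual implementation. It assumes plain strings as input rather than a TokenSeries or TextSeries, and the helper name load_spacy_pipeline and the model name en_core_web_sm are illustrative assumptions; the model has to be installed separately.

```python
from typing import Iterable, List


def load_spacy_pipeline(model: str = "en_core_web_sm"):
    """Load a spaCy pipeline (hypothetical helper; the model name is an assumption)."""
    import spacy  # imported lazily so lemmatize() itself has no hard dependency

    # The dependency parser and NER components are not needed to obtain lemmas.
    return spacy.load(model, disable=["parser", "ner"])


def lemmatize(texts: Iterable[str], nlp) -> List[str]:
    """Return each text with every token replaced by its lemma.

    `nlp` is any object with a spaCy-style `pipe(texts)` method that yields
    documents whose tokens expose a `lemma_` attribute.
    """
    return [" ".join(token.lemma_ for token in doc) for doc in nlp.pipe(texts)]


# Usage (requires spaCy and the model to be installed):
#   nlp = load_spacy_pipeline()
#   lemmas = lemmatize(["The cats were running"], nlp)
```

Passing the pipeline in as an argument keeps model loading out of the hot path and lets callers batch texts through nlp.pipe. For a pandas Series s, the result could be wrapped back into a Series with the same index, e.g. pd.Series(lemmatize(s, nlp), index=s.index).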

Open Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)

  • Updated Jul 31, 2020
  • Dockerfile
