A contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (BERT, Universal Sentence Encoder, Flair)
python
cuda
embeddings
summarization
summarizer
semantic-search
extractive-summarization
flair
universal-sentence-encoder
token-level-summarization
semantic-summarization
contextual-summarization
debate-evidence
Updated May 21, 2021 - Python
Users misspell things. Spell-check and synonyms help a lot, but they don't catch everything.
One solution would be to use the Python metaphone package's implementation of the Double Metaphone algorithm.
At component train time, it could look at the normal entity lists, compute the Double Metaphone (DM) representation of every synonym, and store those codes.
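
A minimal sketch of that train-time step follows. It assumes the metaphone package exposes `doublemetaphone(word)` returning a `(primary, secondary)` tuple of codes. The names `build_phonetic_index`, `lookup`, `load_double_metaphone`, and `crude_encoder` are hypothetical and not part of any existing component. The lookup step is an assumption about how the stored codes might be used later, and `crude_encoder` is only a stand-in so the sketch runs without the package installed; it is not Double Metaphone.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, Set, Tuple

# An encoder maps a string to one or more phonetic codes.
Encoder = Callable[[str], Tuple[str, ...]]


def load_double_metaphone() -> Encoder:
    """Return doublemetaphone from the metaphone package (pip install metaphone).

    Assumption: the function returns a (primary, secondary) tuple of codes.
    """
    from metaphone import doublemetaphone
    return doublemetaphone


def crude_encoder(word: str) -> Tuple[str, ...]:
    """Stand-in for demos only, NOT Double Metaphone.

    Keeps the first letter, drops later vowels (including Y), and
    collapses adjacent repeated letters.
    """
    letters = [c for c in word.upper() if c.isalpha()]
    if not letters:
        return ("",)
    out = [letters[0]]
    for c in letters[1:]:
        if c in "AEIOUY":
            continue
        if c != out[-1]:
            out.append(c)
    return ("".join(out),)


def build_phonetic_index(
    entity_synonyms: Dict[str, Iterable[str]], encode: Encoder
) -> Dict[str, Set[str]]:
    """Train-time step: map each phonetic code to the entities it may refer to."""
    index: Dict[str, Set[str]] = defaultdict(set)
    for entity, synonyms in entity_synonyms.items():
        for synonym in synonyms:
            for code in encode(synonym):
                if code:  # skip empty codes (e.g. no secondary encoding)
                    index[code].add(entity)
    return dict(index)


def lookup(token: str, index: Dict[str, Set[str]], encode: Encoder) -> Set[str]:
    """Return the entities whose stored codes match any code of the token."""
    matches: Set[str] = set()
    for code in encode(token):
        if code:
            matches |= index.get(code, set())
    return matches


# Hypothetical entity list: canonical value -> synonyms.
ENTITIES = {
    "aspirin": ["aspirin", "ASA"],
    "acetaminophen": ["tylenol", "paracetamol"],
}

# Swap crude_encoder for load_double_metaphone() when the package is installed.
index = build_phonetic_index(ENTITIES, crude_encoder)
print(lookup("asprin", index, crude_encoder))
```

Passing the encoder in as a parameter keeps the index-building logic independent of the phonetic algorithm, so the real Double Metaphone function can replace the stand-in without other changes. Each synonym is encoded as a whole string here; multi-word synonyms might need per-token handling.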