Skip to content
#

language-detection

Here are 170 public repositories matching this topic...

bzz
bzz commented Jan 9, 2019

To assist debugging in dev mode, it would be nice to have some visibility into the decision-making logic that Enry uses at runtime.

Problem: after getting a final prediction e.g though enry.GetLanguage() it's very hard to tell:

  • what strategies were used
  • what suggestions each strategy made
  • what was the winning strategy

Such introspection would simplify maintenance and reduce the

beowulfenator
beowulfenator commented Mar 29, 2019

When trying to attach language detector to multiple instances of spaCy, it fails. Example source code:

import spacy
from spacy_cld import LanguageDetector

nlp1 = spacy.load('en_core_web_sm')
language_detector1 = LanguageDetector()
nlp1.add_pipe(language_detector1)

nlp2 = spacy.load('en_core_web_md')
language_detector2 = LanguageDetector()
nlp2.add_pipe(language_detector2)
charset_normalizer

专注于解决自然语言处理领域的几个核心问题:词法分析,句法分析,语义分析,语种检测,信息抽取,文本聚类和文本分类. 为相关领域的研发人员提供完整的通用设计与参考实现. 涵盖了多种自然语言处理算法,适配了多个自然语言处理框架. 兼容Lucene/Solr/ElasticSearch插件.

  • Updated May 19, 2020
  • Java
donnydongchen
donnydongchen commented Apr 19, 2020

Hello there,

I'm new to the spaCy universe. I've encountered this issue but it could be something I did wrong.

How to reproduce the behavior

import spacy
from spacy_langdetect import LanguageDetector

nlp = spacy.load('en_core_web_sm')
language_detector = LanguageDetector()
nlp.add_pipe(nlp.create_pipe('sentencizer'))
nlp.add_pipe(language_detector, name='language_detector',

Improve this page

Add a description, image, and links to the language-detection topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the language-detection topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.