Skip to content
#

spacy

Here are 532 public repositories matching this topic...

ines
ines commented Sep 29, 2019

I was going though the existing enhancement issues again and though it'd be nice to collect ideas for spaCy plugins and related projects. There are always people in the community who are looking for new things to build, so here's some inspiration For existing plugins and projects, check out the spaCy universe.

If you have questions about the projects I suggested,

rasa
solyarisoftware
solyarisoftware commented Apr 30, 2020

I propose this topic as feature request, but it's also a documentation issue, as lack of details in user guide paragraph: https://rasa.com/docs/rasa/core/actions/#custom-actions.

What specified in paragraph Execute Actions in Other Code is obscure to me, and details at the API documentation link [Action Server](]https://rasa.com/docs/rasa/api/acti

textacy
loricelli
loricelli commented Feb 12, 2020

I'm trying to find verbs in a sentence with python for a NLP problem but the new function returns any match and not only the longest match (which pos_regex_matches does).

pattern = r'<VERB>*<ADV>*<VERB>+<PART>*'
verb_pattern = [{"POS": "VERB", "OP": "*"},{"POS": "ADV", "OP": "*"},{"POS": "VERB", "OP": "+"},{"POS": "PART", "OP": "*"}]

t_list_1 = textacy.extract.pos_regex_matches(text, pat
bug
fukidzon
fukidzon commented Feb 21, 2020

I wanted to use pytextrank together with spacy_udpipe to get keywords from texts in other languages (see https://stackoverflow.com/questions/59824405/spacy-udpipe-with-pytextrank-to-extract-keywords-from-non-english-text) but I realized, that udpipe-spacy somehow "overrides" the original spacy's pipeline so the noun_chunks are not generated (btw: the noun_chunks are created in lang/en/syntax_itera

lamthuy
lamthuy commented Apr 1, 2020

Hi,
When we try to tokenize the following sentence:

If we use spacy

a = spacy.load('en_core_web_lg')

doc = a("I like the link http://www.idph.iowa.gov/ohds/oral-health-center/coordinator")

list(doc)

We got

[I, like, the, link, http://www.idph.iowa.gov, /, ohds, /, oral, -, health, -, center, /, coordinator]

But if we use the Spacy transformer tokenizer:

SergeyShk
SergeyShk commented Mar 23, 2020

spaCy version: 2.1.9
spaCy-stanza version: 0.2.1

import stanza
from spacy_stanza import StanzaLanguage

stanza.download('ru')
snlp = stanza.Pipeline(lang="ru")
nlp = StanzaLanguage(snlp)
text = "Мама мыла раму"

Using stanza, i get this:

for sentence in snlp(text).senteces:
	for word in sentence.words:
		print(word.feats)

# Animacy=Anim|Case=Nom|Gender=

Improve this page

Add a description, image, and links to the spacy topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the spacy topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.