corpora
Here are 101 public repositories matching this topic...
-
Updated
Mar 16, 2018 - Python
@stevenbird
Initial release of the corpus available at: https://doi.org/10.7488/ds/1411
Suggested NLTK name: ARCOSG
I have updated and corrected the corpus for inclusion in NLTK. (The one at the link above is older and shouldn't be used).
Corpus reader code verified:
`arcosg = LazyCorpusLoader(
'arcosg',
CategorizedTaggedCorpusReader,
r'.*.txt',
cat_file='cats.prn',
-
Updated
Jan 7, 2019
-
Updated
May 8, 2019 - PHP
-
Updated
Jun 17, 2020
-
Updated
Jan 1, 2019 - Python
-
Updated
Jun 4, 2020 - R
Feature just like "Suggest" in PML-TQ web, which allows a user to pick nodes in a tree in search results (words in a concordance line, in case of Kontext), visualise the nodes (words) with all their attributes and pick the ones that should be part of the new query.
See e.g. http://hdl.handle.net/11346/PMLTQ-ELDJ, execute the query and click on some nodes in a resulting tree. Suggest button w
-
Updated
May 20, 2020 - Python
-
Updated
Jun 18, 2019 - Python
-
Updated
Jul 5, 2019 - PHP
As of now, the codebase lacks a proper set of offline documentation (docstrings). I think it may be a good idea to add it.
-
Updated
Jul 2, 2018 - Python
-
Updated
Aug 7, 2017
-
Updated
Mar 15, 2018
-
Updated
Aug 13, 2018
-
Updated
Jun 12, 2019 - Java
-
Updated
Oct 3, 2018 - R
The current version of corpus provides raw text of tweets which violates Twitter policy. We should provide raw message ids instead and re-construct the corpus from these ids.
-
Updated
Jun 14, 2020 - Python
-
Updated
May 15, 2017 - Shell
-
Updated
Jun 27, 2016
Improve this page
Add a description, image, and links to the corpora topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the corpora topic, visit your repo's landing page and select "manage topics."


flake8 testing of https://github.com/juand-r/entity-recognition-datasets on Python 3.7.0
$ flake8 . --count --select=E901,E999,F821,F822,F823 --show-source --statistics