corpora
Here are 102 public repositories matching this topic...
@stevenbird
Initial release of the corpus available at: https://doi.org/10.7488/ds/1411
Suggested NLTK name: ARCOSG
I have updated and corrected the corpus for inclusion in NLTK. (The one at the link above is older and shouldn't be used).
Corpus reader code verified:
`arcosg = LazyCorpusLoader(
'arcosg',
CategorizedTaggedCorpusReader,
r'.*.txt',
cat_file='cats.prn',
-
Updated
Mar 16, 2018 - Python
-
Updated
Jan 7, 2019
-
Updated
Jul 10, 2020 - PHP
-
Updated
Jul 15, 2020
-
Updated
Jan 1, 2019 - Python
-
Updated
Jun 23, 2020 - R
-
Updated
May 20, 2020 - Python
Feature just like "Suggest" in PML-TQ web, which allows a user to pick nodes in a tree in search results (words in a concordance line, in case of Kontext), visualise the nodes (words) with all their attributes and pick the ones that should be part of the new query.
See e.g. http://hdl.handle.net/11346/PMLTQ-ELDJ, execute the query and click on some nodes in a resulting tree. Suggest button w
-
Updated
Jun 18, 2019 - Python
-
Updated
Jul 9, 2020 - PHP
As of now, the codebase lacks a proper set of offline documentation (docstrings). I think it may be a good idea to add it.
-
Updated
Aug 7, 2017
-
Updated
Jul 2, 2018 - Python
-
Updated
Mar 15, 2018
-
Updated
Jun 12, 2019 - Java
-
Updated
Aug 13, 2018
-
Updated
Oct 3, 2018 - R
The current version of corpus provides raw text of tweets which violates Twitter policy. We should provide raw message ids instead and re-construct the corpus from these ids.
-
Updated
Jun 30, 2020 - Python
-
Updated
May 15, 2017 - Shell
-
Updated
Jun 25, 2020 - C#
Improve this page
Add a description, image, and links to the corpora topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the corpora topic, visit your repo's landing page and select "manage topics."


flake8 testing of https://github.com/juand-r/entity-recognition-datasets on Python 3.7.0
$ flake8 . --count --select=E901,E999,F821,F822,F823 --show-source --statistics