natural-language-processing
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
Here are 7,818 public repositories matching this topic...
-
Updated
Jun 2, 2021 - Python
-
Updated
Jun 14, 2021 - Jupyter Notebook
-
Updated
Jun 18, 2021 - Python
-
Updated
Jun 18, 2021 - Python
-
Updated
Jun 18, 2021 - Python
-
Updated
Jun 14, 2021 - Python
-
Updated
Jun 12, 2017
Change tensor.data to tensor.detach() due to
pytorch/pytorch#6990 (comment)
tensor.detach() is more robust than tensor.data.
(triggered by SO question: https://stackoverflow.com/questions/67944732/using-my-own-stopword-list-with-gensim-corpora-textcorpus-textcorpus/67951592#67951592)
Gensim has two remove_stopwords() functions with similar, but slightly-different behavior that risks confusing users.
gensim.parsing.preprocessing.remove_stopwords takes a space-delimited string, and always consults the current
-
Updated
Jun 16, 2021
-
Updated
May 2, 2021
-
Updated
Jun 18, 2021 - Python
-
Updated
Jun 16, 2021 - Python
-
Updated
Jun 4, 2021
Is your feature request related to a problem? Please describe.
I typically used compressed datasets (e.g. gzipped) to save disk space. This works fine with AllenNLP during training because I can write my dataset reader to load the compressed data. However, the predict command opens the file and reads lines for the Predictor. This fails when it tries to load data from my compressed files.
-
Updated
Jun 17, 2021 - Python
-
Updated
Dec 22, 2020 - Python
-
Updated
Jun 17, 2021 - Python
-
Updated
May 21, 2021
-
Updated
Jun 14, 2021 - Python
-
Updated
May 2, 2021 - Jupyter Notebook
-
Updated
Jun 17, 2021 - Java
-
Updated
May 29, 2021 - Python
-
Updated
Jun 17, 2021 - Python
Hello spoooopyyy hackers
This is a Hacktoberfest only issue!
This is also data-sciency!
The Problem
Our English dictionary contains words that aren't English, and does not contain common English words.
Examples of non-common words in the dictionary:
"hlithskjalf",
"hlorrithi",
"hlqn",
"hm",
"hny",
"ho",
"hoactzin",
"hoactzine
-
Updated
Jun 12, 2021 - Python
-
Updated
Jun 15, 2021 - Python
Created by Alan Turing
- Wikipedia
- Wikipedia
Let's use this Issue to track performance issues and enhancement requests, so it's easier to prioritize the work.
This is for pytorch
transformersAlso I will label it as a
Good Difficult Issuein case someone is ready for a challenging but rewarding experience of figuring things out. If you do want to take the challenge comment in the corresponding Issue/PR that resonates with you s