Here are 182 public repositories matching this topic...
Modern spell checking library - accurate, fast, multi-language
NeuSpell: A Neural Spelling Correction Toolkit
Updated
Feb 16, 2022
Python
Covid-19 Twitter dataset for non-commercial research use and pre-processing scripts - under active development
Updated
Apr 10, 2022
Jupyter Notebook
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e., patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller``, which allows you to build, view, manipulate and query pattern models.
A C++ library providing fast language model queries in compressed space.
NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.
Updated
Jan 31, 2022
JavaScript
A flexible and general-purpose ngrams library written in Ruby. Raingrams supports ngram sizes greater than 1, text/non-text grams, multiple parsing styles and open/closed vocabulary models.
Updated
Feb 28, 2021
Ruby
Poetry generation via natural language Markov models
Updated
Jan 5, 2017
Python
A fast and reliable PHP library for detecting languages
Typing Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.
Python implementation of an N-gram language model with Laplace smoothing and sentence generation.
Updated
Feb 9, 2018
Python
Next Word Prediction using n-gram Probabilistic Model with various Smoothing Techniques
Updated
Jul 23, 2018
Jupyter Notebook
Using Python to detect cyber-bullying on Twitter based on categories and kinds of language used with at-risk tweets
Updated
Jun 22, 2020
Python
🦜 NLP for Tibetan, in Python.
Updated
Apr 28, 2021
Python
An R-based guide to sampling Google n-gram data, building historical term-feature matrices & investigating lexical semantic change historically.
Random Thai text generator
Word generation based on n-gram models, and a CLI utility to generate said models.
Updated
Sep 1, 2016
JavaScript
Rust library providing fast language model queries in compressed space
Updated
Jan 13, 2022
Rust
A fast, compact trigram library for Icelandic
Updated
Jan 21, 2022
Python
🍰 A library for creating n-grams, skip-grams, bag of words, bag of n-grams, bag of skip-grams.
fast and comprehensive k-mer counting package
A C++ library implementing fast language model estimation using the 1-Sort algorithm.
Jupyter Notebook for Natural Language Processing learning
Updated
Apr 28, 2017
Jupyter Notebook
Updated
Dec 2, 2021
Python
The NLTK Model Submodule.
Updated
Jan 10, 2019
Python
Detect the language of text
Updated
Jul 29, 2017
Erlang
text mining, regex, N-grams, fuzzy matching
Updated
Jan 22, 2021
Jupyter Notebook
Updated
Nov 30, 2021
JavaScript
Syllable counting and detection using an n-gram language model.
Updated
Feb 18, 2017
Clojure
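Several of the repositories above build on the same few primitives: contiguous n-grams, skip-grams, and smoothed n-gram probabilities. The following is a minimal sketch of those ideas in plain Python; it is not taken from any listed project. The function names (`ngrams`, `skip_bigrams`, `laplace_bigram_prob`) and the toy sentence are illustrative assumptions, and the skip-gram variant shown here (token pairs with a bounded number of skipped tokens) is simpler than the gapped patterns some of these libraries support.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def skip_bigrams(tokens, max_gap):
    """Return pairs (a, b) where b follows a with at most max_gap tokens skipped."""
    pairs = []
    for i, left in enumerate(tokens):
        # j ranges over the next (max_gap + 1) positions, clipped to the list end.
        for j in range(i + 1, min(i + 2 + max_gap, len(tokens))):
            pairs.append((left, tokens[j]))
    return pairs


def laplace_bigram_prob(tokens, prev, word):
    """Add-one (Laplace) estimate of P(word | prev) from a token list."""
    vocab_size = len(set(tokens))
    bigram_counts = Counter(ngrams(tokens, 2))
    # Count only tokens that have a successor, so counts match the bigram contexts.
    context_counts = Counter(tokens[:-1])
    return (bigram_counts[(prev, word)] + 1) / (context_counts[prev] + vocab_size)


# Hypothetical toy corpus, used only for illustration.
tokens = "the cat sat on the mat".split()
bigrams = ngrams(tokens, 2)
skips = skip_bigrams(tokens, max_gap=1)
p = laplace_bigram_prob(tokens, "the", "cat")
```

With `max_gap=1`, `skip_bigrams` yields the ordinary bigrams plus pairs such as `("the", "sat")` that skip one token. Add-one smoothing gives every unseen bigram a small non-zero probability, which is the simplest of the smoothing techniques mentioned in the descriptions above.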