Skip to content
#

stemmer

Here are 138 public repositories matching this topic...

smit1678
smit1678 commented May 10, 2019

Opening this ticket as I'm looking to add Bahasa Indonesia as a supported language. At https://github.com/hotosm we're working on various documentation sites that need support across a number of languages, including Indonesian.

If anyone has already worked on this, please chime in. Otherwise, we will be looking to add the support and create a PR when ready.

cadmium
rmarronnier
rmarronnier commented Sep 30, 2019

As you can see browsing Cadmium shards source code, several entities (for lack of a better word) are declared in different locations and in different ways.

This issue is not just a namespace or redundancy issue but we'd benefit by having fundamental classes or structs describing the tokens, sentences and documents we're dealing with.

I've started in the pos_tagger declaring such structs an

nevf
nevf commented Nov 24, 2016

You mention:

Allows saving/loading the index to/from disk, but for small datasets you can feed the index on-the-fly.

however I can't see any documentation about this. I'd like to store the index in a database (MongoDB) and query that.

FYI I'm looking at and evaluating the various full-text search libraries available for Node.js and the Browser and have only just found thinker-fts and from

Improve this page

Add a description, image, and links to the stemmer topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the stemmer topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.