Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang
-
Updated
May 11, 2021 - Go
Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang
Interesting (non-cryptographic) hashes implemented in pure Python.
A simple implementation of simhash algorithm by java.
Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics.
Simhash implementation in Javascript
Dynatrace hash library for Java
semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language models (BERT).
A fast python implementation of the SimHash algorithm.
SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation, Simhash and SimhashIndex
Locality Sensitive Hashing
A library for cosine similarity & simhash calculation
Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.
Find duplicate text files.
基于springboot和Google开源simhash算法实现的作业查重/抄袭检测/文本相似度分析可视化系统,,集成jplag、MOSS、singleCloud工具套件进行多方位查重 Ref: https://github.com/ALuShu/checksystem
Add a description, image, and links to the simhash topic page so that developers can more easily learn about it.
To associate your repository with the simhash topic, visit your repo's landing page and select "manage topics."