Topic Modelling for Humans
-
Updated
Aug 8, 2023 - Python
Topic Modelling for Humans
Compute Sentence Embeddings Fast!
Telegram Data Clustering contest solution by Mindful Squirrel
Web Application for checking the similarity between query and document using the concept of Cosine Similarity.
Document similarity algorithms experiment - Jaccard, TF-IDF, Doc2vec, USE, and BERT.
A Clojure library for querying large data-sets on similarity
Document Search Engine Tool
Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.
Document Search Engine project with TF-IDF abd Google universal sentence encoder model
Compilation of Natural Language Processing (NLP) codes. BONUS: Link to Information Retrieval (IR) codes compilation. (checkout the readme)
A tool which can find your any document using semantic search
Document Similarity with Apache Spark using Locality Sesitive Hashing and Python
Using Jaccard-Similarity and Minhashing to determine similarity between two text documents
Rust-based text search engine from scratch supporting multiple document similarity metrics (TF-IDF, BM25, BM25VA)
Survey data and Python code for the ICADL 2021 paper "A Qualitative Evaluation of User Preference for Link-based vs. Text-based Recommendations of Wikipedia Articles"
Telegram Data Clustering Contest (Bossy Gnu's submission )
DocxMatch is a Streamlit app that analyzes the similarity between Word files.
The Bitnation Jurisdiction Public Notary DApp
Document searching from queries using Inverted index
Add a description, image, and links to the document-similarity topic page so that developers can more easily learn about it.
To associate your repository with the document-similarity topic, visit your repo's landing page and select "manage topics."