The implementation of 'Effective Document Labeling with Very Few Seed Words: A Topic Modeling Approach', Chenliang Li, Jian Xing, Aixin Sun, Zongyang Ma, https://doi.org/10.1145/2983323.2983721
The java implementation of "Enhancing Topic Modeling for Short Texts with Auxiliary Word Embeddings" TOIS 2017, Chenliang Li, Yu Duan, Haoran Wang, Zhiqian Zhang, Aixin Sun, Zongyang Ma, https://dl.acm.org/citation.cfm?doid=3133943.3091108
Determine a Prototype from a number of runs of Latent Dirichlet Allocation (LDA) measuring its similarities with S-CLOP: A procedure to select the LDA run with highest mean pairwise similarity, which is measured by S-CLOP (Similarity of multiple sets by Clustering with Local Pruning), to all other runs.
a fast Cpp-implementation Hierarchy Latent Dirichlet Allocation algorithm, can aggregate stop-words/meaningless-high-frequency-words into "common-topic"(a rubbish words bucket) and generate K(number of topics you set) more pure "special-topics".
This is the java implementation of paper "Multi-label Dataless Text Classification with Topic Modeling", Daochen Zha, Chenliang Li, https://arxiv.org/abs/1711.01563
ContentsPlanet is integrative with other systems via the OS standard file system, using outline-based content writing and management based on topic model across directories.
Content-based recommender system for scientific research articles, with Dash application for browsing 100+ subdomains developed through extensive NMF topic modeling.