Topic Modelling for Humans
Support
Quality
Security
License
Reuse
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Support
Quality
Security
License
Reuse
Top2Vec learns jointly embedded topic, document and word vectors.
Support
Quality
Security
License
Reuse
A Toolkit for Industrial Topic Modeling
Support
Quality
Security
License
Reuse
Topic modeling with latent Dirichlet allocation using Gibbs sampling
Support
Quality
Security
License
Reuse
R package for web-based interactive topic model visualization.
Support
Quality
Security
License
Reuse
Python package of Tomoto, the Topic Modeling Tool
Support
Quality
Security
License
Reuse
Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx
Support
Quality
Security
License
Reuse
中文詞向量訓練教學
Support
Quality
Security
License
Reuse
tensorflow port of the lda2vec model for unsupervised learning of document + topic + word embeddings
Support
Quality
Security
License
Reuse
Topic Modeling in Embedding Spaces
Support
Quality
Security
License
Reuse
semi supervised guided topic model with custom guidedLDA
Support
Quality
Security
License
Reuse
Code for Biterm Topic Model (published in WWW 2013)
Support
Quality
Security
License
Reuse
Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)
Support
Quality
Security
License
Reuse
p
python-topic-modelby dongwookim-ml
Jupyter Notebook 348 Version:Current License: Permissive (Apache-2.0)
Implementation of various topic models
Support
Quality
Security
License
Reuse
Latent Dirichlet Allocation (LDA) model for Microblogs (Twitter, weibo etc.)
Support
Quality
Security
License
Reuse
An R Package for the Structural Topic Model
Support
Quality
Security
License
Reuse
LDA topic modeling for node.js
Support
Quality
Security
License
Reuse
A Package of Keyphrase Extraction and Social Tag Suggestion, the project has moved to
Support
Quality
Security
License
Reuse
A dataset of 15 million CAD sketches with geometric constraint graphs.
Support
Quality
Security
License
Reuse
Real-time Skeletonization for Sketch-based Modeling
Support
Quality
Security
License
Reuse
A systems dynamics economics modeling software
Support
Quality
Security
License
Reuse
Dynamic Topic Modeling via Non-negative Matrix Factorization
Support
Quality
Security
License
Reuse
Words categorized by topic.
Support
Quality
Security
License
Reuse
cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
Support
Quality
Security
License
Reuse
Open Source Package for Gibbs Sampling of LDA
Support
Quality
Security
License
Reuse
The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
Support
Quality
Security
License
Reuse
Text Mining and Topic Modeling Toolkit for Python with parallel processing power
Support
Quality
Security
License
Reuse
A Java implemention of LDA(Latent Dirichlet Allocation)
Support
Quality
Security
License
Reuse
A Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).
Support
Quality
Security
License
Reuse
Improving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
Support
Quality
Security
License
Reuse
Palmetto is a quality measuring tool for topics
Support
Quality
Security
License
Reuse
This implements topics that change over time (Dynamic Topic Models) and a model of how individual documents predict that change.
Support
Quality
Security
License
Reuse
Topic modeling with gensim and LDA
Support
Quality
Security
License
Reuse
Go library for performing computations in word2vec binary models
Support
Quality
Security
License
Reuse
This is a C implementation of variational EM for latent Dirichlet allocation (LDA), a topic model for text or other discrete data.
Support
Quality
Security
License
Reuse
Models, properties, and write-up of LTEInspector (NDSS'18)
Support
Quality
Security
License
Reuse
Hierarchical Dirichlet processes. Topic models where the data determine the number of topics. This implements Gibbs sampling.
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Support
Quality
Security
License
Reuse
Object modeling system for Kohana, inspired by Django
Support
Quality
Security
License
Reuse
The reference implementation of "Multi-scale Attributed Node Embedding". (Journal of Complex Networks 2021)
Support
Quality
Security
License
Reuse
A Ruby wrapper for Latent Dirichlet Allocation (LDA).
Support
Quality
Security
License
Reuse
Concept Modeling: Topic Modeling on Images and Text
Support
Quality
Security
License
Reuse
Online inference for the Hierarchical Dirichlet Process. Fits hierarchical Dirichlet process topic models to massive data. The algorithm determines the number of topics.
Support
Quality
Security
License
Reuse
A project with topic model implementations
Support
Quality
Security
License
Reuse
Gibbs sampler for the Hierarchical Latent Dirichlet Allocation topic model
Support
Quality
Security
License
Reuse
Distributed skipgram mixture model for multisense word embedding
Support
Quality
Security
License
Reuse
Topic modeling with word vectors
Support
Quality
Security
License
Reuse
Constraint-based diagram editor
Support
Quality
Security
License
Reuse
g
gensimby RaRe-Technologies
Topic Modelling for Humans
Python 14417Updated: 1 y ago License: Weak Copyleft (LGPL-2.1)
Support
Quality
Security
License
Reuse
B
BERTopicby MaartenGr
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Python 4329Updated: 1 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
T
Top2Vecby ddangelov
Top2Vec learns jointly embedded topic, document and word vectors.
Python 2558Updated: 1 y ago License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
F
Familiaby baidu
A Toolkit for Industrial Topic Modeling
C++ 2420Updated: 3 y ago License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
l
ldaby lda-project
Topic modeling with latent Dirichlet allocation using Gibbs sampling
Python 1122Updated: 2 y ago License: Weak Copyleft (MPL-2.0)
Support
Quality
Security
License
Reuse
L
LDAvisby cpsievert
R package for web-based interactive topic model visualization.
JavaScript 532Updated: 2 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
t
tomotopyby bab2min
Python package of Tomoto, the Topic Modeling Tool
C++ 474Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
c
corex_topicby gregversteeg
Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx
Python 462Updated: 3 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
w
Support
Quality
Security
License
Reuse
l
lda2vec-tfby meereeum
tensorflow port of the lda2vec model for unsupervised learning of document + topic + word embeddings
Python 431Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
E
ETMby adjidieng
Topic Modeling in Embedding Spaces
Python 422Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
G
GuidedLDAby vi3k6i5
semi supervised guided topic model with custom guidedLDA
Python 404Updated: 3 y ago License: Weak Copyleft (MPL-2.0)
Support
Quality
Security
License
Reuse
B
BTMby xiaohuiyan
Code for Biterm Topic Model (published in WWW 2013)
C++ 389Updated: 1 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
S
SIFRank_zhby sunyilgdx
Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)
Python 369Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
p
python-topic-modelby dongwookim-ml
Implementation of various topic models
Jupyter Notebook 348Updated: 3 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
T
Twitter-LDAby minghui
Latent Dirichlet Allocation (LDA) model for Microblogs (Twitter, weibo etc.)
Java 309Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
stmby bstewart
An R Package for the Structural Topic Model
R 308Updated: 3 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
l
ldaby primaryobjects
LDA topic modeling for node.js
JavaScript 263Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
T
THUTagby YeDeming
A Package of Keyphrase Extraction and Social Tag Suggestion, the project has moved to
Java 257Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
S
SketchGraphsby PrincetonLIPS
A dataset of 15 million CAD sketches with geometric constraint graphs.
Python 254Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
R
RealSkelby jingma-git
Real-time Skeletonization for Sketch-based Modeling
C++ 248Updated: 2 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
m
minskyby highperformancecoder
A systems dynamics economics modeling software
C++ 246Updated: 1 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
d
dynamic-nmfby derekgreene
Dynamic Topic Modeling via Non-negative Matrix Factorization
Python 239Updated: 3 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
w
wordlistsby imsky
Words categorized by topic.
JavaScript 235Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
c
cw2vecby bamtercelboo
cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
C++ 232Updated: 3 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
L
LDAGibbsSamplingby yangliuy
Open Source Package for Gibbs Sampling of LDA
Java 222Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
G
GEMSECby benedekrozemberczki
The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
Python 217Updated: 3 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
t
tmtoolkitby WZBSocialScienceCenter
Text Mining and Topic Modeling Toolkit for Python with parallel processing power
Python 192Updated: 2 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
L
LDA4jby hankcs
A Java implemention of LDA(Latent Dirichlet Allocation)
Java 191Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
S
Splitterby benedekrozemberczki
A Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).
Python 180Updated: 3 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
L
LFTMby datquocnguyen
Improving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
Java 169Updated: 4 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
P
Palmettoby dice-group
Palmetto is a quality measuring tool for topics
Java 168Updated: 3 y ago License: Strong Copyleft (AGPL-3.0)
Support
Quality
Security
License
Reuse
d
dtmby blei-lab
This implements topics that change over time (Dynamic Topic Models) and a model of how individual documents predict that change.
Shell 165Updated: 4 y ago License: Strong Copyleft (GPL-2.0)
Support
Quality
Security
License
Reuse
t
topicsby vladsandulescu
Topic modeling with gensim and LDA
Python 158Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
w
word2vecby sajari
Go library for performing computations in word2vec binary models
Go 152Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
l
lda-cby blei-lab
This is a C implementation of variational EM for latent Dirichlet allocation (LDA), a topic model for text or other discrete data.
C 144Updated: 4 y ago License: Weak Copyleft (LGPL-2.1)
Support
Quality
Security
License
Reuse
L
LTEInspectorby relentless-warrior
Models, properties, and write-up of LTEInspector (NDSS'18)
Python 141Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
h
hdpby blei-lab
Hierarchical Dirichlet processes. Topic models where the data determine the number of topics. This implements Gibbs sampling.
C++ 141Updated: 4 y ago License: Strong Copyleft (GPL-2.0)
Support
Quality
Security
License
Reuse
G
Gaussian_LDAby rajarshd
HTML 138Updated: 4 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
r
role2vecby benedekrozemberczki
A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Python 137Updated: 4 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
s
sprigby sittercity
Object modeling system for Kohana, inspired by Django
PHP 136Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
M
MUSAEby benedekrozemberczki
The reference implementation of "Multi-scale Attributed Node Embedding". (Journal of Complex Networks 2021)
Python 133Updated: 2 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
l
lda-rubyby ealdent
A Ruby wrapper for Latent Dirichlet Allocation (LDA).
C 132Updated: 4 y ago License: Weak Copyleft (LGPL-2.1)
Support
Quality
Security
License
Reuse
C
Conceptby MaartenGr
Concept Modeling: Topic Modeling on Images and Text
Python 132Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
o
online-hdpby blei-lab
Online inference for the Hierarchical Dirichlet Process. Fits hierarchical Dirichlet process topic models to massive data. The algorithm determines the number of topics.
Python 131Updated: 4 y ago License: Strong Copyleft (GPL-2.0)
Support
Quality
Security
License
Reuse
t
topicModellingby balikasg
A project with topic model implementations
Python 130Updated: 1 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
h
hldaby joewandy
Gibbs sampler for the Hierarchical Latent Dirichlet Allocation topic model
Jupyter Notebook 126Updated: 3 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
d
distributed_skipgram_mixtureby microsoft
Distributed skipgram mixture model for multisense word embedding
C++ 117Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
l
lda2vec-pytorchby TropComplique
Topic modeling with word vectors
Jupyter Notebook 112Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
d
dunnartby mjwybrow
Constraint-based diagram editor
C++ 110Updated: 4 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse