Chinese word vector training tutorial
gensim by RaRe-Technologies
Topic Modelling for Humans
Python
14417
Updated: 2 y ago
License: Weak Copyleft (LGPL-2.1)
BERTopic by MaartenGr
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Python
4329
Updated: 2 y ago
License: Permissive (MIT)
Top2Vec by ddangelov
Top2Vec learns jointly embedded topic, document and word vectors.
Python
2558
Updated: 2 y ago
License: Permissive (BSD-3-Clause)
Familia by baidu
A Toolkit for Industrial Topic Modeling
C++
2420
Updated: 4 y ago
License: Permissive (BSD-3-Clause)
lda by lda-project
Topic modeling with latent Dirichlet allocation using Gibbs sampling
Python
1122
Updated: 2 y ago
License: Weak Copyleft (MPL-2.0)
LDAvis by cpsievert
R package for web-based interactive topic model visualization.
JavaScript
532
Updated: 2 y ago
License: Proprietary (Proprietary)
tomotopy by bab2min
Python package of Tomoto, the Topic Modeling Tool
C++
474
Updated: 2 y ago
License: Permissive (MIT)
corex_topic by gregversteeg
Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx
Python
462
Updated: 4 y ago
License: Permissive (Apache-2.0)
lda2vec-tf by meereeum
TensorFlow port of the lda2vec model for unsupervised learning of document + topic + word embeddings
Python
431
Updated: 2 y ago
License: No License (No License)
ETM by adjidieng
Topic Modeling in Embedding Spaces
Python
422
Updated: 3 y ago
License: Permissive (MIT)
GuidedLDA by vi3k6i5
Semi-supervised guided topic model with custom guidedLDA
Python
404
Updated: 4 y ago
License: Weak Copyleft (MPL-2.0)
BTM by xiaohuiyan
Code for Biterm Topic Model (published in WWW 2013)
C++
389
Updated: 2 y ago
License: Permissive (Apache-2.0)
SIFRank_zh by sunyilgdx
Keyphrase or Keyword Extraction: a Chinese keyword extraction method based on pre-trained models (Chinese-language code for the paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model")
Python
369
Updated: 2 y ago
License: No License (No License)
python-topic-model by dongwookim-ml
Implementation of various topic models
Jupyter Notebook
348
Updated: 4 y ago
License: Permissive (Apache-2.0)
Twitter-LDA by minghui
Latent Dirichlet Allocation (LDA) model for microblogs (Twitter, Weibo, etc.)
Java
309
Updated: 4 y ago
License: No License (No License)
stm by bstewart
An R Package for the Structural Topic Model
R
308
Updated: 4 y ago
License: Proprietary (Proprietary)
lda by primaryobjects
LDA topic modeling for node.js
JavaScript
263
Updated: 4 y ago
License: Permissive (Apache-2.0)
THUTag by YeDeming
A Package of Keyphrase Extraction and Social Tag Suggestion, the project has moved to
Java
257
Updated: 4 y ago
License: No License (No License)
SketchGraphs by PrincetonLIPS
A dataset of 15 million CAD sketches with geometric constraint graphs.
Python
254
Updated: 2 y ago
License: Permissive (MIT)
RealSkel by jingma-git
Real-time Skeletonization for Sketch-based Modeling
C++
248
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)
minsky by highperformancecoder
A systems dynamics economics modeling software
C++
246
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)
dynamic-nmf by derekgreene
Dynamic Topic Modeling via Non-negative Matrix Factorization
Python
239
Updated: 4 y ago
License: Permissive (Apache-2.0)
wordlists by imsky
Words categorized by topic.
JavaScript
235
Updated: 2 y ago
License: Permissive (MIT)
cw2vec by bamtercelboo
cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
C++
232
Updated: 4 y ago
License: Permissive (Apache-2.0)
LDAGibbsSampling by yangliuy
Open Source Package for Gibbs Sampling of LDA
Java
222
Updated: 4 y ago
License: No License (No License)
GEMSEC by benedekrozemberczki
The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
Python
217
Updated: 4 y ago
License: Strong Copyleft (GPL-3.0)
tmtoolkit by WZBSocialScienceCenter
Text Mining and Topic Modeling Toolkit for Python with parallel processing power
Python
192
Updated: 2 y ago
License: Permissive (Apache-2.0)
LDA4j by hankcs
A Java implementation of LDA (Latent Dirichlet Allocation)
Java
191
Updated: 4 y ago
License: No License (No License)
Splitter by benedekrozemberczki
A PyTorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).
Python
180
Updated: 4 y ago
License: Strong Copyleft (GPL-3.0)
LFTM by datquocnguyen
Improving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
Java
169
Updated: 4 y ago
License: Proprietary (Proprietary)
Palmetto by dice-group
Palmetto is a quality measuring tool for topics
Java
168
Updated: 3 y ago
License: Strong Copyleft (AGPL-3.0)
dtm by blei-lab
This implements topics that change over time (Dynamic Topic Models) and a model of how individual documents predict that change.
Shell
165
Updated: 4 y ago
License: Strong Copyleft (GPL-2.0)
topics by vladsandulescu
Topic modeling with gensim and LDA
Python
158
Updated: 4 y ago
License: Permissive (Apache-2.0)
word2vec by sajari
Go library for performing computations in word2vec binary models
Go
152
Updated: 3 y ago
License: Permissive (MIT)
lda-c by blei-lab
This is a C implementation of variational EM for latent Dirichlet allocation (LDA), a topic model for text or other discrete data.
C
144
Updated: 4 y ago
License: Weak Copyleft (LGPL-2.1)
LTEInspector by relentless-warrior
Models, properties, and write-up of LTEInspector (NDSS'18)
Python
141
Updated: 2 y ago
License: No License (No License)
hdp by blei-lab
Hierarchical Dirichlet processes. Topic models where the data determine the number of topics. This implements Gibbs sampling.
C++
141
Updated: 4 y ago
License: Strong Copyleft (GPL-2.0)
Gaussian_LDA by rajarshd
HTML
138
Updated: 4 y ago
License: Proprietary (Proprietary)
role2vec by benedekrozemberczki
A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Python
137
Updated: 4 y ago
License: Strong Copyleft (GPL-3.0)
sprig by sittercity
Object modeling system for Kohana, inspired by Django
PHP
136
Updated: 4 y ago
License: Permissive (MIT)
MUSAE by benedekrozemberczki
The reference implementation of "Multi-scale Attributed Node Embedding". (Journal of Complex Networks 2021)
Python
133
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)
lda-ruby by ealdent
A Ruby wrapper for Latent Dirichlet Allocation (LDA).
C
132
Updated: 4 y ago
License: Weak Copyleft (LGPL-2.1)
Concept by MaartenGr
Concept Modeling: Topic Modeling on Images and Text
Python
132
Updated: 2 y ago
License: Permissive (MIT)
online-hdp by blei-lab
Online inference for the Hierarchical Dirichlet Process. Fits hierarchical Dirichlet process topic models to massive data. The algorithm determines the number of topics.
Python
131
Updated: 4 y ago
License: Strong Copyleft (GPL-2.0)
topicModelling by balikasg
A project with topic model implementations
Python
130
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)
hlda by joewandy
Gibbs sampler for the Hierarchical Latent Dirichlet Allocation topic model
Jupyter Notebook
126
Updated: 3 y ago
License: Strong Copyleft (GPL-3.0)
distributed_skipgram_mixture by microsoft
Distributed skipgram mixture model for multisense word embedding
C++
117
Updated: 4 y ago
License: Permissive (MIT)
lda2vec-pytorch by TropComplique
Topic modeling with word vectors
Jupyter Notebook
112
Updated: 4 y ago
License: Permissive (MIT)
dunnart by mjwybrow
Constraint-based diagram editor
C++
110
Updated: 4 y ago
License: Proprietary (Proprietary)