kandi background
Explore Kits

15 best Python Topic Modelling libraries in 2023

by reegs20 Updated: Dec 20, 2022

Find patterns or themes in large document sets, create links, pinpoint important subjects, implement popular algorithms like LSA/LSI/SVD, and more for your Artificial Intelligence, Topic Modeling, Bert, Neural Network, Transformer, and NLP applications. Topic modeling is a method for locating hidden subjects in vast amounts of text. Extensive collections of unstructured text bodies can be organized and understood using topic models. Topic models have been used to find instructional structures in data, including genetic information, pictures, and networks, since they were first created as a text-mining technique. The method falls under the category of an unsupervised machine learning algorithm. Latent Dirichlet Allocation (LDA) is the algorithm's name, a component of Python's Gensim module. 

 

Topic modeling is applied to several tasks, including document segmentation, classification, and summarization. Social networks, population genetics, and computer vision are some of the most novel applications. Topic modeling aids in query expansion in information retrieval. It also customizes search results or provides recommendations by associating user preferences with topics. 


Some key features of the Python Topic Modelling libraries are intuitive interfaces, the ease with which you can plug in your input corpus or datastream, distributed computing, state-of-the-art multilingual word embeddings, large-scale, high-quality bilingual dictionaries for training and evaluation, etc. 


Check out the below list to find the best Python topic modeling libraries for your application: 

gensimby RaRe-Technologies

Python star image 13903 Version:4.3.0

License: Weak Copyleft (LGPL-2.1)

Topic Modelling for Humans

Support
Quality
Security
License
Reuse

gensimby RaRe-Technologies

Python star image 13903 Version:4.3.0 License: Weak Copyleft (LGPL-2.1)

Topic Modelling for Humans
Support
Quality
Security
License
Reuse

MUSEby facebookresearch

Python star image 3033 Version:Current

License: Others (Non-SPDX)

A library for Multilingual Unsupervised or Supervised word Embeddings

Support
Quality
Security
License
Reuse

MUSEby facebookresearch

Python star image 3033 Version:Current License: Others (Non-SPDX)

A library for Multilingual Unsupervised or Supervised word Embeddings
Support
Quality
Security
License
Reuse

textheroby jbesomi

Python star image 2696 Version:1.1.0

License: Permissive (MIT)

Text preprocessing, representation and visualization from zero to hero.

Support
Quality
Security
License
Reuse

textheroby jbesomi

Python star image 2696 Version:1.1.0 License: Permissive (MIT)

Text preprocessing, representation and visualization from zero to hero.
Support
Quality
Security
License
Reuse

BERTopicby MaartenGr

Python star image 3675 Version:0.13.0

License: Permissive (MIT)

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Support
Quality
Security
License
Reuse

BERTopicby MaartenGr

Python star image 3675 Version:0.13.0 License: Permissive (MIT)

Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Support
Quality
Security
License
Reuse

awesome-sentence-embeddingby Separius

Python star image 2090 Version:Current

License: Strong Copyleft (GPL-3.0)

A curated list of pretrained sentence and word embedding models

Support
Quality
Security
License
Reuse

awesome-sentence-embeddingby Separius

Python star image 2090 Version:Current License: Strong Copyleft (GPL-3.0)

A curated list of pretrained sentence and word embedding models
Support
Quality
Security
License
Reuse

scattertextby JasonKessler

Python star image 2021 Version:0.1.10

License: Permissive (Apache-2.0)

Beautiful visualizations of how language differs among document types.

Support
Quality
Security
License
Reuse

scattertextby JasonKessler

Python star image 2021 Version:0.1.10 License: Permissive (Apache-2.0)

Beautiful visualizations of how language differs among document types.
Support
Quality
Security
License
Reuse

word2vec-apiby 3Top

Python star image 1387 Version:v0.1

License: No License (null)

Simple web service providing a word embedding model

Support
Quality
Security
License
Reuse

word2vec-apiby 3Top

Python star image 1387 Version:v0.1 License: No License

Simple web service providing a word embedding model
Support
Quality
Security
License
Reuse

deep-siamese-text-similarityby dhwajraj

Python star image 1372 Version:Current

License: Permissive (MIT)

Tensorflow based implementation of deep siamese LSTM network to capture phrase/sentence similarity using character/word embeddings

Support
Quality
Security
License
Reuse

deep-siamese-text-similarityby dhwajraj

Python star image 1372 Version:Current License: Permissive (MIT)

Tensorflow based implementation of deep siamese LSTM network to capture phrase/sentence similarity using character/word embeddings
Support
Quality
Security
License
Reuse

nlp-journeyby msgi

Python star image 1497 Version:v1.0

License: Permissive (Apache-2.0)

Documents, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Generation, Text Similarity, Machine Translation),etc. All codes are implemented intensorflow 2.0.

Support
Quality
Security
License
Reuse

nlp-journeyby msgi

Python star image 1497 Version:v1.0 License: Permissive (Apache-2.0)

Documents, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Generation, Text Similarity, Machine Translation),etc. All codes are implemented intensorflow 2.0.
Support
Quality
Security
License
Reuse

ldaby lda-project

Python star image 1122 Version:0.3.2

License: Weak Copyleft (MPL-2.0)

Topic modeling with latent Dirichlet allocation using Gibbs sampling

Support
Quality
Security
License
Reuse

ldaby lda-project

Python star image 1122 Version:0.3.2 License: Weak Copyleft (MPL-2.0)

Topic modeling with latent Dirichlet allocation using Gibbs sampling
Support
Quality
Security
License
Reuse

contextualized-topic-modelsby MilaNLProc

Python star image 998 Version:2.4.2

License: Permissive (MIT)

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.

Support
Quality
Security
License
Reuse

contextualized-topic-modelsby MilaNLProc

Python star image 998 Version:2.4.2 License: Permissive (MIT)

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
Support
Quality
Security
License
Reuse

ETMby adjidieng

Python star image 422 Version:Current

License: Permissive (MIT)

Topic Modeling in Embedding Spaces

Support
Quality
Security
License
Reuse

ETMby adjidieng

Python star image 422 Version:Current License: Permissive (MIT)

Topic Modeling in Embedding Spaces
Support
Quality
Security
License
Reuse

GuidedLDAby vi3k6i5

Python star image 404 Version:Current

License: Weak Copyleft (MPL-2.0)

semi supervised guided topic model with custom guidedLDA

Support
Quality
Security
License
Reuse

GuidedLDAby vi3k6i5

Python star image 404 Version:Current License: Weak Copyleft (MPL-2.0)

semi supervised guided topic model with custom guidedLDA
Support
Quality
Security
License
Reuse

dynamic-nmfby derekgreene

Python star image 239 Version:Current

License: Permissive (Apache-2.0)

Dynamic Topic Modeling via Non-negative Matrix Factorization

Support
Quality
Security
License
Reuse

dynamic-nmfby derekgreene

Python star image 239 Version:Current License: Permissive (Apache-2.0)

Dynamic Topic Modeling via Non-negative Matrix Factorization
Support
Quality
Security
License
Reuse

topicsby vladsandulescu

Python star image 158 Version:Current

License: Permissive (Apache-2.0)

Topic modeling with gensim and LDA

Support
Quality
Security
License
Reuse

topicsby vladsandulescu

Python star image 158 Version:Current License: Permissive (Apache-2.0)

Topic modeling with gensim and LDA
Support
Quality
Security
License
Reuse