15 best Python Topic Modelling libraries in 2023

by reegs20 | Updated: Dec 20, 2022

Find patterns or themes in large document sets, link related documents, pinpoint important subjects, and implement popular algorithms such as LSA/LSI/SVD for your Artificial Intelligence, Topic Modeling, BERT, Neural Network, Transformer, and NLP applications. Topic modeling is a method for discovering hidden (latent) subjects in large amounts of text, and topic models help organize and understand extensive collections of unstructured text. Although topic models were first created as a text-mining technique, they have since been used to find instructive structures in other kinds of data, including genetic information, images, and networks. Topic modeling is an unsupervised machine learning technique. One of its best-known algorithms is Latent Dirichlet Allocation (LDA), which is implemented in Python's Gensim library, among others.

 

Topic modeling is applied to several tasks, including document segmentation, classification, and summarization. Some of its more novel application areas are social networks, population genetics, and computer vision. In information retrieval, topic modeling aids query expansion. It can also personalize search results or drive recommendations by associating user preferences with topics.
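
To make the query expansion idea concrete, here is a minimal sketch, assuming a topic model has already produced per-topic word weights. The `expand_query` function, the `topics` dictionary, and all weights in it are hypothetical and chosen only for illustration; real systems would take these weights from a trained model.

```python
def expand_query(query_terms, topics, n_extra=2):
    """Add the top-weighted words of the best-matching topic to a query.

    `topics` maps a topic label to {word: weight}. Both the labels and
    the weights are hypothetical stand-ins for a trained topic model.
    """
    # Score each topic by the total weight it assigns to the query terms.
    scores = {
        label: sum(words.get(term, 0.0) for term in query_terms)
        for label, words in topics.items()
    }
    best = max(scores, key=scores.get)
    if scores[best] == 0.0:
        return list(query_terms)  # no topic matches; leave the query unchanged

    # Rank the best topic's words by weight (ties broken alphabetically).
    ranked = sorted(topics[best].items(), key=lambda kv: (-kv[1], kv[0]))
    extra = [word for word, _ in ranked if word not in query_terms][:n_extra]
    return list(query_terms) + extra


# Hypothetical topic-word weights.
topics = {
    "finance": {"bank": 0.30, "loan": 0.25, "interest": 0.20, "credit": 0.15},
    "rivers": {"river": 0.35, "water": 0.30, "flood": 0.15, "bank": 0.10},
}

print(expand_query(["loan", "bank"], topics))
# ['loan', 'bank', 'interest', 'credit']
```

The ambiguous term "bank" is resolved toward the finance topic because "loan" appears in the same query, so the added terms come from that topic only.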


Key features of Python topic modeling libraries include intuitive interfaces, easy plug-in of your own input corpus or data stream, distributed computing, state-of-the-art multilingual word embeddings, and large-scale, high-quality bilingual dictionaries for training and evaluation.


Check out the list below to find the best Python topic modeling libraries for your application:

gensim by RaRe-Technologies

Python | Stars: 14361 | Version: 4.3.0 | License: Weak Copyleft (LGPL-2.1)

Topic Modelling for Humans

MUSE by facebookresearch

Python | Stars: 3082 | Version: Current | License: Others (Non-SPDX)

A library for Multilingual Unsupervised or Supervised word Embeddings

texthero by jbesomi

Python | Stars: 2741 | Version: 1.1.0 | License: Permissive (MIT)

Text preprocessing, representation and visualization from zero to hero.

BERTopic by MaartenGr

Python | Stars: 4305 | Version: v0.15.0 | License: Permissive (MIT)

Leveraging BERT and c-TF-IDF to create easily interpretable topics.
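
The c-TF-IDF weighting mentioned in the BERTopic description can be illustrated without the library itself. The following is a minimal pure-Python sketch, assuming the class-based weighting commonly given for c-TF-IDF: a word's frequency within a class, multiplied by log(1 + A / f), where A is the average number of words per class and f is the word's frequency across all classes. The function names and the toy documents are hypothetical, and this is not BERTopic's own code or API.

```python
import math
from collections import Counter


def c_tf_idf(docs_per_class):
    """Return {class: {word: weight}} using a class-based TF-IDF weighting.

    `docs_per_class` maps a class (topic) label to a list of tokenized
    documents. All documents of one class are treated as a single bag of words.
    """
    class_counts = {
        label: Counter(token for doc in docs for token in doc)
        for label, docs in docs_per_class.items()
    }
    # Frequency of each word across all classes.
    total_counts = Counter()
    for counts in class_counts.values():
        total_counts.update(counts)
    # Average number of words per class.
    avg_words = sum(total_counts.values()) / len(class_counts)

    weights = {}
    for label, counts in class_counts.items():
        n_words = sum(counts.values())
        weights[label] = {
            word: (freq / n_words) * math.log(1 + avg_words / total_counts[word])
            for word, freq in counts.items()
        }
    return weights


def top_words(weights, label, n=3):
    """Highest-weighted words of a class (ties broken alphabetically)."""
    ranked = sorted(weights[label].items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:n]]


# Hypothetical toy corpus, already grouped by topic.
docs_per_class = {
    "sports": [["the", "team", "won", "match"], ["coach", "praised", "team"]],
    "cooking": [["stir", "the", "sauce"], ["sauce", "needs", "salt"]],
}
weights = c_tf_idf(docs_per_class)
print(top_words(weights, "sports", n=1))   # ['team']
print(top_words(weights, "cooking", n=1))  # ['sauce']
```

In this toy corpus, "the" and "won" each occur once in the sports class, but "the" also occurs in the cooking class, so it receives a lower weight. That is the property that makes the top words per topic easier to interpret.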

awesome-sentence-embedding by Separius

Python | Stars: 2099 | Version: Current | License: Strong Copyleft (GPL-3.0)

A curated list of pretrained sentence and word embedding models

scattertext by JasonKessler

Python | Stars: 2072 | Version: 0.0.2.4.4 | License: Permissive (Apache-2.0)

Beautiful visualizations of how language differs among document types.

word2vec-api by 3Top

Python | Stars: 1388 | Version: Current | License: No License

Simple web service providing a word embedding model

deep-siamese-text-similarity by dhwajraj

Python | Stars: 1390 | Version: Current | License: Permissive (MIT)

Tensorflow based implementation of deep siamese LSTM network to capture phrase/sentence similarity using character/word embeddings

nlp-journey by msgi

Python | Stars: 1528 | Version: v1.0 | License: Permissive (Apache-2.0)

Documents, papers and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.

lda by lda-project

Python | Stars: 1122 | Version: 0.3.2 | License: Weak Copyleft (MPL-2.0)

Topic modeling with latent Dirichlet allocation using Gibbs sampling
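
For readers unfamiliar with Gibbs sampling for LDA, the sketch below shows the general idea in pure Python. It is not the lda package's code or API. It is a minimal illustration of collapsed Gibbs sampling, and the function name `lda_gibbs`, the hyperparameter defaults (`alpha`, `eta`), and the toy corpus are all assumptions made for this example.

```python
import random


def lda_gibbs(docs, n_topics, n_iter=200, alpha=0.1, eta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA on tokenized documents.

    Returns (doc_topic, topic_word, vocab), where the first two are
    count tables stored as nested lists.
    """
    rng = random.Random(seed)
    vocab = sorted({word for doc in docs for word in doc})
    word_id = {word: i for i, word in enumerate(vocab)}
    n_words = len(vocab)

    doc_topic = [[0] * n_topics for _ in docs]             # tokens per (doc, topic)
    topic_word = [[0] * n_words for _ in range(n_topics)]  # tokens per (topic, word)
    topic_total = [0] * n_topics                           # tokens per topic
    assignments = []

    # Give every token a random initial topic.
    for d, doc in enumerate(docs):
        z_doc = []
        for word in doc:
            k = rng.randrange(n_topics)
            z_doc.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[word]] += 1
            topic_total[k] += 1
        assignments.append(z_doc)

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, word in enumerate(doc):
                v = word_id[word]
                k = assignments[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                # Conditional probability of each topic, up to a constant.
                probs = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][v] + eta)
                    / (topic_total[t] + n_words * eta)
                    for t in range(n_topics)
                ]
                # Resample the topic and restore the counts.
                k = rng.choices(range(n_topics), weights=probs)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1
    return doc_topic, topic_word, vocab


# Hypothetical toy corpus with two loosely separated themes.
docs = [
    ["apple", "banana", "fruit", "apple"],
    ["banana", "fruit", "juice"],
    ["engine", "wheel", "car"],
    ["car", "engine", "road", "wheel"],
]
doc_topic, topic_word, vocab = lda_gibbs(docs, n_topics=2)
for k, counts in enumerate(topic_word):
    top = sorted(zip(counts, vocab), reverse=True)[:3]
    print(k, [word for _, word in top])
```

Each pass removes one token from the counts, computes how well every topic explains that token given all other assignments, and resamples. Dedicated libraries implement the same loop in compiled code, which matters for any corpus larger than a toy example.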

contextualized-topic-models by MilaNLProc

Python | Stars: 1053 | Version: Current | License: Permissive (MIT)

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.

ETM by adjidieng

Python | Stars: 422 | Version: Current | License: Permissive (MIT)

Topic Modeling in Embedding Spaces

                                                                                                                                                                                                                                                  GuidedLDAby vi3k6i5

                                                                                                                                                                                                                                                  Python doticonstar image 404 doticonVersion:Currentdoticon
                                                                                                                                                                                                                                                  License: Weak Copyleft (MPL-2.0)

                                                                                                                                                                                                                                                  semi supervised guided topic model with custom guidedLDA

dynamic-nmf by derekgreene

Python | Stars: 239 | Version: Current
License: Permissive (Apache-2.0)

Dynamic Topic Modeling via Non-negative Matrix Factorization

topics by vladsandulescu

Python | Stars: 158 | Version: Current
License: Permissive (Apache-2.0)

Topic modeling with gensim and LDA
