M
Mining_Of_Massive_Datasetsby ryancheunggit
Python 11 Version:Current License: No License (No License)
Sketch code along the reading of Mining Massive Datasets
Support
Quality
Security
License
Reuse
Graph clustering and Node embeddings with word2vec
Support
Quality
Security
License
Reuse
基于维基百科语料,使用 gensim 的 word2vec 来训练词向量
Support
Quality
Security
License
Reuse
Superfast CUDA implementation of Word2Vec and Latent Dirichlet Allocation (LDA)
Support
Quality
Security
License
Reuse
重构论文A Biterm Topic Model for Short Texts提供的源代码,编译成一个python 扩展模块,并用python 包装了一下,提供一个user-friendly python package
Support
Quality
Security
License
Reuse
Code for topic model evaluation experiments for Scott & Baldridge 2013, AISTATS
Support
Quality
Security
License
Reuse
Handy Jupyter Notebooks that I use in for Topic Modeling. Including text mining from PDF files, text preprocessing, Latent Dirichlet Allocation (LDA), hyperparameters grid search and Topic Modeling visualiation.
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
Code for the CIKM 2013 paper "Discovering Coherent Topics Using General Knowledge"
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
An alternative approach for probabilistic topic modeling based on agglomerative clustering of topics (not documents)
Support
Quality
Security
License
Reuse
Latent Dirichlet Allocation for Topic Modelling
Support
Quality
Security
License
Reuse
An Online Latent Dirichlet Allocation with Infinite Vocabulary implementation in Python.
Support
Quality
Security
License
Reuse
RhymerFinder predicts the rhymes in a song based on preceding lyrics by using gensim's Word2Vec implementation.
Support
Quality
Security
License
Reuse
Crowdsourced Time-sync Video Tagging using Temporal and Personalized Topic Modeling
Support
Quality
Security
License
Reuse
Word2vec in gensim and Tensorflow
Support
Quality
Security
License
Reuse
Greedy Adaptive Dictionary (GAD) is a learning algorithm that sets out to find sparse atoms for speech signals.
Support
Quality
Security
License
Reuse
Spatio-temporal pattern contruct and model fusion
Support
Quality
Security
License
Reuse
Python implementation of Sap et al.'s gender prediction algorithm for Twitter.
Support
Quality
Security
License
Reuse
Course Samples
Support
Quality
Security
License
Reuse
1、结合opencv,利用特征提取方法(LDA LBP PCA)进行特征提取建立模型库;2、利用电脑摄像头进行拍照,每隔3秒提取一个正面照进行特征提取,然后与模型库中的样本进行余弦距离相似度计算,实现人脸匹配识别
Support
Quality
Security
License
Reuse
Adds browsing users to topics
Support
Quality
Security
License
Reuse
This is the central repository for literature on and applications of the topic modeling methodology. This page was specifically designed for the professional development workshop (PDW) on topic modeling that have taken place at the 2017 and 2018 iterations of the annual Academy of Management Meeting, but is open to anyone interested.
Support
Quality
Security
License
Reuse
Surface and foil modeling via CST method
Support
Quality
Security
License
Reuse
Unsupervised model for clustering and summarisation of Polish news articles.
Support
Quality
Security
License
Reuse
A bayesian hierarchical topic model for texts in R. The package implements Grimmer (2010).
Support
Quality
Security
License
Reuse
word2vec wordembedding embedding google
Support
Quality
Security
License
Reuse
Topic Modeling using LDA and NMF in Python
Support
Quality
Security
License
Reuse
Structural Topic Modeling of the Facebook posts of NC State Senators
Support
Quality
Security
License
Reuse
golang实现的消息队列服务,支持消费主题(topic)以及消费者分组(group)
Support
Quality
Security
License
Reuse
Implementation of a Latent Concept Topic Model (LCTM).
Support
Quality
Security
License
Reuse
Text similarity based on Word2Vec vectors.
Support
Quality
Security
License
Reuse
Implements several Markov chain Monte Carlo (MCMC) algorithms for the latent Dirichlet allocation (LDA) model
Support
Quality
Security
License
Reuse
s
short_text_topic_modelingby Matyyas
Jupyter Notebook 10 Version:Current License: No License (No License)
Short Text Topic Modeling notebook example
Support
Quality
Security
License
Reuse
Application of topic models for topic extraction and similarity search
Support
Quality
Security
License
Reuse
L
LDA_Viblo_Recommender_Systemby huyhoang17
Jupyter Notebook 10 Version:Current License: Permissive (MIT)
Simple Recommender System for Viblo Website using LDA (Latent Dirichlet Allocation)
Support
Quality
Security
License
Reuse
p
poincare-embedding-using-gensimby harmanpreet93
Jupyter Notebook 10 Version:Current License: No License (No License)
Train poincare embedding using gensim
Support
Quality
Security
License
Reuse
Machine learning models on different topics and competitions
Support
Quality
Security
License
Reuse
Fun with Game of Thrones word embeddings
Support
Quality
Security
License
Reuse
Interactive textbook on state space models
Support
Quality
Security
License
Reuse
Topic models for microblogging content
Support
Quality
Security
License
Reuse
In PAKDD 2015. Incorporating Probabilistic Knowledge into Topic Models
Support
Quality
Security
License
Reuse
LDA主题模型Gibbs采样并行实现
Support
Quality
Security
License
Reuse
Code for Short Text Topic Modeling with Flexible Word Patterns
Support
Quality
Security
License
Reuse
L
Landau-Lifshitz-Gilbert-ODE-modelby davidshepherd7
Python 9 Version:Current License: Strong Copyleft (GPL-3.0)
Tools and a simple model for the spatially constant LLG equation
Support
Quality
Security
License
Reuse
Large-scale topic discovery with Sampled-MinHashing
Support
Quality
Security
License
Reuse
Implementation of Disjoint Author-Document Topic Model
Support
Quality
Security
License
Reuse
This is chat bot which is based on term frequency and inverse document frequency and uses cosine similarity to calculate the same.
Support
Quality
Security
License
Reuse
M
Mining_Of_Massive_Datasetsby ryancheunggit
Sketch code along the reading of Mining Massive Datasets
Python 11Updated: 6 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
u
url2vecby chrisPiemonte
Graph clustering and Node embeddings with word2vec
Python 11Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
w
word2vecby frendyxzc
基于维基百科语料,使用 gensim 的 word2vec 来训练词向量
Python 11Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
c
cusimby js1010
Superfast CUDA implementation of Word2Vec and Latent Dirichlet Allocation (LDA)
Python 11Updated: 3 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
b
biterm-topic-modelby liuzhenhai93
重构论文A Biterm Topic Model for Short Texts提供的源代码,编译成一个python 扩展模块,并用python 包装了一下,提供一个user-friendly python package
C++ 11Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
t
topicmodel-evalby utcompling
Code for topic model evaluation experiments for Scott & Baldridge 2013, AISTATS
Scala 11Updated: 7 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
t
topic-modellingby storopoli
Handy Jupyter Notebooks that I use in for Topic Modeling. Including text mining from PDF files, text preprocessing, Latent Dirichlet Allocation (LDA), hyperparameters grid search and Topic Modeling visualiation.
Jupyter Notebook 11Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
A
ArchPyby randlab
Python 11Updated: 2 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
l
leto-modelizer-plugin-coreby ditrit
JavaScript 11Updated: 2 y ago License: Weak Copyleft (MPL-2.0)
Support
Quality
Security
License
Reuse
p
posteriordb-pythonby stan-dev
Python 11Updated: 2 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
G
GKLDAby czyuan
Code for the CIKM 2013 paper "Discovering Coherent Topics Using General Knowledge"
Java 10Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
E
ETMby qiang2100
Java 10Updated: 5 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
T
TopicGrouperJby pfeiferd
An alternative approach for probabilistic topic modeling based on agglomerative clustering of topics (not documents)
Java 10Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
l
ldaby nkoilada
Latent Dirichlet Allocation for Topic Modelling
Python 10Updated: 6 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
P
PyInfVocby kzhai
An Online Latent Dirichlet Allocation with Infinite Vocabulary implementation in Python.
Python 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
r
rhymerfinderby johnchuckcase
RhymerFinder predicts the rhymes in a song based on preceding lyrics by using gensim's Word2Vec implementation.
Python 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
T
TPTMby Wind-Ward
Crowdsourced Time-sync Video Tagging using Temporal and Personalized Topic Modeling
Python 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
w
word2vec_tutorialby hadifar
Word2vec in gensim and Tensorflow
Python 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
G
Greedy-Adaptive-Dictionaryby DavideNardone
Greedy Adaptive Dictionary (GAD) is a learning algorithm that sets out to find sparse atoms for speech signals.
Python 10Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
T
TrackVizby ahangchen
Spatio-temporal pattern contruct and model fusion
Python 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
T
TwitterGenderPredictorby jtwool
Python implementation of Sap et al.'s gender prediction algorithm for Twitter.
Python 10Updated: 3 y ago License: Weak Copyleft (MPL-2.0)
Support
Quality
Security
License
Reuse
D
DataScience_OttawaU_2019by rahgoar
Course Samples
Python 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
f
face-recognize-by-comeraby fish-kong
1、结合opencv,利用特征提取方法(LDA LBP PCA)进行特征提取建立模型库;2、利用电脑摄像头进行拍照,每隔3秒提取一个正面照进行特征提取,然后与模型库中的样本进行余弦距离相似度计算,实现人脸匹配识别
Python 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
n
nodebb-plugin-browsing-usersby barisusakli
Adds browsing users to topics
JavaScript 10Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
t
topicmodelingby RFJHaans
This is the central repository for literature on and applications of the topic modeling methodology. This page was specifically designed for the professional development workshop (PDW) on topic modeling that have taken place at the 2017 and 2018 iterations of the annual Academy of Management Meeting, but is open to anyone interested.
JavaScript 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
c
cst-modeling3dby swayli94
Surface and foil modeling via CST method
Python 10Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
N
News_Selectorby jkubajek
Unsupervised model for clustering and summarisation of Polish news articles.
R 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
E
ExpAgendaby christophergandrud
A bayesian hierarchical topic model for texts in R. The package implements Grimmer (2010).
R 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
w
word2vec-googleby zhyq
word2vec wordembedding embedding google
C 10Updated: 5 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
t
topic_modelingby ravishchawla
Topic Modeling using LDA and NMF in Python
HTML 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
N
NCStateSenateFacebookby wesslen
Structural Topic Modeling of the Facebook posts of NC State Senators
HTML 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
simpleqby wenzuojing
golang实现的消息队列服务,支持消费主题(topic)以及消费者分组(group)
Go 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
L
LCTMby weihua916
Implementation of a Latent Concept Topic Model (LCTM).
C++ 10Updated: 5 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
p
pio-template-text-similarityby goliasz
Text similarity based on Word2Vec vectors.
Scala 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
l
ldamcmcby clintpgeorge
Implements several Markov chain Monte Carlo (MCMC) algorithms for the latent Dirichlet allocation (LDA) model
C++ 10Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
s
short_text_topic_modelingby Matyyas
Short Text Topic Modeling notebook example
Jupyter Notebook 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
n
nlp-topic-modelsby goerlitz
Application of topic models for topic extraction and similarity search
Jupyter Notebook 10Updated: 4 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
L
LDA_Viblo_Recommender_Systemby huyhoang17
Simple Recommender System for Viblo Website using LDA (Latent Dirichlet Allocation)
Jupyter Notebook 10Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
p
poincare-embedding-using-gensimby harmanpreet93
Train poincare embedding using gensim
Jupyter Notebook 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
M
Modelsby devforfu
Machine learning models on different topics and competitions
Jupyter Notebook 10Updated: 5 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
g
got-word-embeddingsby jctestud
Fun with Game of Thrones word embeddings
Jupyter Notebook 10Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
ssm-bookby ssm-jax
Interactive textbook on state space models
Jupyter Notebook 10Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
t
ttmby smutahoang
Topic models for microblogging content
Java 9Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
P
ProbaseLDAby yao8839836
In PAKDD 2015. Incorporating Probabilistic Knowledge into Topic Models
Java 9Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
P
ParallelGibbsLdaby fishermanff
LDA主题模型Gibbs采样并行实现
Java 9Updated: 5 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
M
Multiterm-Topic-Modelby BobXWu
Code for Short Text Topic Modeling with Flexible Word Patterns
Java 9Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
L
Landau-Lifshitz-Gilbert-ODE-modelby davidshepherd7
Tools and a simple model for the spatially constant LLG equation
Python 9Updated: 4 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
S
SMH-Topic-Discoveryby gibranfp
Large-scale topic discovery with Sampled-MinHashing
Python 9Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
D
DADTby webis-de
Implementation of Disjoint Author-Document Topic Model
Python 9Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
r
retrieval-based-chatbotby vaibhavgeek
This is chat bot which is based on term frequency and inverse document frequency and uses cosine similarity to calculate the same.
Python 9Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse