The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Top 13 Python topic-modeling Projects
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
contextualized-topic-models
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
-
OCTIS
OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)
-
corex_topic
Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
embedded-topic-model
A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM
-
Auto-Research
Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!
-
jouresearch-nlp
A python package for generating topics, named entities and a wordcloud visualization. It leverages the SpaCy framework and sentence transformers.
Try experimenting with different hyperparameters, clustering algorithms and embedding representations. Try https://github.com/MaartenGr/BERTopic/tree/master/bertopic
Project mention: [D] Is it better to create a different set of Doc2Vec embeddings for each group in my dataset, rather than generating embeddings for the entire dataset? | /r/MachineLearning | 2023-10-28I'm using Top2Vec with Doc2Vec embeddings to find topics in a dataset of ~4000 social media posts. This dataset has three groups:
Python topic-modeling related posts
- [D] Is it better to create a different set of Doc2Vec embeddings for each group in my dataset, rather than generating embeddings for the entire dataset?
- Aggregating news from different sources
- how can a top2vec output be improved
- Tips for best Top2Vec (HDBSCAN) usage
- [Project]Topic modelling of tweets from the same user
- SBERT Embeddings from Conversations
- Sentence transformers (BERTopic) on a Macbook Air
-
A note from our sponsor - WorkOS
workos.com | 24 Apr 2024
Index
What are some of the best open-source topic-modeling projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | gensim | 15,212 |
2 | BERTopic | 5,543 |
3 | Top2Vec | 2,839 |
4 | scattertext | 2,197 |
5 | contextualized-topic-models | 1,157 |
6 | OCTIS | 681 |
7 | corex_topic | 622 |
8 | GuidedLDA | 494 |
9 | embedded-topic-model | 82 |
10 | GitModel | 60 |
11 | Auto-Research | 47 |
12 | cusim | 40 |
13 | jouresearch-nlp | 3 |
Sponsored