InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →
Top 13 Python topic-modeling Projects
-
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
-
-
contextualized-topic-models
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).
-
OCTIS
OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)
-
corex_topic
Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx
-
Sevalla
Deploy and host your apps and databases, now with $50 credit! Sevalla is the PaaS you have been looking for! Advanced deployment pipelines, usage-based pricing, preview apps, templates, human support by developers, and much more!
-
-
embedded-topic-model
A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM
-
-
Auto-Research
Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!
-
-
jouresearch-nlp
A python package for generating topics, named entities and a wordcloud visualization. It leverages the SpaCy framework and sentence transformers.
Python topic-modeling discussion
Python topic-modeling related posts
-
[D] Is it better to create a different set of Doc2Vec embeddings for each group in my dataset, rather than generating embeddings for the entire dataset?
-
Aggregating news from different sources
-
how can a top2vec output be improved
-
Tips for best Top2Vec (HDBSCAN) usage
-
[Project]Topic modelling of tweets from the same user
-
SBERT Embeddings from Conversations
-
Sentence transformers (BERTopic) on a Macbook Air
-
A note from our sponsor - InfluxDB
www.influxdata.com | 1 Sep 2025
Index
What are some of the best open-source topic-modeling projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | gensim | 16,157 |
2 | BERTopic | 6,996 |
3 | Top2Vec | 3,076 |
4 | scattertext | 2,311 |
5 | contextualized-topic-models | 1,242 |
6 | OCTIS | 775 |
7 | corex_topic | 635 |
8 | GuidedLDA | 510 |
9 | embedded-topic-model | 95 |
10 | GitModel | 61 |
11 | Auto-Research | 58 |
12 | cusim | 45 |
13 | jouresearch-nlp | 3 |