Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 19 topic-modeling Open-Source Projects
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
contextualized-topic-models
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
-
OCTIS
OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
corex_topic
Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx
-
embedded-topic-model
A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM
-
Auto-Research
Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!
-
jouresearch-nlp
A python package for generating topics, named entities and a wordcloud visualization. It leverages the SpaCy framework and sentence transformers.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Try experimenting with different hyperparameters, clustering algorithms and embedding representations. Try https://github.com/MaartenGr/BERTopic/tree/master/bertopic
Project mention: [D] Is it better to create a different set of Doc2Vec embeddings for each group in my dataset, rather than generating embeddings for the entire dataset? | /r/MachineLearning | 2023-10-28I'm using Top2Vec with Doc2Vec embeddings to find topics in a dataset of ~4000 social media posts. This dataset has three groups:
Project mention: Owl project (OCaml scientific computing) formally concluded | news.ycombinator.com | 2024-02-19
topic-modeling related posts
- Owl project (OCaml scientific computing) formally concluded
- [D] Is it better to create a different set of Doc2Vec embeddings for each group in my dataset, rather than generating embeddings for the entire dataset?
- Aggregating news from different sources
- how can a top2vec output be improved
- Tips for best Top2Vec (HDBSCAN) usage
- [Project]Topic modelling of tweets from the same user
- SBERT Embeddings from Conversations
-
A note from our sponsor - InfluxDB
www.influxdata.com | 25 Apr 2024
Index
What are some of the best open-source topic-modeling projects? This list will help you:
Project | Stars | |
---|---|---|
1 | gensim | 15,236 |
2 | BERTopic | 5,543 |
3 | Top2Vec | 2,839 |
4 | scattertext | 2,197 |
5 | owl | 1,178 |
6 | contextualized-topic-models | 1,157 |
7 | OCTIS | 681 |
8 | corex_topic | 622 |
9 | LDAvis | 551 |
10 | GuidedLDA | 494 |
11 | converse | 176 |
12 | TopMost | 139 |
13 | BTM | 88 |
14 | stripnet | 85 |
15 | embedded-topic-model | 82 |
16 | GitModel | 60 |
17 | Auto-Research | 47 |
18 | cusim | 40 |
19 | jouresearch-nlp | 3 |
Sponsored