Python topic-modeling

Open-source Python projects categorized as topic-modeling

Top 13 Python topic-modeling Projects

  • gensim

    Topic Modelling for Humans

  • Project mention: Aggregating news from different sources | /r/learnprogramming | 2023-07-08

  • BERTopic

    Leveraging BERT and c-TF-IDF to create easily interpretable topics.

  • Project mention: how can a top2vec output be improved | /r/learnmachinelearning | 2023-07-04

    Try experimenting with different hyperparameters, clustering algorithms and embedding representations. Try https://github.com/MaartenGr/BERTopic/tree/master/bertopic

  • Top2Vec

    Top2Vec learns jointly embedded topic, document and word vectors.

  • Project mention: [D] Is it better to create a different set of Doc2Vec embeddings for each group in my dataset, rather than generating embeddings for the entire dataset? | /r/MachineLearning | 2023-10-28

    I'm using Top2Vec with Doc2Vec embeddings to find topics in a dataset of ~4000 social media posts. This dataset has three groups:

  • scattertext

    Beautiful visualizations of how language differs among document types.

  • contextualized-topic-models

    A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.

  • OCTIS

    OCTIS: Comparing Topic Models is Simple! A Python package to optimize and evaluate topic models (accepted at the EACL 2021 demo track).

  • corex_topic

    Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx

  • GuidedLDA

    Semi-supervised guided topic model with custom GuidedLDA (by vi3k6i5)

  • embedded-topic-model

    A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM

  • GitModel

    machine code+i git matrix + user

  • Auto-Research

    Generate a custom, detailed survey paper with topic-clustered sections and proper citations from a single query in just under 30 minutes.

  • cusim

    Superfast CUDA implementation of Word2Vec and Latent Dirichlet Allocation (LDA)

  • jouresearch-nlp

    A Python package for generating topics, named entities, and a word cloud visualization. It leverages the spaCy framework and sentence transformers.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source topic-modeling projects in Python? This list will help you:

Rank Project Stars
1 gensim 15,212
2 BERTopic 5,543
3 Top2Vec 2,839
4 scattertext 2,197
5 contextualized-topic-models 1,157
6 OCTIS 681
7 corex_topic 622
8 GuidedLDA 494
9 embedded-topic-model 82
10 GitModel 60
11 Auto-Research 47
12 cusim 40
13 jouresearch-nlp 3
