Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top2Vec Alternatives
Similar projects and alternatives to Top2Vec
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
contextualized-topic-models
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a better Top2Vec alternative or higher similarity.
Top2Vec reviews and mentions
Posts with mentions or reviews of Top2Vec.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-11-30.
-
[D] Is it better to create a different set of Doc2Vec embeddings for each group in my dataset, rather than generating embeddings for the entire dataset?
I'm using Top2Vec with Doc2Vec embeddings to find topics in a dataset of ~4000 social media posts. This dataset has three groups:
-
Tips for best Top2Vec (HDBSCAN) usage
I asked in a previous post for advice about how to find insight in unstructured text data. Almost everyone recommended BERTopic, but I wasn't able to run BERTopic on my machine locally (segmentation fault). Fortunately, I found Top2Vec, which uses HBDSCAN and UMAP to quickly find good topics in uncleaned(!) text data.
- How can I group domain specific keywords based on their word embeddings?
-
Introducing the Semantic Graph
A number of excellent topic modeling libraries exist in Python today. BERTopic and Top2Vec are two of the most popular. Both use sentence-transformers to encode data into vectors, UMAP for dimensionality reduction and HDBSCAN to cluster nodes.
- Top2Vec: Embed topics, documents and word vectors
- How to cluster articles about software vulnerabilities?
- Ciencia de Dados - Classificacao de texto
-
Extracting topics from 250k facebook posts
Since you already have the facebook posts, you can use top2vec https://github.com/ddangelov/Top2Vec
- [D] Good algorithm for clustering big data (sentences represented as embeddings)?
-
SOTA for Topic Modeling
Here's an implementation that uses UMAP and HDBSCAN: https://github.com/ddangelov/Top2Vec but you could use a semi-supervised algorithm in the clustering step if you wanted specific topics.
-
A note from our sponsor - InfluxDB
www.influxdata.com | 4 May 2024
Stats
Basic Top2Vec repo stats
13
2,843
7.0
6 months ago
ddangelov/Top2Vec is an open source project licensed under BSD 3-clause "New" or "Revised" License which is an OSI approved license.
The primary programming language of Top2Vec is Python.
Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com