Our great sponsors
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
I asked in a previous post for advice about how to find insight in unstructured text data. Almost everyone recommended BERTopic, but I wasn't able to run BERTopic on my machine locally (segmentation fault). Fortunately, I found Top2Vec, which uses HBDSCAN and UMAP to quickly find good topics in uncleaned(!) text data.
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.
Related posts
- [D] Is it better to create a different set of Doc2Vec embeddings for each group in my dataset, rather than generating embeddings for the entire dataset?
- Top2Vec: Embed topics, documents and word vectors
- How to cluster articles about software vulnerabilities?
- Ciencia de Dados - Classificacao de texto
- Extracting topics from 250k facebook posts