scattertext
texthero
Our great sponsors
scattertext | texthero | |
---|---|---|
3 | 1 | |
2,194 | 2,857 | |
- | - | |
4.7 | 4.5 | |
23 days ago | 7 months ago | |
Python | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
scattertext
- [Data] Principali parole degli ultimi (circa) 200 post sul sub
-
Alternate approaches to TF-IDF?
Other suggestions: Take a look at Scattertext. Compare keywords to the problem of aspect extraction. I think an underutilized way to look at textual data when you have a single group of interest is the word-frequency-based odds ratio.
texthero
What are some alternatives?
BERTopic - Leveraging BERT and c-TF-IDF to create easily interpretable topics.
KeyBERT - Minimal keyword extraction with BERT
word_cloud - A little word cloud generator in Python
stopwords-it - Italian stopwords collection
shifterator - Interpretable data visualizations for understanding how texts differ at the word level
lit - The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
yake - Single-document unsupervised keyword extraction
faiss - A library for efficient similarity search and clustering of dense vectors.
dutch-word-embeddings - Dutch word embeddings, trained on a large collection of Dutch social media messages and news/blog/forum posts.
guietta
textshot - Python tool for grabbing text via screenshot
pywal - 🎨 Generate and change color-schemes on the fly.