scattertext vs KeyBERT

scattertext

Beautiful visualizations of how language differs among document types. (by JasonKessler)

KeyBERT

Minimal keyword extraction with BERT (by MaartenGr)

keyword-extraction keyphrase-extraction Bert mmr

maartengr.github.io

Project	scattertext	KeyBERT
Mentions	3	5
Stars	2,197	3,213
Growth	-	-
Activity	4.7	6.1
Latest Commit	about 2 months ago	about 1 month ago
Language	Python	Python
License	Apache License 2.0	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

scattertext

Posts with mentions or reviews of scattertext. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-04-27.

Clustering of text - Where to start?
1 project | /r/LanguageTechnology | 4 Aug 2021

If what you want is to determine how similar two categories are, or to learn something about the structure or words that compose those categories, you might consider word shift graphs or Scattertext.
[Data] Top words from the last (approximately) 200 posts on the sub
4 projects | /r/italy | 27 Apr 2021
Alternate approaches to TF-IDF?
4 projects | /r/LanguageTechnology | 14 Mar 2021

Other suggestions: Take a look at Scattertext. Compare keywords to the problem of aspect extraction. I think an underutilized way to look at textual data when you have a single group of interest is the word-frequency-based odds ratio.
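The word-frequency-based odds ratio mentioned above can be sketched in a few lines. This is only an illustration under stated assumptions: whitespace tokenization, additive smoothing of 0.5, and the function name `log_odds_ratio` are choices made here, not something the quoted post or Scattertext specifies, and the sample documents are invented.

```python
import math
from collections import Counter


def log_odds_ratio(group_docs, other_docs, smoothing=0.5):
    """Return {word: log odds ratio} for the group of interest vs. the rest.

    Hypothetical helper: tokenization and smoothing are assumptions.
    """
    group_counts = Counter(w for d in group_docs for w in d.lower().split())
    other_counts = Counter(w for d in other_docs for w in d.lower().split())
    group_total = sum(group_counts.values())
    other_total = sum(other_counts.values())

    scores = {}
    for word in set(group_counts) | set(other_counts):
        # Odds of the word = (its count) / (count of every other word),
        # with smoothing so unseen words do not produce a division by zero.
        odds_group = (group_counts[word] + smoothing) / (
            group_total - group_counts[word] + smoothing
        )
        odds_other = (other_counts[word] + smoothing) / (
            other_total - other_counts[word] + smoothing
        )
        scores[word] = math.log(odds_group / odds_other)
    return scores


# Invented example data: complaints are the group of interest.
group = ["refund never arrived", "refund request ignored"]
other = ["great product fast shipping", "product arrived fast"]

scores = log_odds_ratio(group, other)
top_words = sorted(scores, key=scores.get, reverse=True)[:3]
print(top_words)
```

Positive scores mark words that are over-represented in the group of interest, negative scores mark words typical of the comparison set, and scores near zero mark words used about equally in both.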

KeyBERT

Posts with mentions or reviews of KeyBERT. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-04-28.

I want to extract important keywords from large documents...
1 project | /r/LangChain | 7 Dec 2023

Use something else, like KeyBERT (https://github.com/MaartenGr/KeyBERT) or BERTopic. It's much faster.
[D]: Predict the most probable document including the answer to a given question
3 projects | /r/MachineLearning | 28 Apr 2022

Use keyword similarity with KeyBERT (https://github.com/MaartenGr/KeyBERT), i.e. extract keywords for each of the given documents and compare them to the keywords of the question.
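One way to read that suggestion is to rank documents by how much their keywords overlap with the question's keywords. The sketch below assumes the keyword lists have already been extracted (for example with a keyword extractor such as KeyBERT) and uses hand-written placeholder lists instead. Jaccard overlap is an assumption made for illustration; the post does not name a similarity measure, and `rank_documents` is a hypothetical helper.

```python
def jaccard(a, b):
    """Jaccard similarity of two keyword collections (0.0 if both are empty)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def rank_documents(question_keywords, doc_keywords):
    """Return (doc_id, score) pairs sorted from most to least similar."""
    scored = [
        (doc_id, jaccard(question_keywords, keywords))
        for doc_id, keywords in doc_keywords.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Placeholder keywords standing in for the output of a keyword extractor.
doc_keywords = {
    "contract.txt": ["lease", "tenant", "deposit"],
    "recipe.txt": ["flour", "oven", "butter"],
}
question_keywords = ["tenant", "deposit", "refund"]

ranking = rank_documents(question_keywords, doc_keywords)
print(ranking)
```

Exact-match overlap ignores synonyms, so comparing keyword embeddings instead of raw strings may work better when the question and the documents use different vocabulary.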
BERT execution time
1 project | /r/AskProgramming | 12 Jan 2022

Would anyone know an equation or a general rule of thumb for how long it would take this BERT algorithm (KeyBERT: https://github.com/MaartenGr/KeyBERT) to select n keywords from a string of character length m on a GPU of certain relevant specs?
[P] Building model to extract keywords from legal documents
5 projects | /r/MachineLearning | 24 Aug 2021

Look into rake, pke, phrasemachine, pyate, keybert.
Alternate approaches to TF-IDF?
4 projects | /r/LanguageTechnology | 14 Mar 2021

What are some alternatives?

When comparing scattertext and KeyBERT you can also consider the following projects:

BERTopic - Leveraging BERT and c-TF-IDF to create easily interpretable topics.

yake - Single-document unsupervised keyword extraction

stopwords-it - Italian stopwords collection

RAKE-tutorial - A python implementation of the Rapid Automatic Keyword Extraction

word_cloud - A little word cloud generator in Python

flashtext - Extract Keywords from sentence or Replace keywords in sentences.

shifterator - Interpretable data visualizations for understanding how texts differ at the word level

pke - Python Keyphrase Extraction module

lit - The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

faiss - A library for efficient similarity search and clustering of dense vectors.

spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
