KeyBERT
bert_score
Metric | KeyBERT | bert_score
---|---|---
Mentions | 5 | 1
Stars | 3,217 | 1,426
Growth | - | -
Activity | 6.1 | 0.0
Last commit | about 2 months ago | 23 days ago
Language | Python | Jupyter Notebook
License | MIT License | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
KeyBERT
- I want to extract important keywords from large documents...
  Use something else like KeyBERT or BERTopic: https://github.com/MaartenGr/KeyBERT It's much faster.
- [D]: Predict the most probable document including the answer to a given question
  Using keyword similarity using KeyBERT: https://github.com/MaartenGr/KeyBERT (i.e. loading keywords for each of the given documents and comparing them to the keywords of the question)
- BERT execution time
  Would anyone know an equation or a general rule of thumb for how long it would take this BERT algorithm (KeyBERT: https://github.com/MaartenGr/KeyBERT) to select n keywords from a string of character length m on a GPU of certain relevant specs?
- [P] Building model to extract keywords from legal documents
  Look into rake, pke, phrasemachine, pyate, keybert.
- Alternate approaches to TF-IDF?
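
One of the mentions above suggests extracting keywords for each document and comparing them to the keywords of the question. The following is a minimal sketch of that idea, not the poster's actual code. The names `keyword_set`, `rank_documents`, and `naive_extract` are hypothetical, and Jaccard overlap is an assumed similarity measure that the post does not specify. The extractor is passed in as a function so the sketch runs without any model. A commented line shows how KeyBERT's `extract_keywords` could be plugged in instead.

```python
from typing import Callable, List, Set, Tuple

# An extractor maps a text to a list of keyword strings.
Extractor = Callable[[str], List[str]]


def naive_extract(text: str) -> List[str]:
    """Stand-in extractor (assumption): keep lowercased words longer than 3 characters."""
    words = [w.strip(".,?!:;()").lower() for w in text.split()]
    return [w for w in words if len(w) > 3]


def keyword_set(text: str, extract: Extractor) -> Set[str]:
    """Return the lowercased keyword set of a text."""
    return {kw.lower() for kw in extract(text)}


def rank_documents(
    question: str, documents: List[str], extract: Extractor
) -> List[Tuple[int, float]]:
    """Rank documents by Jaccard overlap between their keywords and the question's.

    Returns (document index, score) pairs, best match first.
    """
    question_kws = keyword_set(question, extract)
    scored = []
    for idx, doc in enumerate(documents):
        doc_kws = keyword_set(doc, extract)
        union = question_kws | doc_kws
        score = len(question_kws & doc_kws) / len(union) if union else 0.0
        scored.append((idx, score))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# With KeyBERT installed, an extractor could be built like this
# (extract_keywords returns (keyword, score) tuples for a single document):
#   from keybert import KeyBERT
#   kw_model = KeyBERT()
#   extract = lambda text: [kw for kw, _ in kw_model.extract_keywords(text, top_n=10)]

if __name__ == "__main__":
    docs = [
        "The court ruled on the patent infringement case.",
        "Bread dough needs yeast and warm water.",
    ]
    for idx, score in rank_documents(
        "Which document discusses patent infringement?", docs, naive_extract
    ):
        print(idx, round(score, 3))
```

Exact keyword overlap misses synonyms and paraphrases, so comparing embeddings of the keywords may work better in practice; the set-based version is only meant to show the shape of the approach.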
bert_score
What are some alternatives?
yake - Single-document unsupervised keyword extraction
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
RAKE-tutorial - A python implementation of the Rapid Automatic Keyword Extraction
nlpaug - Data augmentation for NLP
flashtext - Extract Keywords from sentence or Replace keywords in sentences.
CodeSearchNet - Datasets, tools, and benchmarks for representation learning of code.
pke - Python Keyphrase Extraction module
Data-science - Collection of useful data science topics along with articles, videos, and code
faiss - A library for efficient similarity search and clustering of dense vectors.
Made-With-ML - Learn how to design, develop, deploy and iterate on production-grade ML applications.
FLAML - A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.