Fast_Sentence_Embeddings
inltk
Our great sponsors
Fast_Sentence_Embeddings | inltk | |
---|---|---|
2 | 1 | |
603 | 809 | |
- | - | |
0.0 | 1.8 | |
about 1 year ago | over 1 year ago | |
Jupyter Notebook | Python | |
GNU General Public License v3.0 only | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Fast_Sentence_Embeddings
-
You probably shouldn't use OpenAI's embeddings
You can find some comparisons and evaluation datasets/tasks here: https://www.sbert.net/docs/pretrained_models.html
Generally MiniLM is a good baseline. For faster models you want this library:
https://github.com/oborchers/Fast_Sentence_Embeddings
For higher quality ones, just take the bigger/slower models in the SentenceTransformers library
-
[D] Unsupervised document similarity state of the art
Links: fse: https://github.com/oborchers/Fast_Sentence_Embeddings Sentence-transformers: https://github.com/oborchers/sentence-transformers
inltk
We haven't tracked posts mentioning inltk yet.
Tracking mentions began in Dec 2020.
What are some alternatives?
allennlp - An open-source NLP research library, built on PyTorch.
DiffCSE - Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"
smaller-labse - Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE
gensim - Topic Modelling for Humans
clip-as-service - 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
SimCSE - [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
cso-classifier - Python library that classifies content from scientific papers with the topics of the Computer Science Ontology (CSO).
RecSys_Course_AT_PoliMi - This is the official repository for the Recommender Systems course at Politecnico di Milano.
kgtk - Knowledge Graph Toolkit
sentence-transformers - Sentence Embeddings with BERT & XLNet
wembedder - Wikidata embedding