Our great sponsors
-
SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
Base BERT sentence embeddings are just not good for a couple of reasons and there's some research papers that show this. You can try SimCSE, Google's USE or SBERT as mentioned previously and you'll get better output. It's just an inherent flaw to base BERT that it can't produce good sentence embeddings. Papers have shown you probably will get better scores using GloVe embeddings from scratch than base BERT.
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.