State of the Art in Sentence Embeddings

This page summarizes the projects mentioned and recommended in the original post on /r/LanguageTechnology.

  • SimCSE

    [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

  • To answer your question about the sentence embedding SOTA: it is not S-BERT, and hasn't been for a while. SimCSE officially takes the crown, since it has been presented at a conference, though according to the Papers with Code benchmark leaderboard, other papers on arXiv, such as DCPCSE, report higher performance on STS and similar tasks. Having tried both for my use case, I found SimCSE to be better, but YMMV.

  • PromCSE

    Code for "Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning (EMNLP 2022)"

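
For context on what "higher performance on STS" usually means: STS results are commonly reported as the Spearman correlation between the cosine similarity of each sentence pair's embeddings and human similarity ratings. The sketch below illustrates that scoring with the standard library only. The function names (`cosine_similarity`, `ranks`, `spearman`, `sts_score`) and the toy vectors are hypothetical placeholders, not part of SimCSE or PromCSE; in practice the embeddings would come from whichever model is being compared.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    rx, ry = ranks(xs), ranks(ys)
    mean_x = sum(rx) / len(rx)
    mean_y = sum(ry) / len(ry)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in rx))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in ry))
    return cov / (std_x * std_y)


def sts_score(embedding_pairs, gold_scores):
    """Score a model on an STS-style set of (embedding, embedding) pairs."""
    predicted = [cosine_similarity(u, v) for u, v in embedding_pairs]
    return spearman(predicted, gold_scores)


# Toy stand-ins for model output (assumption: real embeddings would have
# hundreds of dimensions and come from the model under evaluation).
toy_pairs = [
    ([1.0, 0.0], [1.0, 0.0]),  # identical direction
    ([1.0, 0.0], [1.0, 1.0]),  # partially related
    ([1.0, 0.0], [0.0, 1.0]),  # orthogonal
]
toy_gold = [5.0, 3.0, 0.5]  # made-up human ratings on a 0-5 scale

score = sts_score(toy_pairs, toy_gold)
```

Because the metric depends only on the ordering of the similarities, two models can rank differently on a leaderboard and on a particular downstream use case, which fits the "YMMV" caveat above.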
