SentenceTransformers: Python framework for sentence, text and image embeddings

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

fast_vector_similarity

7 323 7.2 Rust

The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.

Yes, check out my library for vector similarity that has various other measures which are more discriminative:
https://github.com/Dicklesworthstone/fast_vector_similarity
pip install fast_vector_similarity

LASER

5 3,520 7.0 Jupyter Notebook

Language-Agnostic SEntence Representations

I'm curious how people are handling multi-lingual embeddings.
I've found LASER[1] which originally had the idea to embed all languages in the same vector space, though it's a bit harder to use than models available through SentenceTransformers. LASER2 stuck with this approach, but LASER3 switched to language-specific models. However, I haven't found benchmarks for these models, and they were released about 2 years ago.
Another alternative would be to translate everything before embedding, which would introduce some amount of error, though maybe it wouldn't be significant.
1. https://github.com/facebookresearch/LASER

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Quick tip: Write numpy arrays directly to the SingleStore VECTOR data type

1 project | dev.to | 1 May 2024
Kolmogorov-Arnold Networks

3 projects | news.ycombinator.com | 30 Apr 2024
Navigating the Risky Waters of Loan Defaults: A Predictive Beacon

1 project | dev.to | 30 Apr 2024
Alternative Chunking Methods

1 project | news.ycombinator.com | 30 Apr 2024
MinSH: Near-linear time global string alignment in a few lines of Python

1 project | news.ycombinator.com | 29 Apr 2024

SentenceTransformers: Python framework for sentence, text and image embeddings

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Post date: 7 Apr 2024

fast_vector_similarity

LASER

InfluxDB

Related posts

Quick tip: Write numpy arrays directly to the SingleStore VECTOR data type

Kolmogorov-Arnold Networks

Navigating the Risky Waters of Loan Defaults: A Predictive Beacon

Alternative Chunking Methods

MinSH: Near-linear time global string alignment in a few lines of Python