Leaving Spotify for Freer Pastures

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

annoy

40 12,740 5.3 C++

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

Is your music recommendation system open source? Would be down to check it out and learn a thing or two from it.
On the topic of vector search, I'm fairly certain that Spotify still uses Annoy (https://github.com/spotify/annoy). Like Faiss, it's a great library but not quite a database, which would ideally have features like replication (https://milvus.io/docs/replica.md), caching, and access control, to name a few.

spotify-record-player

1 3 10.0 Python

An app using the Spotify API to mock playing records
InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Vector Databases 101

3 projects | /r/datascience | 25 Jun 2023
I'm an undergraduate data science intern and trying to run kmodes clustering. Did this elbow method to figure out how many clusters to use, but I don't really see an "elbow". Tips on number of clusters?

2 projects | /r/datascience | 21 Jun 2023
Calculating document similarity in a special domain

1 project | /r/LanguageTechnology | 1 Jun 2023
Can Parquet file format index string columns?

1 project | /r/dataengineering | 27 May 2023
Billion-Scale Approximate Nearest Neighbor Search [pdf]

1 project | news.ycombinator.com | 6 May 2023

Leaving Spotify for Freer Pastures

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Recommender Systems C++ Python nearest-neighbor-search locality-sensitive-hashing
Post date: 30 Nov 2022

annoy

spotify-record-player

InfluxDB

Related posts

Vector Databases 101

I'm an undergraduate data science intern and trying to run kmodes clustering. Did this elbow method to figure out how many clusters to use, but I don't really see an "elbow". Tips on number of clusters?

Calculating document similarity in a special domain

Can Parquet file format index string columns?

Billion-Scale Approximate Nearest Neighbor Search [pdf]

Leaving Spotify for Freer Pastures

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Recommender Systems C++ Python nearest-neighbor-search locality-sensitive-hashing Post date: 30 Nov 2022

annoy

spotify-record-player

InfluxDB

Related posts

Vector Databases 101

I'm an undergraduate data science intern and trying to run kmodes clustering. Did this elbow method to figure out how many clusters to use, but I don't really see an "elbow". Tips on number of clusters?

Calculating document similarity in a special domain

Can Parquet file format index string columns?

Billion-Scale Approximate Nearest Neighbor Search [pdf]

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Recommender Systems C++ Python nearest-neighbor-search locality-sensitive-hashing
Post date: 30 Nov 2022