Is Cosine-Similarity of Embeddings Really About Similarity?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  1. word2vec

    Automatically exported from code.google.com/p/word2vec

    The original paper included source, and that has their test data and results -- it gets ~77% accuracy on about 20k example word analogies (with 99.7% coverage), and 78% accuracy with phrases with 77% coverage. You can see the test set here:

    https://github.com/tmikolov/word2vec/blob/master/questions-w...

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Building a Security-First OS from Scratch: AtomicOS Journey

    1 project | dev.to | 20 Jun 2025
  • kb_text_shape.h: Unicode text segmentation and OpenType shaping

    1 project | news.ycombinator.com | 20 Jun 2025
  • AtomicOS – A security-first OS with real crypto and deterministic language

    1 project | news.ycombinator.com | 20 Jun 2025
  • I Build libSQL Server Web GUI - MylibSQLAdmin

    1 project | dev.to | 20 Jun 2025
  • Building Intelligent Search with AI Embeddings, Neon, and pgvector

    1 project | dev.to | 20 Jun 2025