annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Tracking particles in High Energy Physics means connecting the hits produced by the same particle. In a typical High Luminosity collision event, about 100K hits are produced by about 10K particles, an average of 10 hits per particle. The challenge is to pick out the right 10 hits for each particle from a collection of 100K similar-looking hits, and to do it in under a second! This post is a Python guide to particle tracking with the Approximate Nearest Neighbor library Annoy.
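To make the idea concrete, here is a minimal sketch of how nearest-neighbor search can group hits. It is not the method from the post, only a toy under loud assumptions: each particle travels in a straight line from the origin (real tracks curve in a magnetic field), the event has 20 particles instead of 10K, and all names (`make_toy_hits`, `neighbor_lists`, `purity`) are made up for illustration. The only Annoy calls used are `AnnoyIndex(f, metric)`, `add_item`, `build`, and `get_nns_by_item`. If Annoy is not installed, the sketch falls back to an exact brute-force search with the same angular distance.

```python
import math
import random

try:
    from annoy import AnnoyIndex  # third-party: pip install annoy
except ImportError:
    AnnoyIndex = None  # fall back to exact brute-force search below

# Toy sizes (assumption): far smaller than a real 100K-hit event.
N_PARTICLES, HITS_PER_PARTICLE, DIM = 20, 10, 3


def make_toy_hits(seed=0):
    """Toy event: every particle leaves hits along a straight line from the origin."""
    rng = random.Random(seed)
    golden = math.pi * (3.0 - math.sqrt(5.0))  # spreads directions over the sphere
    hits, labels = [], []
    for p in range(N_PARTICLES):
        z = 1.0 - 2.0 * (p + 0.5) / N_PARTICLES
        rho = math.sqrt(1.0 - z * z)
        direction = (rho * math.cos(golden * p), rho * math.sin(golden * p), z)
        for layer in range(1, HITS_PER_PARTICLE + 1):
            # One hit per detector "layer", smeared by a small Gaussian noise.
            hits.append([layer * d + rng.gauss(0.0, 0.01) for d in direction])
            labels.append(p)
    return hits, labels


def angular_distance(u, v):
    """Same definition Annoy uses for 'angular': sqrt(2 * (1 - cos(u, v)))."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return math.sqrt(max(0.0, 2.0 - 2.0 * dot / norm))


def neighbor_lists(hits, k, n_trees=10):
    """For every hit, return the ids of its k nearest hits (the hit itself included)."""
    if AnnoyIndex is not None:
        index = AnnoyIndex(DIM, "angular")
        for i, hit in enumerate(hits):
            index.add_item(i, hit)
        index.build(n_trees)  # more trees: better recall, bigger index
        return [index.get_nns_by_item(i, k) for i in range(len(hits))]
    # Exact fallback: sort all hits by angular distance to hit i.
    ids = range(len(hits))
    return [
        sorted(ids, key=lambda j: angular_distance(hits[i], hits[j]))[:k]
        for i in ids
    ]


def purity(neighbors, labels):
    """Fraction of returned neighbors that belong to the same particle as the query hit."""
    same = sum(labels[j] == labels[i] for i, nbrs in enumerate(neighbors) for j in nbrs)
    total = sum(len(nbrs) for nbrs in neighbors)
    return same / total


hits, labels = make_toy_hits()
neighbors = neighbor_lists(hits, k=HITS_PER_PARTICLE)
print(f"neighbor purity: {purity(neighbors, labels):.2f}")
```

The angular metric is chosen here because, in this toy geometry, hits from one particle share a direction from the origin, so they end up close together regardless of which layer they sit on. A real event would likely need a better feature space before the neighbors of a hit line up with its track.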