-
Typesense
Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
A problem close to my heart. Good search is certainly still too difficult to pull off for small teams, and was one of my motivations for building and open sourcing Typesense[1].
Most people think of search and immediately think of large data sets, but the problems that plague smaller datasets are equally interesting. It's less about performance and more about relevance. For e.g. searching across multiple fields on a compound query like "taylor swift style", which requires breaking the query into segments before searching for the appropriate fields. There are also a class of problems that traditional search engines that rely on BM25 or TF-IDF for ranking cannot reliably solve (e.g. searching on small texts like titles) where you have to consider distance between matching words (which TF-IDF and BM25 ignore). Lastly, there is also personalization which is almost always left as an exercise to the reader :)
[1]: https://github.com/typesense/typesense
Assuming you have an NVIDIA GPU, you can build a semantic search engine by indexing CLIP embeds (image or text).
https://github.com/rom1504/clip-retrieval