Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 12 C++ search-engine Projects
-
Typesense
Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
-
manticoresearch
Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
infinity
The AI-native database built for LLM applications, providing incredibly fast full-text and vector search (by infiniflow)
-
usearch
Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
orient
CLI/GUI file searcher combining find, locate and Everything, plus content searching. Works on Linux, Windows and macOS. This repo is for CLI/Library.
Disregarding props-drilling technique in favor of a more reliable and elegant solution we looked for inspiration elsewhere. Another project of ours .find was using Typesense/Algolia components, which looked a bit like black-box/magic, but at the same time provided a clean approach to build complex and highly customizable solutions.
Note, you need to do it in the root folder of a clone from https://github.com/manticoresoftware/manticoresearch
Using C++20 modules, take a look at this project: https://github.com/infiniflow/infinity
Project mention: I'm writing a new vector search SQLite Extension | news.ycombinator.com | 2024-05-02Might have a look at this library:
https://github.com/unum-cloud/usearch
It does HNSW and there is a SQLite related project, though not quite the same thing.
The EF core algorithm implemented in folly [3] may be a bit faster, and implementing partitioning on top of that is relatively easy.
It would definitely compress much better than roaring bitmaps. In terms of performance, it depends on the access patterns. If very sparse (large jumps) PEF would likely be faster, if dense (visit a large fraction of the bitmap) it'd be slower.
It is possible to squeeze a bit more compression out of PEF by introducing a chunk type for Elias-Fano of the chunk complement (for very dense chunks), but you lose the operation of skipping to a given position, which is however not needed in inverted indexes (you only need to skip past a given id, and that can be supported efficiently). That is not mentioned in the paper because at the time I thought the skip-to-position operation was a non-negotiable.
[1] https://github.com/ot/ds2i/
[2] https://github.com/pisa-engine/pisa
[3] https://github.com/facebook/folly/blob/main/folly/experiment...
Project mention: Myscaledb: Open-source SQL vector database to build AI apps using SQL | news.ycombinator.com | 2024-04-01
Project mention: `orient` File Searcher - Drop-in Cross-platform Replacement for `Everything`, as Fast as `plocate`, plus Content Searches. Has both CLI and GUI. | /r/coolgithubprojects | 2023-05-11
C++ search-engine related posts
-
Myscaledb: Open-source SQL vector database to build AI apps using SQL
-
Show HN: High-Performance SQL Vector Database MyScaleDB Goes Open Source
-
Website Search Hurts My Feelings
-
Remote Machine Learning and Searching on a Raspberry Pi 5
-
DNS record "hn.algolia.com" is gone
-
A Compressed Indexable Bitset
-
Obsidian Publish full text search
-
A note from our sponsor - InfluxDB
www.influxdata.com | 10 May 2024
Index
What are some of the best open-source search-engine projects in C++? This list will help you:
Project | Stars | |
---|---|---|
1 | Typesense | 18,107 |
2 | manticoresearch | 8,350 |
3 | infinity | 1,726 |
4 | usearch | 1,691 |
5 | pisa | 864 |
6 | MeTA | 684 |
7 | MyScaleDB | 665 |
8 | alexandria | 180 |
9 | librengine | 64 |
10 | orient | 61 |
11 | DawnlightSearch | 36 |
12 | flipper | 29 |
Sponsored