smyrna
search-benchmark-game
smyrna | search-benchmark-game | |
---|---|---|
1 | 5 | |
18 | 66 | |
- | - | |
10.0 | 6.7 | |
almost 2 years ago | 3 months ago | |
Clojure | Rust | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
smyrna
-
An alternative to Elasticsearch that runs on a few MBs of RAM
I've written a full-text search engine as well. I don't tout it as a replacement for Elasticsearch, but it does have a few advantages: it's fast; supports HTML documents; supports Polish inflection (via a full-blown morphological dictionary, not just a stemmer); and has a very compact on-disk format (pre-parsed HTML trees, Huffman-encoded over large alphabets). Oh, and it's 100% Clojure.
It underlies a GUI called Smyrna: https://github.com/nathell/smyrna, https://smyrna.danieljanus.pl
I haven't touched it in six years, other than a few small changes. But I do plan on revisiting it when time permits.
search-benchmark-game
-
Infino - Fast and scalable service to store time series and logs - written in Rust
Also, we have a benchmark for search. Feel free to add your engine. I believe it is fair: we are not leading the leaderboard, the rules are fairly clear, and no one has contested them so far. https://github.com/quickwit-oss/search-benchmark-game/
-
tantivy 0.19 is released: IP field type, Faster indexing, Configurable doc store compression, Improved aggregation support, and more...
Could you update the benchmark? It still uses tantivity 0.16.
-
An alternative to Elasticsearch that runs on a few MBs of RAM
This is very very difficult, but Tantivy tried: see https://github.com/quickwit-oss/search-benchmark-game
-
Why Is C Faster Than Java (2009)
That's just because there's no a lucene equivalent C library with the same level of attention?
however, there are increasingly such written in C++ (pisa) and rust (tantivy). They handily beat lucene in benchmark suites [1] - so it seems like lucene does suffer from a java penalty - despite getting even more developer attention than pisa and tantivy I would think.
1: https://tantivy-search.github.io/bench/
-
Tantivy v0.15 released! Now backed by Quickwit Inc.!
The benchmark is open sourced here: https://github.com/tantivy-search/search-benchmark-game
What are some alternatives?
tantivy-wasm
proposal-explicit-resource-managemen
Nim - Nim is a statically typed compiled systems programming language. It combines successful concepts from mature languages like Python, Ada and Modula. Its design focuses on efficiency, expressiveness, and elegance (in that order of priority).
librope - UTF-8 rope library for C
tantivy - Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust [Moved to: https://github.com/quickwit-oss/tantivy]
distributed-wikipedia-mirror - Putting Wikipedia Snapshots on IPFS
proposal-explicit-resource-management - ECMAScript Explicit Resource Management
lyra - 🌌 Fast, in-memory, typo-tolerant, full-text search engine written in TypeScript. [Moved to: https://github.com/LyraSearch/lyra]
Graal - GraalVM compiles Java applications into native executables that start instantly, scale fast, and use fewer compute resources 🚀
okon - Fast offline searching for SHA-1 keys in Have I Been Pwned databases
Vrmac - Vrmac Graphics, a cross-platform graphics library for .NET. Supports 3D, 2D, and accelerated video playback. Works on Windows 10 and Raspberry Pi4.
tantivy - Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust