search-benchmark-game
tantivy
| | search-benchmark-game | tantivy |
|---|---|---|
| | 5 | 18 |
| Stars | 66 | 5,829 |
| Growth | - | - |
| Activity | 6.7 | 9.3 |
| | 3 months ago | over 2 years ago |
| Language | Rust | Rust |
| License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
search-benchmark-game
-
Infino - Fast and scalable service to store time series and logs - written in Rust
Also, we have a benchmark for search. Feel free to add your engine. I believe it is fair: we are not leading the leaderboard, the rules are fairly clear, and no one has contested them so far. https://github.com/quickwit-oss/search-benchmark-game/
-
tantivy 0.19 is released: IP field type, Faster indexing, Configurable doc store compression, Improved aggregation support, and more...
Could you update the benchmark? It still uses tantivy 0.16.
-
An alternative to Elasticsearch that runs on a few MBs of RAM
This is very very difficult, but Tantivy tried: see https://github.com/quickwit-oss/search-benchmark-game
-
Why Is C Faster Than Java (2009)
That's just because there's no Lucene-equivalent C library with the same level of attention?
However, there are increasingly such libraries written in C++ (pisa) and Rust (tantivy). They handily beat Lucene in benchmark suites [1], so it seems like Lucene does suffer from a Java penalty, despite getting even more developer attention than pisa and tantivy, I would think.
1: https://tantivy-search.github.io/bench/
-
Tantivy v0.15 released! Now backed by Quickwit Inc.!
The benchmark is open sourced here: https://github.com/tantivy-search/search-benchmark-game
tantivy
-
Hey y'all back again w/ the personal, self-hosted search engine
Backend uses tantivy to index the web pages, sqlite3 to hold metadata / crawl queue
- Ask HN: What are some good rust code to read to learn the language?
-
Looking for recommendations of well maintained open source rust codebases that I can look through/contribute to
Tantivy is a very well made library and also follows a lot of best practices. If you like search, you'll like this: https://github.com/quickwit-inc/tantivy
-
self hosted elasticsearch alternative
tantivy - More of a search engine library than an out-of-the-box solution
-
Whats your favourite open source Rust project that needs more recognition?
Tantivy search engine.
-
Is there a library for instant arbitrary text searching?
You could try the Tantivy crate, with an n-gram tokenizer, which would split and index your text in sliding groups of n characters.
-
Zest: a CLI tool for zettelkasten-like note management
I had to look up the "tantivy" that README mentions. https://github.com/tantivy-search/tantivy. Might want to add a link to the project in your README.
-
Are you using Rust at work? If yes, for what?
We're using Rust for a domain-specific search engine. When I first learned Rust some years ago my first thought was that this language is perfect for heavy text processing. IMO, &str is that single killer feature that got me sold :) The search engine that we're building is based on https://github.com/tantivy-search/tantivy.
- Tantivy, a full-text search engine library in Rust inspired by Apache Lucene
-
Tantivy v0.15 released! Now backed by Quickwit Inc.!
Well spotted. Like IPFS, there's a comment about that here: https://github.com/tantivy-search/tantivy/pull/1067#issuecomment-853139923 that points to the distributed wikipedia mirror project https://github.com/ipfs/distributed-wikipedia-mirror/issues/76
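The n-gram suggestion in the "instant arbitrary text searching" thread above can be illustrated with a small sketch. It only shows the idea of splitting text into sliding groups of n characters; it does not call Tantivy's own tokenizer API, and the function name `char_ngrams` is hypothetical.

```rust
/// Split `text` into overlapping groups of `n` characters (character n-grams).
/// Illustrative helper only; this is not part of the tantivy crate.
fn char_ngrams(text: &str, n: usize) -> Vec<String> {
    // Collect chars first so multi-byte UTF-8 characters are never split.
    let chars: Vec<char> = text.chars().collect();
    // `windows(0)` would panic, and text shorter than `n` has no n-grams.
    if n == 0 || chars.len() < n {
        return Vec::new();
    }
    // Slide a window of width `n` over the characters, one step at a time.
    chars.windows(n).map(|w| w.iter().collect()).collect()
}

fn main() {
    // Prints the trigrams of "tantivy": tan, ant, nti, tiv, ivy (one per line).
    for gram in char_ngrams("tantivy", 3) {
        println!("{gram}");
    }
}
```

Indexing every such fragment is what lets a query match an arbitrary substring, at the cost of a larger index than word-level tokenization would produce.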
What are some alternatives?
tantivy-wasm
sonic - 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
Nim - Nim is a statically typed compiled systems programming language. It combines successful concepts from mature languages like Python, Ada and Modula. Its design focuses on efficiency, expressiveness, and elegance (in that order of priority).
pueue - Manage your shell commands.
librope - UTF-8 rope library for C
neon - Rust bindings for writing safe and fast native Node.js modules.
distributed-wikipedia-mirror - Putting Wikipedia Snapshots on IPFS
neuron - Future-proof note-taking and publishing based on Zettelkasten (superseded by Emanote: https://github.com/srid/emanote)
proposal-explicit-resource-management - ECMAScript Explicit Resource Management
zk - A plain text note-taking assistant