aistore
tantivy
aistore | tantivy | |
---|---|---|
1 | 18 | |
1,098 | 5,829 | |
3.3% | - | |
9.8 | 9.3 | |
1 day ago | over 2 years ago | |
Go | Rust | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
aistore
-
Search 1B pages on AWS S3 for
> If you think more about this, it will be like distributed key value store with support both disk and memory access. You can write one using some opensource Raft libraries, or a possible candidate is Tikv from PingCap
My whole point was not building it ;)
There's also https://github.com/NVIDIA/aistore
tantivy
-
Hey y'all back again w/ the personal, self-hosted search engine
Backend uses tantivy to index the web pages, sqlite3 to hold metadata / crawl queue
- Ask HN: What are some good rust code to read to learn the language?
-
Looking for recommendations of well maintained open source rust codebases that I can look through/contribute to
Tantivy is a very well made library and also follows alot of the best practices if you like search you'll like this: https://github.com/quickwit-inc/tantivy
-
self hosted elasticsearch alternative
tantivy - More of a search engine library than out of the box solution
-
Whats your favourite open source Rust project that needs more recognition?
Tantivy search engine.
-
Is there a library for instant arbitrary text searching?
You could try the Tantivy crate, with an n-gram tokenizer, which would split and index your text in sliding groups of n characters.
-
Zest: a CLI tool for zettelkasten-like note management
I had to look up the "tantivy" that README mentions. https://github.com/tantivy-search/tantivy. Might want to add a link to the project in your README.
-
Are you using Rust at work? If yes, for what?
We're using Rust for a domain-specific search engine. When I first learned Rust some years ago my first thought was that this language is perfect for heavy text processing. IMO, &str is that single killer feature that got me sold :) The search engine that we're building is based on https://github.com/tantivy-search/tantivy.
- Tantivy, a full-text search engine library in Rust inspired by Apache Lucene
-
Tantivy v0.15 released! Now backed by Quickwit Inc.!
Well spotted. Like IPFS, there's a comment about that here: https://github.com/tantivy-search/tantivy/pull/1067#issuecomment-853139923 that points to the distributed wikipedia mirror project https://github.com/ipfs/distributed-wikipedia-mirror/issues/76
What are some alternatives?
seaweedfs - SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
sonic - 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
Seaweed File System - SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding. [Moved to: https://github.com/seaweedfs/seaweedfs]
tantivy-wasm
lakeFS - lakeFS - Data version control for your data lake | Git for data
pueue - :stars: Manage your shell commands.
microblob - Serve millions of JSON documents via HTTP.
neon - Rust bindings for writing safe and fast native Node.js modules.
minio - The Object Store for AI Data Infrastructure
neuron - Future-proof note-taking and publishing based on Zettelkasten (superseded by Emanote: https://github.com/srid/emanote)
juicefs - JuiceFS is a distributed POSIX file system built on top of Redis and S3.
zk - A plain text note-taking assistant