spyglass
tantivy
Metric | spyglass | tantivy
---|---|---
Mentions | 39 | 48
Stars | 2,428 | 9,803
Growth | 2.1% | 2.9%
Activity | 7.3 | 9.1
Last commit | about 1 month ago | 7 days ago
Language | Rust | Rust
License | GNU Affero General Public License v3.0 | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
spyglass
- Spyglass: A Personal Search Engine
- Ask HN: Search engine for a small number of sites?
No direct experience but I saved some links about this:
https://wiby.me/about/guide.html
https://github.com/spyglass-search/spyglass
Interesting thread(s) on HN about the last one:
- A little demo integrating the Alpaca model w/ my open-source search app
I've been working on a self-hosted personal search app (https://github.com/spyglass-search/spyglass) and have recently been playing around with how to integrate it with local LLMs. I think this would be an awesome step into having your own personal assistant that can search through all your data and give you analysis / summaries.
- Building better search for OSRS content
We have an open-source desktop app (https://github.com/spyglass-search/spyglass) and are working on a web version that has more powerful features such as conversation search, check it out!
- Building conversational search for your data
I'm one of the devs for Spyglass (https://github.com/spyglass-search/spyglass), an open-source personal search app. We're excited about all the advancements with language models recently and wanted to try merging the two ideas together to form something even better.
- wiby: build your own search engine of selected/submitted websites
Another more recently available option is spyglass; it is more tenable than YaCy, but development is mostly on macOS with a focus on a desktop interface. I like the idea of a web-based interface.
- Spaceman: A gRPC client from another world. Comes both as a CLI and as a GUI built with Tauri and Yew.rs
- Building personal search engine for local documents (including Obsidian notes!) and more
I'm part of a small team that's been building an open-source personal search engine (https://github.com/spyglass-search/spyglass). One of the major use cases is searching your local files and their contents, which aligns perfectly with how Obsidian stores its notes.
- Search your reddit saved & upvoted posts via Spyglass
I'm one of the developers of Spyglass (https://github.com/spyglass-search/spyglass), an open-source self-hosted personal search engine. We recently added the ability to search through your Reddit saved & upvoted posts!
tantivy
- SeekStorm VS tantivy - a user-suggested alternative
2 projects | 22 Mar 2024
- What is Hybrid Search?
Tantivy - a full-text indexing library written in Rust. It has great performance and a rich feature set.
- Tantivy – Fast, OSS full-text search library in Rust
- RAG Using Unstructured Data and Role of Knowledge Graphs
By this I presume you mean build a search index that can retrieve results based on keywords? I know certain databases use Lucene to build a keyword-based index on top of unstructured blobs of data. Another alternative is to use Tantivy (https://github.com/quickwit-oss/tantivy), a Rust version of Lucene, if building search indices via Java isn't your cup of tea :)
Both libraries offer multilingual support for keywords, I believe, which is an advantage over vector search, where multilingual embedding models are rather expensive.
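To make the idea of a keyword-based index concrete, here is a minimal sketch of an inverted index using only the Rust standard library. It is not Tantivy's or Lucene's API; the `KeywordIndex` type, its methods, and the whitespace/punctuation tokenizer are hypothetical names chosen for illustration, and real libraries add ranking, stemming, and compressed on-disk storage on top of this idea.

```rust
use std::collections::{BTreeSet, HashMap};

/// Toy keyword index (hypothetical, for illustration only):
/// maps each lowercased term to the set of document IDs that contain it.
#[derive(Default)]
pub struct KeywordIndex {
    postings: HashMap<String, BTreeSet<usize>>,
}

impl KeywordIndex {
    /// Splits text on non-alphanumeric characters and lowercases each token.
    fn tokenize(text: &str) -> impl Iterator<Item = String> + '_ {
        text.split(|c: char| !c.is_alphanumeric())
            .filter(|t| !t.is_empty())
            .map(|t| t.to_lowercase())
    }

    /// Records every term of `text` as occurring in document `doc_id`.
    pub fn add_document(&mut self, doc_id: usize, text: &str) {
        for term in Self::tokenize(text) {
            self.postings.entry(term).or_default().insert(doc_id);
        }
    }

    /// Returns the IDs of documents containing every query term (AND semantics),
    /// in ascending order. An empty query matches nothing.
    pub fn search(&self, query: &str) -> Vec<usize> {
        let mut result: Option<BTreeSet<usize>> = None;
        for term in Self::tokenize(query) {
            let docs = self.postings.get(&term).cloned().unwrap_or_default();
            result = Some(match result {
                Some(acc) => acc.intersection(&docs).copied().collect(),
                None => docs,
            });
        }
        result.unwrap_or_default().into_iter().collect()
    }
}
```

After adding a few documents with `add_document`, calling `search("search library")` returns the IDs of the documents that contain both words.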
- Show HN: Quickwit – OSS Alternative to Elasticsearch, Splunk, Datadog
We also implemented our schemaless columnar storage optimized for object storage.
The inverted index and columnar storage are part of tantivy [0], which is the fastest search library out there. We maintain it and we decided to build the distributed engine on top of it.
[0] tantivy github repo: https://github.com/quickwit-oss/tantivy
- Pg_bm25: Elastic-Quality Full Text Search Inside Postgres
The issue for geo search is here: https://github.com/quickwit-oss/tantivy/issues/44
- Grimoire - A recipe management application.
Search index: custom-built using tantivy.
- A Compressed Indexable Bitset
The roaring bitmap variant is used only for the optional index (1 docid => 0 or 1 value) in the columnar storage (DocValues), not for the inverted index. Since this is used for aggregation, some queries may be a full scan.
The inverted index in tantivy uses bitpacked values of 128 elements with a skip index on top.
> I didn't follow the rest of your comment, select is what EF is good at, every other data structure needs a lot more scanning once you land on the right chunk. With BMI2 you can also use the PDEP instruction to accelerate the final select on a 64-bit block
The select for the sparse codec is a [simple array index access](https://github.com/quickwit-oss/tantivy/blob/main/columnar/s...), that is hard to beat. Compression is not good near the 5k threshold though.
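As a rough illustration of the "bitpacked values of 128 elements with a skip index on top" mentioned in this comment, the sketch below packs a block of 128 integers into the smallest fixed bit width that fits them and reads single values back. It is not tantivy's actual code: `BLOCK_LEN`, `bit_width`, `pack`, `get`, and `find_block` are hypothetical names, and the skip structure shown (the last document ID of each block, searched with binary search) is an assumption about how such an index could be laid out.

```rust
/// Number of values per block, matching the 128-element blocks mentioned above.
pub const BLOCK_LEN: usize = 128;

/// Smallest bit width that can represent every value in the block (0 if all are zero).
pub fn bit_width(values: &[u32; BLOCK_LEN]) -> u32 {
    let max = values.iter().copied().max().unwrap_or(0);
    32 - max.leading_zeros()
}

/// Packs each value into `width` bits, least-significant bit first.
pub fn pack(values: &[u32; BLOCK_LEN], width: u32) -> Vec<u8> {
    let mut out = vec![0u8; (BLOCK_LEN * width as usize + 7) / 8];
    let mut bit_pos = 0usize;
    for &v in values {
        for i in 0..width {
            if (v >> i) & 1 == 1 {
                out[bit_pos / 8] |= 1 << (bit_pos % 8);
            }
            bit_pos += 1;
        }
    }
    out
}

/// Reads the value at `index` without decoding the rest of the block.
pub fn get(packed: &[u8], width: u32, index: usize) -> u32 {
    let start = index * width as usize;
    let mut v = 0u32;
    for i in 0..width as usize {
        let bit_pos = start + i;
        let bit = (packed[bit_pos / 8] >> (bit_pos % 8)) & 1;
        v |= (bit as u32) << i;
    }
    v
}

/// Assumed skip-index layout: `last_doc_ids[b]` holds the largest doc ID in block `b`.
/// Returns the first block that may contain `target`
/// (or `last_doc_ids.len()` if `target` is past the end).
pub fn find_block(last_doc_ids: &[u32], target: u32) -> usize {
    last_doc_ids.partition_point(|&last| last < target)
}
```

A block whose largest value is below 128 needs 7 bits per value, so 128 values fit in 112 bytes instead of 512. Production implementations typically unpack whole blocks at once with SIMD rather than bit by bit as done here.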
- Job: Rust + Retrieval Systems at Etsy
Hi /r/rust, I’m a SWE on Etsy’s Retrieval Systems team, where we’re building a platform based on Rust and tantivy (https://github.com/quickwit-oss/tantivy). We’re looking to bring two new engineers onto the team.
- Announcing Velo - Your Rust-Powered Brainstorming and Note-Taking Tool
Quick Search: Easily find specific notes with Velo's fuzzy-search feature, powered by tantivy. tantivy might have been a little overkill, but it was really easy to integrate.
What are some alternatives?
tika-docker - Convenience Docker images for Apache Tika Server
sonic - 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
OneTab-Night-Mode - Little theme for the onetab page to make it less eye raping.
surrealdb - A scalable, distributed, collaborative, document-graph database, for the realtime web
calculator-rust-react - Calculator that performs the basic arithmetic functions; these functions are executed in Rust and the UI is built with ReactJS, using Tauri as the intermediary between Rust and ReactJS
milli - Search engine library for Meilisearch ⚡️
abracabra - Eventually a search engine, but currently a filtering pipeline for HTML and soon WARC files.
MeiliSearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
vscode-haskell - VS Code extension for Haskell, powered by haskell-language-server
quickwit - Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.
alfred-my-mind - Alfred workflow to search through my notes and bookmarks
fselect - Find files with SQL-like queries