distributed-wikipedia-mirror
tantivy
distributed-wikipedia-mirror | tantivy | |
---|---|---|
11 | 18 | |
603 | 5,829 | |
1.5% | - | |
3.6 | 9.3 | |
3 months ago | over 2 years ago | |
TypeScript | Rust | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
distributed-wikipedia-mirror
- Distributed Wikipedia Mirror Project: Putting Wikipedia Snapshots on IPFS
-
Is it possible (and does it make sense) to self host, openstreetmaps, Wikipedia and a complete search engine ?
You might like this repo. This tech was/is used in Turkey since they banned access to wikipedia. The read-only is a feature because nobody should be able to manipulate the contents of this distributed copy.
-
Uhhh wtf is this? 'Distributed Wikipedia Mirror Project' built on GME blockchain???
Link to the github
- Wikiless: A free open source alternative Wikipedia front-end focused on privacy
-
An idea about permanent hosting SCIHub on IPFS
So I thought there is a very suitable way to enhance the availability of SCIHub --- to store SCIHub papers on IPFS network through Crust, and develop a SCIHub-IPFS-Mirror for this to facilitate user access (similar to the project [distributed-wikipedia-mirror](https://github.com/ipfs/distributed-wikipedia-mirror) ).
-
What are the odds of the Internet Archive getting shut in the next 5 years and what will we do after it is shut?
follow the cohost steps https://github.com/ipfs/distributed-wikipedia-mirror
-
Internet in a Box
For my wikipedia cache I use IPFS companion and https://en.wikipedia-on-ipfs.org/wiki/. All the devices that use this approach on a local network can share data. And to make sure unused wikipedia pages aren't garbage collected, https://github.com/ipfs/distributed-wikipedia-mirror#cohost-...
-
Tantivy v0.15 released! Now backed by Quickwit Inc.!
Well spotted. Like IPFS, there's a comment about that here: https://github.com/tantivy-search/tantivy/pull/1067#issuecomment-853139923 that points to the distributed wikipedia mirror project https://github.com/ipfs/distributed-wikipedia-mirror/issues/76
tantivy
-
Hey y'all back again w/ the personal, self-hosted search engine
Backend uses tantivy to index the web pages, sqlite3 to hold metadata / crawl queue
- Ask HN: What are some good rust code to read to learn the language?
-
Looking for recommendations of well maintained open source rust codebases that I can look through/contribute to
Tantivy is a very well made library and also follows alot of the best practices if you like search you'll like this: https://github.com/quickwit-inc/tantivy
-
self hosted elasticsearch alternative
tantivy - More of a search engine library than out of the box solution
-
Whats your favourite open source Rust project that needs more recognition?
Tantivy search engine.
-
Is there a library for instant arbitrary text searching?
You could try the Tantivy crate, with an n-gram tokenizer, which would split and index your text in sliding groups of n characters.
-
Zest: a CLI tool for zettelkasten-like note management
I had to look up the "tantivy" that README mentions. https://github.com/tantivy-search/tantivy. Might want to add a link to the project in your README.
-
Are you using Rust at work? If yes, for what?
We're using Rust for a domain-specific search engine. When I first learned Rust some years ago my first thought was that this language is perfect for heavy text processing. IMO, &str is that single killer feature that got me sold :) The search engine that we're building is based on https://github.com/tantivy-search/tantivy.
- Tantivy, a full-text search engine library in Rust inspired by Apache Lucene
-
Tantivy v0.15 released! Now backed by Quickwit Inc.!
Well spotted. Like IPFS, there's a comment about that here: https://github.com/tantivy-search/tantivy/pull/1067#issuecomment-853139923 that points to the distributed wikipedia mirror project https://github.com/ipfs/distributed-wikipedia-mirror/issues/76
What are some alternatives?
ipfs - Peer-to-peer hypermedia protocol
sonic - 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
internetarchive-downloader - Simultaneous, resumable and hash-verified downloads from Internet Archive (archive.org)
tantivy-wasm
pueue - :stars: Manage your shell commands.
iiab - Internet-in-a-Box - Build your own LIBRARY OF ALEXANDRIA with a Raspberry Pi !
neon - Rust bindings for writing safe and fast native Node.js modules.
search-benchmark-game - Search engine benchmark (Tantivy, Lucene, PISA, ...)
neuron - Future-proof note-taking and publishing based on Zettelkasten (superseded by Emanote: https://github.com/srid/emanote)
ipfs-backup - Backup encrypted files on ipfs
zk - A plain text note-taking assistant