fclones
tantivy
fclones | tantivy | |
---|---|---|
17 | 18 | |
1,729 | 5,829 | |
- | - | |
6.4 | 9.3 | |
22 days ago | over 2 years ago | |
Rust | Rust | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
fclones
- Fclones: Parallel duplicate files finder in Rust with reflink support
- Fclones: Efficient Duplicate File Finder
- fdupes: Identify or Delete Duplicate Files
-
Deduplication at file level
Install fclones (fdupes modern equivalent)
- A duplicate file finder with GUI written using gtk4-rs and relm4
-
making duplicate files hard links, probably rdfind
I said "fclone" I meant "fclones" (https://github.com/pkolaczk/fclones) -- I missed the "s" at the end
-
Use `dano` to find duplicate media files
While we're recommending stuff, fclones: https://github.com/pkolaczk/fclones
-
Oxidise Your Life
fclones: Efficient Duplicate File Finder.
-
Pip and cargo are not the same
Yes, that would be safer when available (although generally files within library dependencies are not modified I think?). It looks like fclones implements this, is faster and is written in Rust https://github.com/pkolaczk/fclones (the last is the most important point of course /s).
- Caf a rust program which filters out duplicate files
tantivy
-
Hey y'all back again w/ the personal, self-hosted search engine
Backend uses tantivy to index the web pages, sqlite3 to hold metadata / crawl queue
- Ask HN: What are some good rust code to read to learn the language?
-
Looking for recommendations of well maintained open source rust codebases that I can look through/contribute to
Tantivy is a very well made library and also follows alot of the best practices if you like search you'll like this: https://github.com/quickwit-inc/tantivy
-
self hosted elasticsearch alternative
tantivy - More of a search engine library than out of the box solution
-
Whats your favourite open source Rust project that needs more recognition?
Tantivy search engine.
-
Is there a library for instant arbitrary text searching?
You could try the Tantivy crate, with an n-gram tokenizer, which would split and index your text in sliding groups of n characters.
-
Zest: a CLI tool for zettelkasten-like note management
I had to look up the "tantivy" that README mentions. https://github.com/tantivy-search/tantivy. Might want to add a link to the project in your README.
-
Are you using Rust at work? If yes, for what?
We're using Rust for a domain-specific search engine. When I first learned Rust some years ago my first thought was that this language is perfect for heavy text processing. IMO, &str is that single killer feature that got me sold :) The search engine that we're building is based on https://github.com/tantivy-search/tantivy.
- Tantivy, a full-text search engine library in Rust inspired by Apache Lucene
-
Tantivy v0.15 released! Now backed by Quickwit Inc.!
Well spotted. Like IPFS, there's a comment about that here: https://github.com/tantivy-search/tantivy/pull/1067#issuecomment-853139923 that points to the distributed wikipedia mirror project https://github.com/ipfs/distributed-wikipedia-mirror/issues/76
What are some alternatives?
jdupes - A powerful duplicate file finder and an enhanced fork of 'fdupes'.
sonic - 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
rdfind - find duplicate files utility
tantivy-wasm
rmlint - Extremely fast tool to remove duplicates and other lint from your filesystem
pueue - :stars: Manage your shell commands.
czkawka - Multi functional app to find duplicates, empty folders, similar images etc.
neon - Rust bindings for writing safe and fast native Node.js modules.
CompactGUI - Transparently compress active games and programs using Windows 10/11 APIs
neuron - Future-proof note-taking and publishing based on Zettelkasten (superseded by Emanote: https://github.com/srid/emanote)
dupd - CLI utility to find duplicate files
zk - A plain text note-taking assistant