tantivy
Rust-Bio
tantivy | Rust-Bio | |
---|---|---|
18 | 9 | |
5,829 | 1,511 | |
- | 2.1% | |
9.3 | 6.4 | |
over 2 years ago | 26 days ago | |
Rust | Rust | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tantivy
-
Hey y'all back again w/ the personal, self-hosted search engine
Backend uses tantivy to index the web pages, sqlite3 to hold metadata / crawl queue
- Ask HN: What are some good rust code to read to learn the language?
-
Looking for recommendations of well maintained open source rust codebases that I can look through/contribute to
Tantivy is a very well made library and also follows alot of the best practices if you like search you'll like this: https://github.com/quickwit-inc/tantivy
-
self hosted elasticsearch alternative
tantivy - More of a search engine library than out of the box solution
-
Whats your favourite open source Rust project that needs more recognition?
Tantivy search engine.
-
Is there a library for instant arbitrary text searching?
You could try the Tantivy crate, with an n-gram tokenizer, which would split and index your text in sliding groups of n characters.
-
Zest: a CLI tool for zettelkasten-like note management
I had to look up the "tantivy" that README mentions. https://github.com/tantivy-search/tantivy. Might want to add a link to the project in your README.
-
Are you using Rust at work? If yes, for what?
We're using Rust for a domain-specific search engine. When I first learned Rust some years ago my first thought was that this language is perfect for heavy text processing. IMO, &str is that single killer feature that got me sold :) The search engine that we're building is based on https://github.com/tantivy-search/tantivy.
- Tantivy, a full-text search engine library in Rust inspired by Apache Lucene
-
Tantivy v0.15 released! Now backed by Quickwit Inc.!
Well spotted. Like IPFS, there's a comment about that here: https://github.com/tantivy-search/tantivy/pull/1067#issuecomment-853139923 that points to the distributed wikipedia mirror project https://github.com/ipfs/distributed-wikipedia-mirror/issues/76
Rust-Bio
- Bioinformatics Data Structures in Rust
- Bioinformatics with Rust
-
bioinformatic libraries and zig?
Does anyone know of zig native libraries for bioinformatics (here is a Rust example https://rust-bio.github.io/ )? It seems as though one could pull in a lot of bioinformatics C libraries such as done with https://github.com/brentp/hts-zig.
-
Proteomics search engine written in Rust
e.g. Rust-Bio
-
What are your top 3-5 programming languages and why?
I would start with the book and then rust-bio library. Rust is a pretty low level language compared to R/Python. It’s an especially good fit for writing efficient tools that make use of the kinds of algorithms / data structures that are implemented in rust-bio.
-
I have to admit. The free code camp course is a bit more sparing than I would have preferred. How did everyone learn Rust?
Absolutely! It already is, e.g., https://github.com/rust-bio/rust-bio. I'm moving from the academia/nonprofit world into industry bioinformatics, and I intend to use Rust as much as possible. I've already replaced as much of my Python as possible with Rust. I feel I'm able to create larger, more complex programs with Rust because I have the compiler to keep me from making common mistakes that are so easy to make in dynamically typed languages like Perl and Python. It might take longer to write a program initially, but I've started to create a library of functions I can paste together to do things like parse a positive integer, find a bunch of files with a certain file extension, search through data for a pattern, parse CSV files, etc. Writing my latest book has provided even more common patterns I keep finding I use over and over.
-
Is learning Rust and systems programming through the books Rust in Action and Crafting Interpreters a good idea?
I think there is huge potential for Rust in bioinformatics, and there are already some great projects like https://rust-bio.github.io/. It seems industry is also hiring for these skills. This Nature article is a little old, but also covers why people in the field are looking for greater safety and performance. It's relatively easy to write a Python program to do bio stuff, but it's also very easy to get lots of things wrong or for the resulting program to be slow and/or impossible to extend and maintain. In the long run, I think it makes sense to write in Rust. Perl was king in biofx when I started, and I would not have predicted it being displaced by Python, so there's good reason to believe that Python may one day be eclipsed by Rust.
-
Whats your favourite open source Rust project that needs more recognition?
Well, someone mentioned https://rust-bio.github.io/
-
How can one make Rust excel in the Sciences
So generally stuff in this maths/numerical space. The term is a bit deceptive because it rarely means domain-specific science libraries like rust-bio even thought that might be what you think when you hear "scientific computing".
What are some alternatives?
sonic - 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
dash - Data Apps & Dashboards for Python. No JavaScript Required.
tantivy-wasm
kanidm - Kanidm: A simple, secure and fast identity management platform
pueue - :stars: Manage your shell commands.
clickhouse-rs - Asynchronous ClickHouse client library for Rust programming language.
neon - Rust bindings for writing safe and fast native Node.js modules.
GeoRust - Geospatial primitives and algorithms for Rust
neuron - Future-proof note-taking and publishing based on Zettelkasten (superseded by Emanote: https://github.com/srid/emanote)
Rhai - Rhai - An embedded scripting language for Rust.
zk - A plain text note-taking assistant
cycle - Modern and safe symbolic mathematics