smartstring
fst
smartstring | fst | |
---|---|---|
7 | 11 | |
482 | 1,712 | |
- | - | |
0.0 | 3.5 | |
8 months ago | 4 months ago | |
Rust | Rust | |
Mozilla Public License 2.0 | The Unlicense |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
smartstring
-
Does using "String" instead of "&str" a lot results in unoptimised code?
Your use case sounds like it will involve a lot of small strings that use a subset of UTF-8. If you’re concerned about performance, you could look into something like smartstring. Sixbit also looks interesting, but it looks like it won’t give you any more characters and it’d probably require additional computation to do the conversion (and they’d have to be converted back out).
-
Rust Is Hard, Or: The Misery of Mainstream Programming
> If you have a long-running async function, then pass parameters by value! If you have a polymorphic async function, then return your result in a Box.
I've taken to making heavy use of the smallvec and smartstring crates for this. Most lists and strings are small in practice. Using smallvec / smartstring lets you keep most clone() calls allocation-free. This in turn lets you use owned objects, which are easier to reason about - for you and the borrow checker. And you keep a lot of the performance of just passing around references.
I tried to use async rust a couple of years ago, and fell on my face in the process. Most of my rust at the moment is designed to compile to wasm - and then I'm leaning on nodejs for networking and IO. Writing async networked code is oh so much easier to reason about in javascript. When GAT, TAIT and some other language features to fix async land I'll muster up the courage to make another attempt. But rust's progress at fixing these problems feels painfully slow.
https://crates.io/crates/smallvec / https://crates.io/crates/smartstring
-
GitHub - epage/string-benchmarks-rs: Comparison of Rust string types
Just to point out, smartstring no longer assumes String memory layout. From the changelog:
-
Why is str not just [char]?
There's some really good crates that implement SSO floating around - eg, SmartString. But I agree - its a pity they're needed. Swift built this into the core string type in the language. I think that was the right call.
-
Announcing `compact_str`! A super memory efficient immutable string that is transparently stored on the stack, when possible
Comparatively: * SmolStr can inline up to 22 bytes but does not adjust down for 32-bit architectures, meaning it's potentially wasting memory on 32-bit archs. Similarly though it's immutable and Clone is O(1) * SmartString can inline up to 23 bytes, but it's mutable and Clone is O(n). Also this crate makes assumptions about the memory layout of a String, which in theory should be fine, but is a slight caveat.
-
Version 0.19.15 released.
SmartString is used to store identifiers (which tends to be short, fewer than 23 characters, and ASCII-based) because they can usually be stored inline. Map keys now also use SmartString.
-
Speed of Rust vs. C
I’ve been using smartstrings, which is both excellent and maintained. https://github.com/bodil/smartstring
fst
- fst: Represent large sets and maps compactly with finite state transducers
-
Creating a perfect HashMap from string keys known in advance
I'd point you towards BurntSushi's fst crate: https://github.com/BurntSushi/fst
-
How to use mmap safely in Rust?
The fst crate effectively relies on mmap for it to work right. The folks here suggesting you just use the heap might be right, but only if using the heap is actually plausible. If your dictionary is GBs big (an FST might be bigger than available memory), then copying it the heap first would be disastrous.
-
Official /r/rust "Who's Hiring" thread for job-seekers and job-offerers [Rust 1.64]
You'll love what we're working on if you're interested in the implementation of:- Tantivy- Meilisearch- Finite State Transducers
-
rustc is unacceptably slow compiling long lists of constant slices
Here's an example of longest prefix matching using a FST which I based my approach on: https://github.com/BurntSushi/fst/pull/104/files
-
Official /r/rust "Who's Hiring" thread for job-seekers and job-offerers [Rust 1.63]
Finite State Transducers
-
Wikit Desktop - A dictionary application using tauri GUI framework
As a result, I have a plan to implement a desktop version from then and I finished today with a beta version. The desktop is based on tauri, and the dictionary index algorithm is FST (it is an awesome index algorithm).
-
WordBueno.com online dictionary. Fast, no frills, mobile friendly.
WordBueno’s data is currently derived from Wiktionary. The backend is using Rust’s warp with fst for indexing.
- Show HN: WordBueno: sleek dictionary built with Rust and Svelte
-
Speed of Rust vs. C
No you don't. I've written multiple programs that load things instantly off the file system via memory maps. See the fst crate[1], for example, which is designed to work with memory maps.
Rust "works badly with memory mapped files" doesn't mean, "Rust can't use memory mapped files." It means, "it is difficult to reconcile Rust's safety story with memory maps." ripgrep for example uses memory maps because they are faster sometimes, and its safety contract[2] is a bit strained. But it works.
[1] - https://github.com/BurntSushi/fst/
[2] - https://docs.rs/grep-searcher/0.1.7/grep_searcher/struct.Mma...
What are some alternatives?
smol_str
libskry_r - Lucky imaging library
compact_str - A memory efficient string type that can store up to 24* bytes on the stack
rust-fnv - Fowler–Noll–Vo hash function
min-sized-rust - 🦀 How to minimize Rust binary size 📦
itoa - Fast function for printing integer primitives to a decimal string
redgrep - ♥ Janusz Brzozowski
bitter - Extract bits from a byte slice
tao - The TAO of cross-platform windowing. A library in Rust built for Tauri.
warp - A super-easy, composable, web server framework for warp speeds.