A byte string library for Rust

bstr

10 744 6.7 Rust

A string type for Rust that is not required to be valid UTF-8.

> It's especially helpful that the library doesn't require you to opt into its own dedicated types, and instead defines extension methods on existing types.
Fun fact: bstr 0.1 went the route of defining its own dedicated types! See: https://docs.rs/bstr/0.1.4/bstr/
But it did indeed quickly prove to be pretty annoying, because you still really want to use &[u8] in places since it's so ubiquitous. And to get access to the byte string methods, you had to explicitly convert it to another type.
The reason why I went that route initially was so you'd always get the good Debug impl. But it ended up not being worth it sadly. This issue discusses it a bit more: https://github.com/BurntSushi/bstr/issues/5
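To make the trade-off concrete, here is a minimal std-only sketch of the two designs. The names `ByteSliceExt`, `find_bytes`, `to_text_lossy` and `ByteStr` are hypothetical and chosen for illustration; they are not bstr's actual API, and the search is deliberately naive.

```rust
/// Hypothetical extension trait (illustration only, not bstr's API).
/// Implementing it for `[u8]` makes the methods available on any byte slice.
trait ByteSliceExt {
    /// Index of the first occurrence of `needle`, if any.
    fn find_bytes(&self, needle: &[u8]) -> Option<usize>;
    /// Render as text, replacing invalid UTF-8 with U+FFFD.
    fn to_text_lossy(&self) -> String;
}

impl ByteSliceExt for [u8] {
    fn find_bytes(&self, needle: &[u8]) -> Option<usize> {
        // `windows` panics on a size of 0, so handle the empty needle first.
        if needle.is_empty() {
            return Some(0);
        }
        // Naive search; a real library would use an optimized routine.
        self.windows(needle.len()).position(|w| w == needle)
    }

    fn to_text_lossy(&self) -> String {
        String::from_utf8_lossy(self).into_owned()
    }
}

/// The alternative design: a dedicated (hypothetical) wrapper type.
/// Callers holding a plain `&[u8]` must convert before using its methods.
struct ByteStr<'a>(&'a [u8]);

impl<'a> ByteStr<'a> {
    fn find_bytes(&self, needle: &[u8]) -> Option<usize> {
        // Delegates to the extension method on the wrapped slice.
        self.0.find_bytes(needle)
    }
}
```

With the extension trait in scope, `raw.find_bytes(b"bar")` works directly on a `&[u8]`, while the wrapper design needs `ByteStr(raw).find_bytes(b"bar")` at every call site. The wrapper's advantage is that it could carry its own Debug impl, which is the motivation described above.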

rust-memchr

29 758 7.7 Rust

Optimized string search routines for Rust.

I think that would be an interesting API design to explore, absolutely. I think you'll have a lot of issues making it fast though. There is a fair bit of SIMD going on under the hood in both the substring routines and the UTF-8 validation routines, for example. Building APIs around iterator adapters that munch one byte at a time is difficult to square with SIMD optimizations that want to operate on a whole bunch of bytes at a time.
Consider, for example, how you might use a routine like memchr[1] if all of your public APIs are generic iterator adapters.
And then once you get into things like regex engines, modifying them to work on Iterator<Item=u8> is a highly non-trivial affair. It is of course possible to write a regex engine that works on such things, but it's going to be limited in performance or capabilities. The way to make regexes and streaming work together is probably something more like Iterator<Item=&[u8]> (which is perhaps roughly analogous to what Hyperscan does). You really want blocks of bytes, not one-byte-at-a-time.
[1]: https://github.com/BurntSushi/memchr/blob/8037d11b4357b0f07b...
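As a rough illustration of the difference between the two API shapes, the sketch below searches for a single byte first through an iterator of individual bytes and then through an iterator of byte blocks. Both function names are hypothetical. The per-block search uses a plain std loop as a stand-in; in real code that is the spot where a slice-based routine such as `memchr::memchr(needle, haystack)` could be called, which is not possible in the byte-at-a-time version because it never sees a contiguous slice.

```rust
/// Byte-at-a-time search over any iterator of bytes (hypothetical helper).
/// Each step sees exactly one byte, so there is no slice to hand to a
/// vectorized routine.
fn find_byte_streaming<I>(needle: u8, bytes: I) -> Option<usize>
where
    I: Iterator<Item = u8>,
{
    for (i, b) in bytes.enumerate() {
        if b == needle {
            return Some(i);
        }
    }
    None
}

/// Block-at-a-time search over an iterator of byte slices (hypothetical helper).
/// Returns the offset of the first match relative to the start of the stream.
fn find_byte_in_blocks<'a, I>(needle: u8, blocks: I) -> Option<usize>
where
    I: Iterator<Item = &'a [u8]>,
{
    let mut offset = 0;
    for block in blocks {
        // Std-only stand-in for a slice-based search; this is where an
        // optimized routine operating on the whole block would go.
        if let Some(i) = block.iter().position(|&b| b == needle) {
            return Some(offset + i);
        }
        offset += block.len();
    }
    None
}
```

For example, `find_byte_in_blocks(b'w', data.chunks(5))` walks the input in 5-byte blocks and still reports a position relative to the whole stream, since the running `offset` accounts for the blocks already consumed.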

blog

2 50 5.7 Rust

My blog. (by BurntSushi)
rust-semverver

8 641 1.7 Rust

Discontinued. Automatic checking for semantic versioning in library crates.

1) No. I think semver is just fine for its intended purpose. I mean, I'm sure its spec could be improved in various ways, but its fundamental idea seems fine to me. I think it's just important to remember that semver is a means to an end, and not an end itself. It is a tool of communication most useful in a decentralized context.
2) No.
3) See: https://github.com/rust-lang/rust-semverver --- But also, this is only ever going to be a "best effort" sort of thing. Semver isn't just about method additions or deletions, but also behavior.
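A small sketch of the behavior point: the two functions below stand for two releases of the same hypothetical library function. Their signatures are identical, so a tool that only compares public API surface would see no change, yet callers get different results for input with CRLF line endings. The names `lines_v1` and `lines_v1_1` are invented for this example.

```rust
/// Hypothetical "1.0" behavior: split on b'\n' only.
fn lines_v1(input: &[u8]) -> Vec<&[u8]> {
    input.split(|&b| b == b'\n').collect()
}

/// Hypothetical "1.1" behavior: same signature, but a trailing b'\r' is
/// now stripped from each line. No item was added or removed, yet the
/// output changes for callers that relied on the old result.
fn lines_v1_1(input: &[u8]) -> Vec<&[u8]> {
    input
        .split(|&b| b == b'\n')
        .map(|line| match line {
            // Drop a single trailing carriage return if present.
            [rest @ .., b'\r'] => rest,
            _ => line,
        })
        .collect()
}
```

Whether such a change counts as a fix or a breaking change is a judgment call the maintainer has to communicate, which is why automated checking can only be best effort.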

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Related posts

Memchr: Optimized string search routines for Rust
1 project | news.ycombinator.com | 13 Jan 2024

Rust-Cache
1 project | news.ycombinator.com | 4 Dec 2023

Rust memchr adds aarch64 SIMD with impressive speedups
1 project | news.ycombinator.com | 29 Aug 2023

Ripgrep now twice as fast on Apple Silicon with new aarch64 SIMD implementations
1 project | news.ycombinator.com | 28 Aug 2023

Chunking strings in Elixir: how difficult can it be?
2 projects | news.ycombinator.com | 4 Jan 2023

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Bytes Rust Unicode memchr byte-string
Post date: 8 Sep 2022
