encoding_rs VS simdutf8

Compare encoding_rs vs simdutf8 and see what are their differences.

encoding_rs

A Gecko-oriented implementation of the Encoding Standard in Rust (by hsivonen)

simdutf8

SIMD-accelerated UTF-8 validation for Rust. (by rusticstuff)
Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
encoding_rs simdutf8
8 15
348 505
- 2.2%
6.3 3.9
15 days ago about 2 months ago
Rust Rust
GNU General Public License v3.0 or later GNU General Public License v3.0 or later
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

encoding_rs

Posts with mentions or reviews of encoding_rs. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-08-21.

simdutf8

Posts with mentions or reviews of simdutf8. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-08-05.
  • simdutf: Unicode validation and transcoding at billions of characters per second
    2 projects | /r/rust | 5 Aug 2021
    That's not enough to make it interesting. There's already a porting of it in rust (https://github.com/rusticstuff/simdutf8), and inclusion in the stdlib has already been discussed: the problem is that you can't use simd in all supported targets and conditional compilation/detection is also very tricky.
  • Making Rust Float Parsing Fast: libcore Edition
    10 projects | /r/rust | 17 Jul 2021
    No, libcore uses simple branching code at the moment, see https://github.com/rust-lang/rust/issues/68455. The issue is still actively being worked on. Note, it's not a simple drop in, and there seem to be even faster algorithms. For now there is https://github.com/rusticstuff/simdutf8.
  • What's everyone working on this week (19/2021)?
    15 projects | /r/rust | 9 May 2021
    I will work on simdutf8, either
  • simdutf v0.1.1 - A small step for semver, one giant leap for performance.
    6 projects | /r/rust | 26 Apr 2021
    Now I have to benchmark again, it might be negligable. See also the discussion for this pull request.
    6 projects | /r/rust | 26 Apr 2021
  • Show HN: High-speed UTF-8 validation in Rust
    9 projects | news.ycombinator.com | 21 Apr 2021
    Check the benchmarks section (https://github.com/rusticstuff/simdutf8#Benchmarks), second table. simdutf8 is up to 28 % faster on my Comet Lake CPU. However with pure ASCII clang does something magical with simdjson and it beats my implementation by a lot. GCC-compiled simdjson is slower all around except for a few outliers with short byte sequences.

    The algorithm is the one from simdjson, the main difference is that it uses an extra step in the beginning to align reads to the SIMD block size.

    9 projects | news.ycombinator.com | 21 Apr 2021
  • Incredibly fast UTF-8 validation
    8 projects | /r/rust | 21 Apr 2021
    ARM SIMD intrinsics are currently unstable and many are not yet implemented. But work to add them is ongoing and it should be possible. I will look into it. See issue #1
    8 projects | /r/rust | 21 Apr 2021
    Yes, it is now also listed in the References section. The only difference is that it does 32-byte-aligned reads which proves to be a bit faster even on modern architectures since it is the SIMD register width and reads do not cross cachelines. Also, the compat API flavor checks every 64-byte block if invalid data has been encountered and calculates the error position using std::str::from_utf8().
    8 projects | /r/rust | 21 Apr 2021

What are some alternatives?

When comparing encoding_rs and simdutf8 you can also consider the following projects:

serde - Serialization framework for Rust

sqloxide - Python bindings for sqlparser-rs

simdutf - Unicode routines (UTF8, UTF16, UTF32): billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension. Part of Node.js and Bun.

cxx - Safe interop between Rust and C++

icu4x - Solving i18n for client-side and resource-constrained environments.

simdjson - Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

bumpalo - A fast bump allocation arena for Rust

fast_float - Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12 and WebKit/Safari

feel

fast-float-rust - Super-fast float parser in Rust (now part of Rust core)