Incredibly fast UTF-8 validation

This page summarizes the projects mentioned and recommended in the original post on /r/rust

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • simdutf8

    SIMD-accelerated UTF-8 validation for Rust.

  • Yes, it is now also listed in the References section. The only difference is that it does 32-byte-aligned reads which proves to be a bit faster even on modern architectures since it is the SIMD register width and reads do not cross cachelines. Also, the compat API flavor checks every 64-byte block if invalid data has been encountered and calculates the error position using std::str::from_utf8().

  • stdarch

    Rust's standard library vendor-specific APIs and run-time feature detection

  • You can check the code. Apparently the std implementation uses the OSXSAVE register to check that the OS supports saving AVX/AVX2 registers during context switches and only then enables it. In a non-std context one might not generally be able to depend on the OSXSAVE register.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • rust

    Empowering everyone to build reliable and efficient software.

  • There is an issue for SIMD UTF-8 validation where this was discussed previously.

  • utils

    Utility crates used in RustCrypto

  • I opened a tracking issue for that.

  • reference

    The Rust Reference (by rust-lang)

  • It doesn't. This was changed about a year ago: https://github.com/rust-lang/reference/pull/792

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts