encoding_rs
icu4x
Our great sponsors
encoding_rs | icu4x | |
---|---|---|
8 | 25 | |
357 | 1,250 | |
- | 2.6% | |
6.3 | 9.8 | |
3 days ago | 1 day ago | |
Rust | Rust | |
GNU General Public License v3.0 or later | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
encoding_rs
-
This Program is Illegally Packaged in 14 Distributions
Author of ripgrep here.
Maybe, umm, don't depend on auto-detection tools as the ultimeate source of whether something is appropriately licensed or not? encoding_rs is clearly licensed, otherwise I wouldn't have used it in ripgrep: https://github.com/hsivonen/encoding_rs
Take it up with the auto-detection tools. Or at least do your due diligence before claiming ripgrep is using "unlicensed" code.
- Learning to Parse Text Files: BOM!
-
Yore - library for decoding/encoding character sets according to OEM code pages
You might also want to include tests, https://github.com/hsivonen/encoding_rs/tree/master/src/test_data could be used as a reference.
-
Show HN: High-speed UTF-8 validation in Rust
That's not the only use of SIMD in the crate (e.g. see https://github.com/hsivonen/encoding_rs/blob/e98a2096ab09c92...), but I haven't looked into exactly where/how it's used further.
icu4x
-
Any new Opensource projects in (rust) looking for contributors. I want to start my journey as an OSS contributor.
ICU4X has a large priority backlog which are "issues that the team wants to definitely see fixed, but which currently lack resourcing."
- icu4x: pure rust implementation of the unicode ICU library
-
Self-referential types for fun and profit
this also (probably) means it's safe from LLVM-noalias unsoundness, though it still runs into the same Rust-level unsoundness
- ICU4X: Solving Internationalization for Clients and Limited Environments
-
uni-algo v0.5.0: Modern Unicode Library
Actually, the Rust version also offers multiple modes, see comparison.rs:
-
Announcing ICU4X 1.0 – New Internationalization Library from Unicode
It's generated from https://github.com/unicode-org/icu4x/blob/main/provider/datagen/data/segmenter/dictionary_cj.toml, which in turn comes from ICU4C.
-
The Unicode Consortium announces ICU4X 1.0, its new high-performance internationalization library. It's written in Rust, with official C++ and JavaScript wrappers available.
The code can be seen at https://github.com/unicode-org/icu4x, I count 193 uses of unsafe, though not all are the keyword, and some are in tests.
- icu4x: Can we have `rustc_layout_scalar_valid_range_end` on stable. Lang team: You have `rustc_layout_scalar_valid_range_end` on stable. `rustc_layout_scalar_valid_range_end` on stable:
-
Not a Yoking Matter (Zero-Copy #1)
We've got an issue filed about noalias UB in Yoke.
-
Chinese numerals are not recognized by char::is_numeric
As a reference for his expertise: he's part of team that develops https://github.com/unicode-org/icu4x
What are some alternatives?
serde - Serialization framework for Rust
Fluent - Rust implementation of Project Fluent
simdjson - Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
I18N - I18N Library for .NET, and Delphi
simdutf8 - SIMD-accelerated UTF-8 validation for Rust.
cldr - The home of the Unicode Common Locale Data Repository
sqloxide - Python bindings for sqlparser-rs
rust_icu - rust_icu: rust bindings for ICU (International Components for Unicode) library
plover - Open source stenotype engine
verona - Research programming language for concurrent ownership
fontdue - The fastest font renderer in the world, written in pure rust.
UNIC - UNIC: Unicode and Internationalization Crates for Rust