Our great sponsors
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
While I wait, I dig a bit—and it turns out that icu does not yet have normalization available (although it's marked with a milestone goal of 0.5): https://github.com/unicode-org/icu4x/issues/972
There is also rust-unic which provides both normalization and access to the character database. I have also used this because of their text segmentation support, and I would probably recommend rust-unic in general. I hope to see more progress on that front.
That's odd, because it doesn't seem to use icu-config. Are you using an outdated version or something?
Otherwise the current state is pretty much fragmented. There is num-format just for formatting numbers. For currencies, you have iso_currency. simple_locale helps with getting information about your locale. You can definitely find a few more.
Related posts
- Unicode sorting is hard & why browsers added special emoji matching to regexp
- icu4x: pure rust implementation of the unicode ICU library
- ICU4X: Solving Internationalization for Clients and Limited Environments
- uni-algo v0.5.0: Modern Unicode Library
- Announcing ICU4X 1.0 – New Internationalization Library from Unicode