unicode-transforms vs text-icu

unicode-transforms

Fast Unicode normalization in Haskell (by composewell)

Source Code

Suggest alternative

Edit details

text-icu

This package provides the Haskell Data.Text.ICU library, for performing complex manipulation of Unicode text. (by haskell)

Data Text

Source Code

Suggest alternative

Edit details

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

unicode-transforms		text-icu
	Project
1	Mentions	2
47	Stars	46
-	Growth	-
2.5	Activity	6.1
6 months ago	Latest Commit	17 days ago
Haskell	Language	Haskell
BSD 3-clause "New" or "Revised" License	License	BSD 3-clause "New" or "Revised" License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

unicode-transforms

Posts with mentions or reviews of unicode-transforms. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-04-17.

[ANN] unicode-collation 0.1
3 projects | /r/haskell | 17 Apr 2021

Thanks! Here's a puzzle. Profiling shows that about a third of the time in my code is spent in normalize from unicode-transforms. (Normalization is a required step in the algorithm but can be omitted if you know that the input is already in NFD form.) And when I add a benchmark that omits normalization, I see run time cut by a third. But text-icu's run time in my benchmark doesn't seem to be affected much by whether I set the normalization option. I am not sure how to square that with the benchmarks here that seem to show unicode-transforms outperforming text-icu in normalization. text-icu's documentation says that "an incremental check is performed to see whether the input data is in FCD form. If the data is not in FCD form, incremental NFD normalization is performed." I'm not sure exactly what this means, but it may mean that text-icu avoids normalizing the whole string, but just normalizes enough to do the comparison, and sometimes avoids normalization altogether if it can quickly determine that the string is already normalized. I don't see a way to do this currently with unicode-transforms.

text-icu

Posts with mentions or reviews of text-icu. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-04-17.

[ANN] unicode-collation 0.1
3 projects | /r/haskell | 17 Apr 2021

Until now, the only way to do proper Unicode sorting in Haskell was to depend on text-icu, which wraps the C library icu4c. However, there are disadvantages to depending on an external C library. In addition, the last release of text-icu was in 2015, and since then there have been changes to icu4c that cause build-failures, as noted in this issue.
Haskell ghost knowledge; difficult to access, not written down
13 projects | /r/haskell | 24 Jan 2021

text-icu occasionally breaks horribly and nondeterministically (1, 2) despite being praised by SOTU

What are some alternatives?

When comparing unicode-transforms and text-icu you can also consider the following projects:

with-utf8 - Get your IO right on the first try

text-stream-decode - Streaming decoding functions for UTF encodings.

refined - Refinement types with static checking

gps2htmlReport - Generates a HTML page report detailing a GPS journey, with charts, statistics and an OpenStreetMap graphic.

hashable - A class for types that can be converted to a hash value

text-trie - An efficient finite map from Text to values, based on bytestring-trie.

jump - Jump start your Haskell development

observable-sharing - Observable sharing for haskell

code-builder - Packages for defining APIs, running them, generating client code and documentation.

hebrew-time - Hebrew dates and prayer times.

critbit - A Haskell implementation of crit-bit trees.

text-binary - Binary instances for strict and lazy Text data types

unicode-transforms vs with-utf8 text-icu vs text-stream-decode unicode-transforms vs refined text-icu vs gps2htmlReport unicode-transforms vs hashable text-icu vs text-trie unicode-transforms vs jump text-icu vs observable-sharing unicode-transforms vs code-builder text-icu vs hebrew-time unicode-transforms vs critbit text-icu vs text-binary

Compare unicode-transforms vs text-icu and see what are their differences.

unicode-transforms

text-icu

unicode-transforms

text-icu

What are some alternatives?