ziglyph vs utfcpp

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

ziglyph		utfcpp
	Project
5	Mentions	3
207	Stars	1,431
-	Growth	-
6.7	Activity	7.3
7 months ago	Latest Commit	4 months ago
Zig	Language	C++
MIT License	License	Boost Software License 1.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

ziglyph

Posts with mentions or reviews of ziglyph. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-01-17.

What are your favorite utility libraries?
1 project | /r/Zig | 21 Feb 2023
Failing to Learn Zig via Advent of Code
17 projects | news.ycombinator.com | 17 Jan 2022

> My big problem with Zig is that Andrew Kelley is promising a lot of features, but doesn't really deliver much.
Have you, like, seen the release notes for 0.9.0?
https://ziglang.org/download/0.9.0/release-notes.html
> Zig still can't proper handle UTF-8 strings [1] in 2022
There's plenty of discussion on the subject in basically every HN thread about Zig: the stdlib has some utf8 and wtf validation code, ziglyph implements the full unicode spec.
https://github.com/jecolon/ziglyph
You might not like how it's done, but its factually incorrect to state that Zig can't handle unicode.
> In a `recent` interview[2], he claims that Zig is faster than C and Rust, but he refers to extremely short benchmarking that has almost no value in the real world.
From my reddit reply to this same topic:
This podcast interview might not be the best showcase of the practical implications of Zig's take on safety and performance. If you want something with more meat, I highly recommend Andrew's recent talk from Handmade Seattle, where he shows the work being done on the Zig self-hosted compiler.
https://media.handmade-seattle.com/practical-data-oriented-d...
Lots of bit fiddling that can't be fully proven safe statically, but then you get a compiler capable of compiling Zig code stupidly fast, and that's even without factoring in incremental compilation with in-place binary patching, with which we're aiming for sub-millisecond rebuilds of arbitrarily large projects.
> The ecosystem for zig is insignificant now and a stable release would help the language.
I hope you don't mind if we don't take this advice, given the overall tone of your post.
Resizable string in Zig?
2 projects | /r/Zig | 16 Nov 2021

For Unicode text processing you can take a look at Ziglyph https://github.com/jecolon/ziglyph and for a sample UTF-8 string structure, Zigstr https://github.com/jecolon/zigstr . (bias alert: I'm the author of both. :^D )
Maintain It with Zig
16 projects | news.ycombinator.com | 8 Sep 2021

Agreed, and Zig also has a lib for that as well:
https://github.com/jecolon/ziglyph/
Unicode data file compression: achieving 40-70% reduction over gzip alone
1 project | news.ycombinator.com | 4 Jul 2021

Yes, sorry about that - I omitted a bit of that information for brevity.
If you want to play with allkeys.txt (which is by far much more sequential, simpler data than UnicodeData.txt) then you only need to remove the non-NFD strings (since the Unicode Collation Algorithm's first step requires you to decompose the string's code points to canonical NFD form), that removes ~2,000 entries.
The full file parser code, which strips those out and other useless information like comments and version information can be found here: https://github.com/jecolon/ziglyph/blob/main/src/collator/Al...
If you want to play around with UnicodeData.txt (which is less sequential, more complex data) then only two fields are used (the code point and decomposition field), and only records where the second field is not empty (the full decomposition type name in angle brackets is not needed, only whether it is or is not there is important.)
The full parser code for that file can be found here: https://github.com/jecolon/ziglyph/blob/main/src/normalizer/...
Hope that helps!

utfcpp

Posts with mentions or reviews of utfcpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-03-17.

Current utf8 support options.
1 project | /r/cpp_questions | 16 Feb 2023

std::string is simply a string of bytes, so can already contain utf-8 encoded text. The only problem is when you want to interact with OS (Windows) API and other library APIs that don't expect utf-8 and when you need to count number of characters etc. For that you can look into existing libraries, e.g. the official Unicode ICU or whatever you can find that others have made, e.g.: https://github.com/nemtrif/utfcpp
How to cout a non-ASCII character within a non-ASCII string
2 projects | /r/cpp_questions | 17 Mar 2022

Suffice it to say, this is a mess. However, there are libraries that make this easier.
Maintain It with Zig
16 projects | news.ycombinator.com | 8 Sep 2021

> I've always tried as much as possible to treat strings as just opaque data and never look into them, which tends to work well, but in some domains you really need to look at and massage the characters/codepoints/grapheme clusters, and the lack of a first-citizen UTF-8-aware string type is, I think, a bit unfortunate in this day and age.
You don't need a UTF-8 type for that, you just need routines that handle UTF-8 strings, like utfcpp (https://github.com/nemtrif/utfcpp).

What are some alternatives?

When comparing ziglyph and utfcpp you can also consider the following projects:

zig-string - A String Library made for Zig

icu - The home of the ICU project source code.

zigstr - Zigstr is a UTF-8 string type for Zig programs.

dstep - A tool for converting C and Objective-C headers to D modules

RIIR - why not Rewrite It In Rust

arocc - A C compiler written in Zig.

zig - General-purpose programming language and toolchain for maintaining robust, optimal, and reusable software.

cc-rs - Rust library for build scripts to compile C/C++ code into a Rust library

mach - zig game engine & graphics toolkit

ziglyph vs zig-string utfcpp vs icu ziglyph vs zigstr utfcpp vs dstep ziglyph vs RIIR utfcpp vs arocc ziglyph vs zig utfcpp vs cc-rs ziglyph vs arocc utfcpp vs zigstr ziglyph vs mach utfcpp vs RIIR

Compare ziglyph vs utfcpp and see what are their differences.

ziglyph

utfcpp

ziglyph

utfcpp

What are some alternatives?