Uniunihan-db Alternatives

Similar projects and alternatives to uniunihan-db based on common topics and language

kengdic

Mentions: 1 | Stars: 101 | Activity: 0.0 | Language: Python

Joe Speigle's Korean/English dictionary database
pykakasi

Mentions: 1 | Stars: 350 | Activity: 5.1 | Language: Python

Discontinued. Lightweight converter from Japanese kana-kanji sentences into kana or romaji.
buondua-downloader

Mentions: 2 | Stars: 37 | Activity: 0.0 | Language: Python

Discontinued. NSFW. Album downloader for https://buondua.com.
unihan-etl

Mentions: 1 | Stars: 51 | Activity: 9.5 | Language: Python

Export the UNIHAN database to CSV, JSON, or YAML
python-jamo

Mentions: 1 | Stars: 94 | Activity: 10.0 | Language: Python

Hangul syllable decomposition and synthesis using jamo.
ark-pixel-font

Mentions: 1 | Stars: 2,954 | Activity: 9.2 | Language: Python

Open source Pan-CJK pixel font / 开源的泛中日韩像素字体
unihandecode

Mentions: 1 | Stars: 71 | Activity: 2.3 | Language: Python

Discontinued. unihandecode is a transliteration library that converts all Unicode characters and words into the ASCII alphabet and is aware of language preference priorities.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better uniunihan-db alternative or higher similarity.


uniunihan-db reviews and mentions

Posts with mentions or reviews of uniunihan-db. We have used some of these posts to build our list of alternatives and similar projects.

Office of the President of Mongolia (top to bottom text on the web)
1 project | news.ycombinator.com | 24 Apr 2023

I loved learning to read Japanese through the second volume of Heisig's _Remembering the Kanji_. Volume 1, which teaches only meanings, is a slog, but volume 2, which teaches the Sino-Japanese readings, is a beautiful example of organizing material to minimize entropy and maximize benefit for memorization as soon as possible. Unfortunately he never put together a volume 2 for a Chinese language. I haven't worked on it in a while, but I have a project where I attempt to re-create the book for Japanese as well as Mandarin, Korean, and Vietnamese: https://nateglenn.com/uniunihan-db/ (repo: https://github.com/garfieldnate/uniunihan-db).
The "pure groups" are the ones where the presence of a specific radical guarantees a specific pronunciation (within the list of character/pronunciation pairs you're trying to learn). Of the 4800 characters I used for the volume, only 290 are in the chapter on pure groups. The rest are either in semi-regular groups with varying numbers of exceptions, or in completely irregular groups with no discernible patterns.
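To make the three kinds of groups concrete, here is a minimal Python sketch of how such a classification might work. It is not taken from the uniunihan-db code: the function name `classify_groups`, the `majority` threshold separating semi-regular from irregular groups, and the tiny data set are all assumptions made for illustration, and each character is reduced to a single romanized on'yomi, which glosses over characters with several readings.

```python
from collections import Counter, defaultdict

# Toy data: (character, shared component, one romanized on'yomi).
# Hypothetical and simplified: real characters often have several readings.
ENTRIES = [
    ("中", "中", "chuu"), ("仲", "中", "chuu"), ("忠", "中", "chuu"), ("沖", "中", "chuu"),
    ("青", "青", "sei"), ("清", "青", "sei"), ("晴", "青", "sei"), ("精", "青", "sei"),
    ("情", "青", "jou"),
    ("各", "各", "kaku"), ("格", "各", "kaku"), ("客", "各", "kyaku"), ("絡", "各", "raku"),
    ("路", "各", "ro"),
]


def classify_groups(entries, majority=0.5):
    """Group characters by component and label each group.

    "pure": every member shares one reading.
    "semi-regular": one reading covers more than `majority` of the members
    (the threshold is an assumption, not a rule from the original post).
    "irregular": no reading dominates.
    Returns {component: (label, most_common_reading, exceptions)}.
    """
    groups = defaultdict(list)
    for char, component, reading in entries:
        groups[component].append((char, reading))

    result = {}
    for component, members in groups.items():
        counts = Counter(reading for _, reading in members)
        top_reading, top_count = counts.most_common(1)[0]
        if top_count == len(members):
            label = "pure"
        elif top_count / len(members) > majority:
            label = "semi-regular"
        else:
            label = "irregular"
        # Members whose reading differs from the dominant one.
        exceptions = [char for char, reading in members if reading != top_reading]
        result[component] = (label, top_reading, exceptions)
    return result


if __name__ == "__main__":
    for component, (label, reading, exceptions) in classify_groups(ENTRIES).items():
        print(component, label, reading, "".join(exceptions))
```

With this toy data, the 中 group comes out pure, the 青 group semi-regular with 情 as its one exception, and the 各 group irregular. Note that "pure" here only holds relative to the list being classified, which matches the caveat above about the set of character/pronunciation pairs you are trying to learn.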
The characters were designed continuously over a period of time starting thousands of years ago, and the phonetic parts were sometimes exact and sometimes just clues, similar sounds or rhymes to give the reader a hint. Ancient Chinese pronunciation has changed beyond recognition, so it makes perfect sense that the pronunciations wouldn't be regular anymore.
Mainland China uses a "simplified" character set, which did not affect literacy but in my opinion is a bit more difficult to read: they reduced the number of lines, so more characters look samey, and they combined many (Mandarin) homonyms (https://en.wikipedia.org/wiki/Simplified_Chinese_characters#...), removing the meaning portion of characters that would have distinguished them. The simplification did not apply to all characters, so to achieve a high level of literacy you need to know traditional forms anyway.
It would be interesting to see someone try to actually remodel hanzi from scratch for a specific dialect of Chinese, using 100% regular phonetic components and no variants; a character with multiple pronunciations in the current system would need a distinct written form for each one. An interesting example of this would be certain Korean gukja, where they've combined a Chinese character with a phonetic hangeul (example: https://en.wiktionary.org/wiki/%E3%AB%87). This would be a truly simplified Chinese character set... but all of the culture's history that gets built into spelling over time would be completely lost, which is why I always prefer conservative spelling systems.