python-jamo vs uniunihan-db

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

python-jamo		uniunihan-db
	Project
1	Mentions	1
94	Stars	4
-	Growth	-
10.0	Activity	4.7
over 1 year ago	Latest Commit	2 months ago
Python	Language	Python
Apache License 2.0	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

python-jamo

Posts with mentions or reviews of python-jamo. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-03-02.

Creating a Korean character set for use in a game.
2 projects | /r/Korean | 2 Mar 2023

There are plenty of libraries around that can compose/decompose Hangul syllables, so might be worth looking around and seeing what's already implemented in your language: Python example and JavaScript example

uniunihan-db

Posts with mentions or reviews of uniunihan-db. We have used some of these posts to build our list of alternatives and similar projects.

Office of the President of Mongolia (top to bottom text on the web)
1 project | news.ycombinator.com | 24 Apr 2023

I loved learning to read Japanese through the second volume of Heisig's _Learning the Kanji_. Volume 1, which teaches only meanings, is a slog, but volume 2, which teaches the Sino-Japanese readings is a beautiful example of organizing material to minimize entropy and maximize benefit for memorization as soon as possible. Unfortunately he never put together a volume 2 for a Chinese language. I haven't worked on it in a while, but I have a project where I attempt re-create the book for Japanese as well as Mandarin, Korean, and Vietnamese: https://nateglenn.com/uniunihan-db/ (repo: https://github.com/garfieldnate/uniunihan-db).
The "pure groups" are the ones where the presence of a specific radical guarantee you a specific pronunciation (within the list of character/pronunciation pairs you're trying to learn). Of the 4800 characters I used for the volume, only 290 are in the chapter on pure groups. The rest are either in semi-regular groups with varying numbers of exceptions, or in completely irregular groups with no discernible patterns.
The characters were designed continuously over a period of time starting thousands of years ago, and the phonetic parts were sometimes exact and sometimes just clues, similar sounds or rhymes to give the reader a hint. Ancient Chinese pronunciation has changed beyond recognition, so it makes perfect sense that the pronunciations wouldn't be regular anymore.
Mainland China uses a "simplified" character set, which did not affect literacy but in my opinion is a bit more difficult to read; they reduced the number of lines so that more characters look samey and they combined many (Mandarin) homonyms (https://en.wikipedia.org/wiki/Simplified_Chinese_characters#...), removing the meaning portion of characters that would have distinguished them. The simplification did not apply to all characters, so to achieve a high level of literacy you need to know traditional forms, anyway.
It would be interesting to see someone try to actually remodel hanzi from scratch for a specific dialect of Chinese, using 100% regular phonetic components and no variants; multiple pronunciations of a character in the current system would be required to be written differently. An interesting example of this would be certain Korean gukja, where they've combined a Chinese character with a phonetic hangeul (example: https://en.wiktionary.org/wiki/%E3%AB%87). This would be a truly simplified Chinese character set... but all of the culture's history that gets built into spelling over time would be completely lost, which is why I always prefer conservative spelling systems.

What are some alternatives?

When comparing python-jamo and uniunihan-db you can also consider the following projects:

kss - KSS: Korean String processing Suite

kengdic - Joe Speigle's Korean/English dictionary database

hangul-js - JavaScript functions to manipulate Hangul text

pykakasi - Lightweight converter from Japanese Kana-kanji sentences into Kana-Roman.

hangul-jamo - A library to compose and decompose Hangul syllables using Hangul jamo characters

buondua-downloader - :ribbon: NSFW. Album downloader for https://buondua.com.

unihan-etl - Export UNIHAN's database to csv, json or yaml

ark-pixel-font - Open source Pan-CJK pixel font / 开源的泛中日韩像素字体

python-jamo vs kss uniunihan-db vs kengdic python-jamo vs hangul-js uniunihan-db vs pykakasi python-jamo vs hangul-jamo uniunihan-db vs buondua-downloader python-jamo vs kengdic uniunihan-db vs unihan-etl uniunihan-db vs ark-pixel-font

Compare python-jamo vs uniunihan-db and see what are their differences.

python-jamo

uniunihan-db

python-jamo

uniunihan-db

What are some alternatives?