Our great sponsors
-
HanziGraph
A webapp to visualize relationships among Chinese characters and to see example sentences that illustrate their use. Also available for Japanese learners.
-
tatoeba2
Tatoeba is a platform whose purpose is to create a collaborative and open dataset of sentences and their translations.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
hkcancor
Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).
The code is free and open source, and you can find it on GitHub.
To build the tool, I analyzed sentences in the HKCanCor corpus and from Tatoeba to find the most common words. I then created a graph structure where each character is a node and the words are edges. The definitions came from CC-Canto and CEDICT.
To build the tool, I analyzed sentences in the HKCanCor corpus and from Tatoeba to find the most common words. I then created a graph structure where each character is a node and the words are edges. The definitions came from CC-Canto and CEDICT.
Related posts
- Learning kanji through the words that connect them
- Show HN: Learning Chinese and Japanese with graphs and trees
- Visualizing, and learning, the relationships among kanji, words, and morphemes
- Graph representations of Chinese and Japanese characters, words, and lemmas, for language learning (links in comments)
- Free frequency dictionary and study tool to learn hanzi and see how words flow together, available in Simplified, Traditional, or Cantonese