pykakasi
Lightweight converter from Japanese Kana-kanji sentences into Kana-Roman. (by miurahr)
toiro
A comparison tool of Japanese tokenizers (by taishi-i)
Our great sponsors
pykakasi | toiro | |
---|---|---|
1 | 1 | |
350 | 112 | |
- | - | |
5.1 | 5.2 | |
almost 2 years ago | 9 months ago | |
Python | Python | |
GNU General Public License v3.0 only | Apache License 2.0 |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pykakasi
Posts with mentions or reviews of pykakasi.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-08-04.
-
Any recommendations for a good Japanese NLP engine?
I have built a prototype application for helping me learn japanese which does the following using kakasi.
toiro
Posts with mentions or reviews of toiro.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-08-04.
-
Any recommendations for a good Japanese NLP engine?
Thank you! I have also been looking at Toiro which is not a NLP but a comparison tool, and it includes MeCab. You can use it to install all Japanese language parsers (that it knows about) and then run tests on your data set. Right now I'm running each one on the game script I have and see which one is best.
What are some alternatives?
When comparing pykakasi and toiro you can also consider the following projects:
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
pythainlp - Thai Natural Language Processing in Python.
jiten - jiten - japanese android/cli/web dictionary based on jmdict/kanjidic — 日本語 辞典 和英辞典 漢英å—典 和独辞典 和è˜è¾žå…¸
jProcessing - Japanese Natural Langauge Processing Libraries
jmdict-kindle - Japanese - English dictionary for Kindle based on the JMdict / EDICT database
uniunihan-db - Chinese character dictionary for learning Sino-xenic languages
mahjong - Implementation of riichi mahjong related stuff (hand cost, shanten, agari end, etc.)