wakaranai
janome
wakaranai | janome | |
---|---|---|
1 | 2 | |
0 | 828 | |
- | - | |
7.0 | 5.2 | |
10 months ago | 11 months ago | |
Python | Python | |
GNU General Public License v3.0 only | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
wakaranai
-
wakaranai (わからない) — test your hiragana and katakana skills
GitHub: https://github.com/guidanoli/wakaranai
janome
- [discussion] Open AI api translations
-
[Computer Stuff] What's the best way to split a Japanese sentence into "words"?
I did program stuff like that a bit in Korean and Japanese. So, in short, these tools/libraries are called 'Tokenizers'. I.e. search for "Japanese tokenizer", it will also tell you that MeCab is one of them. There is no good/easy way to split words in Japanese with simple algorithms, so these libraries, that are based on statistics or AI, will be your only choice. There is a good example sentence that shows how futile this would be without those libraries: "すもももももももものうち". From here.
What are some alternatives?
Kanamoji - Learn Japanese kana with a simple quiz game
kanji-data - A JSON kanji dataset with updated JLPT levels and WaniKani information
tika-python - Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
asian-comprehension-worksheet-generator - Create worksheet to learn Asian language (eg. Chinese) and practice reading and writing in grid format. Perfect tool for kid and beginner.
languagepod101-scraper - Python scraper for Language Pods such as Japanesepod101.com :japanese_ogre: :japan: :sushi: Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many more! ✨
skweak - skweak: A software toolkit for weak supervision applied to NLP tasks
scattertext - Beautiful visualizations of how language differs among document types.
transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.