janome
kanji-data
Metric | janome | kanji-data
---|---|---
Mentions | 2 | 1
Stars | 828 | 123
Growth | - | -
Activity | 5.2 | 0.0
Last commit | 11 months ago | about 2 years ago
Language | Python | Python
License | Apache License 2.0 | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
janome
- [discussion] Open AI api translations
- [Computer Stuff] What's the best way to split a Japanese sentence into "words"?
I have programmed things like that a bit for Korean and Japanese. In short, these tools/libraries are called 'tokenizers'. Search for "Japanese tokenizer" and you will also find that MeCab is one of them. There is no good or easy way to split Japanese into words with simple algorithms, so these libraries, which are based on statistics or AI, will be your only choice. A good example sentence shows how futile it would be without them: "すもももももももものうち".
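To make the point about simple algorithms concrete, here is a minimal sketch of a naive greedy longest-match splitter run on that example sentence. The tiny `VOCAB` set, the `greedy_split` helper, and the `EXPECTED` reading (すもも / も / もも / も / もも / の / うち) are assumptions made up for this illustration; they are not taken from janome or any other library.

```python
def greedy_split(text, vocab):
    """Split text by always taking the longest dictionary word at each position."""
    max_len = max(len(word) for word in vocab)
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shrink.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if candidate in vocab:
                words.append(candidate)
                i += size
                break
        else:
            # No dictionary word starts here: emit the single character.
            words.append(text[i])
            i += 1
    return words


# Hypothetical toy dictionary, just large enough for this sentence.
VOCAB = {"すもも", "もも", "も", "の", "うち"}

# The intended reading, assumed here for illustration.
EXPECTED = ["すもも", "も", "もも", "も", "もも", "の", "うち"]
SENTENCE = "".join(EXPECTED)

naive = greedy_split(SENTENCE, VOCAB)
print(naive)              # greedy result
print(naive == EXPECTED)  # False: the particles も are swallowed into もも
```

Both splits use only dictionary words, so the dictionary alone cannot tell them apart; choosing between them needs the kind of statistical or grammatical model the comment describes. With Janome installed, `Tokenizer().tokenize(SENTENCE)` from `janome.tokenizer` is the library equivalent to try.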
kanji-data
- I'm making the kanji learning app that I wish existed.
Oh that sounds interesting, I'd love to try it! For the English character meanings I'm currently using https://github.com/davidluzgouveia/kanji-data
What are some alternatives?
tika-python - Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
JLPT-N5-N1-Japanese-Vocabulary-Anki - Script to generate Japanese JLPT Anki deck used in https://ankiweb.net/shared/info/1550984460
asian-comprehension-worksheet-generator - Create worksheet to learn Asian language (eg. Chinese) and practice reading and writing in grid format. Perfect tool for kid and beginner.
kanjium - The ultimate kanji resource
wakaranai - An educational tool for learning hiragana and katakana
cjkvi-ids - IDS data for CJK Unified Ideographs
skweak - skweak: A software toolkit for weak supervision applied to NLP tasks
transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
kanjivg - Kanji vector graphics
languagepod101-scraper - Python scraper for Language Pods such as Japanesepod101.com :japanese_ogre: :japan: :sushi: Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many more! ✨
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python