sakubun
SudachiPy
sakubun | SudachiPy | |
---|---|---|
10 | 3 | |
31 | 348 | |
- | - | |
7.1 | 1.6 | |
16 days ago | over 1 year ago | |
JavaScript | Python | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
sakubun
- is there a free tool which creates sentences with just the kanji you know?
- Applying kanji learned in Wanikani
-
Sakubun - a tool I made to help you practice kanji, with customized quiz questions and sentences
I made a (completely free!) tool called sakubun - it helps you practice kanji, and improves your vocabulary. You can check it out here: https://sakubun.xyz
-
I programmed a script for finding Japanese sentences containing only words I already know
It's not the exact same thing, but what you're saying reminds me of my project sakubun - it gives you reading practice that consists only of kanji you already know. It allows you to import from anki, and uses the tatoeba project for its sentence database.
-
Are there any good reading resource websites?
You could check out my website which has quizzes for reading practice, sakubun
-
A tool to practice kanji you've already learnt, and acquire vocabulary
I've made a tool called sakubun - https://sakubun.herokuapp.com - it helps you practice the kanji you've learnt, and exposes you to new words made up of those kanji.
-
Sakubun - a tool to practice kanji you've already learnt and acquire new vocabulary
I'm glad you like it! There's a link to the GitHub repo in the footer of the home page, or you could click here: https://github.com/cubetastic33/sakubun
-
I've made a tool that gives you practice using specifically kanji that you've learnt so far
The website works on all devices, and you can also install it like an app if you'd like - instructions for that are provided in the website. It's completely free and open source software, and the backend is written in rust. Please feel free to contribute if you'd like! You can find the GitHub repo here.
SudachiPy
-
Sakubun - a tool I made to help you practice kanji, with customized quiz questions and sentences
The current readings were generated with SudachiPy, with a little processing. UniDic seems pretty interesting, I'll check it out. Do you know how well its accuracy is, compared to Sudachi?
-
software which turn hiragana and katakana into kanji
There are free tools for both of these things. I made game2text to do OCR and script matching. It includes a segmentation and normalization library Sudachi but I have not used its normalization feature for the app. I'm not sure anyone else even wants this feature but it will be pretty straightforward to add it if you're familiar with Python and vanilla Javascript.
-
Tokenizing / picking words out of non-english languages
spaCy uses SudachiPy internally (see the doc comment about that), so if you don't need any of spaCy's extra features or want more control over the tokenization, you could use SudachiPy directly.
What are some alternatives?
yomichan - Japanese pop-up dictionary extension for Chrome and Firefox.
Sudachi - A Japanese Tokenizer for Business
jiten - jiten - japanese android/cli/web dictionary based on jmdict/kanjidic — 日本語 辞典 和英辞典 漢英字典 和独辞典 和蘭辞典
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
Aion-Japanese-Voice-Pack - Change the voice acting of your Aion client into sweet Japanese or Korean.
momepy - Urban Morphology Measuring Toolkit
MorphMan - Anki plugin that reorders language cards based on the words you know
quanfima - Quanfima (Quantitative Analysis of Fibrous Materials)
mecab - Yet another Japanese morphological analyzer
AssessMe - A quiz application that lets instructors create multiple-choice graded assessments, results of which can be downloaded as a CSV file
simplemma - Simple multilingual lemmatizer for Python, especially useful for speed and efficiency