east-central-european-dicts
ukrainian-word-stress-dictionary
east-central-european-dicts | ukrainian-word-stress-dictionary | |
---|---|---|
1 | 1 | |
3 | 18 | |
- | - | |
10.0 | 1.8 | |
about 2 years ago | almost 2 years ago | |
- | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
east-central-european-dicts
-
Can Migaku work with Turkish?
I was wondering if Migaku in its current state can work with other languages such as Turkish. Does every language require its unique parsing rules? And there is also the matter of dictionaries, I'm not sure what exact dictionary formats work with Migaku, I found several Turkish dictionaries on GitHub (such as this and this) but none of them work after installing them through "install from file". They give out an error when installing that it's wrong JSON or something and then I can't look up a single word in them. What am I doing wrong? Maybe there's another database of open sourse dictionaries that work with migaku and that I just don't know of?
ukrainian-word-stress-dictionary
-
Show HN: Ukrainian.fyi – Find the location of stress in ~2m Ukrainian words
Tech stack:
List of word stresses via https://github.com/lang-uk/ukrainian-word-stress-dictionary.
I made a Python script to remove the special stress accent from each word. The script then produces a table of words with and without stresses. This script takes a second or so to run.
The database is hosted via Supabase. A Python script uploads the data to Supabase.
The website is hosted on Vercel. Search results are cached so become very quick for the next person.
And it’s all free (except the domain), with generous usage limits!
What are some alternatives?
rjecnik-hrvatskih-jezika - Rječnik hrvatskih jezika
Probable-Wordlists - Version 2 is live! Wordlists sorted by probability originally created for password generation and testing - make sure your passwords aren't popular!
Romanian-Word-Embeddings - Romanian Word Embeddings. Here you can find pre-trained corpora of word embeddings. Current methods: CBOW, Skip-Gram, Fast-Text (from Gensim library). The .vec and .model files are available for download (all in one archive).
Kaonashi - Wordlist, rules and masks from Kaonashi project (RootedCON 2019)
ua-gec - UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
typescript-docs-ua - Переклад документації TypeScript українською