Metric | ua-gec | ukrainian-word-stress-dictionary
---|---|---
Mentions | 1 | 1
Stars | 255 | 18
Growth | 0.4% | -
Activity | 4.1 | 1.8
Last commit | 4 months ago | almost 2 years ago
Language | Macaulay2 | -
License | Creative Commons Attribution 4.0 | -
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ua-gec
- [N] Grammarly releases a grammatical error correction (GEC) dataset for the Ukrainian language
The data and code are on GitHub: https://github.com/grammarly/ua-gec
ukrainian-word-stress-dictionary
- Show HN: Ukrainian.fyi – Find the location of stress in ~2m Ukrainian words
Tech stack:
List of word stresses via https://github.com/lang-uk/ukrainian-word-stress-dictionary.
I made a Python script to remove the special stress accent from each word. The script then produces a table of words with and without stresses. This script takes a second or so to run.
The database is hosted via Supabase. A Python script uploads the data to Supabase.
The website is hosted on Vercel. Search results are cached, so a repeated search returns quickly for the next visitor.
And it’s all free (except the domain), with generous usage limits!
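The stress-stripping step described in the tech stack above could look roughly like the sketch below. It is not the author's actual script: it assumes that stress is marked with the Unicode combining acute accent (U+0301) placed after the stressed vowel, and the function names `strip_stress` and `build_table` are hypothetical.

```python
import unicodedata

# Assumption: stress is encoded as the combining acute accent (U+0301).
COMBINING_ACUTE = "\u0301"


def strip_stress(word: str) -> str:
    """Return the word with every combining acute accent removed."""
    # Decompose first so an accent is always a separate code point,
    # then recompose so letters such as "ї" and "й" stay in their usual form.
    decomposed = unicodedata.normalize("NFD", word)
    return unicodedata.normalize("NFC", decomposed.replace(COMBINING_ACUTE, ""))


def build_table(stressed_words):
    """Map each unstressed spelling to the stressed forms that share it."""
    table = {}
    for stressed in stressed_words:
        table.setdefault(strip_stress(stressed), []).append(stressed)
    return table


if __name__ == "__main__":
    # Two stress variants of one spelling, plus a second word.
    words = ["за\u0301мок", "замо\u0301к", "мо\u0301ва"]
    for plain, variants in build_table(words).items():
        print(plain, variants)
```

Keying the table on the unstressed spelling lets a search for a plain word return every stressed variant, which matters for spellings that carry more than one possible stress position.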
What are some alternatives?
open-discourse - Open Discourse is the first fully comprehensive corpus of the plenary proceedings of the federal German Parliament (Bundestag).
Probable-Wordlists - Version 2 is live! Wordlists sorted by probability originally created for password generation and testing - make sure your passwords aren't popular!
open-australian-legal-corpus-creator - The code used to create and update the Open Australian Legal Corpus, the first and only multijurisdictional open corpus of Australian legislative and judicial documents.
Kaonashi - Wordlist, rules and masks from Kaonashi project (RootedCON 2019)
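Chinese-Names-Corpus - Chinese personal name corpus and name generator. Covers Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, and English names. Can be used for Chinese word segmentation and person-name entity recognition.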
east-central-european-dicts - Dictionaries for Eastern and Central European languages: Albanian, Armenian, Belarusian, Croatian, Finnish, Hungarian, Latvian, Lithuanian, Macedonian, Polish, Romanian, Russian, Serbian, Slovak, Slovenian, Swedish, Turkish, Ukrainian, Uzbek
typescript-docs-ua - Ukrainian translation of the TypeScript documentation