OPUS-MT-train
klpt
OPUS-MT-train | klpt | |
---|---|---|
1 | 1 | |
302 | 91 | |
3.0% | - | |
1.7 | 1.8 | |
about 2 months ago | about 2 years ago | |
Makefile | Python | |
MIT License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
OPUS-MT-train
-
Amazon releases 51-language dataset for language understanding
https://translatelocally.com/ is a nice gui around marian/bergamot. So far not very many bundled pairs, though I would guess any of the models from https://github.com/Helsinki-NLP/Opus-MT-train/tree/master/mo... and https://github.com/Helsinki-NLP/Tatoeba-Challenge/blob/maste... should be usable.
There is also Apertium, a rule-based system which is very good for some closely-related pairs that have had a lot of work put into them (especially translation between Romance languages, e.g. Spanish→Catalan, and Norwegian Bokmål→Nynorsk), and the only OK translator for some lesser-resourced languages (e.g. Northern Saami→Norwegian Bokmål), but very underdeveloped for anything to/from English (it feels a bit pointless writing rules for English where there is so much available data; RBMT shines where there's not enough available data, ie. most of the languages of the world)
klpt
-
Insert tsvector data without using to_tsvector()
I did try it. And it's very good except for the part that I have a Hunspell dictionary with some custom logic (similar to snowball stemmers). Unfortunately I don't have a snowball stemmer so postgres simply ignores words that are not in the hunspell dictionary. So I want to use this library to do the stemming: https://github.com/sinaahmadi/klpt as it has custom rules implemented in python instead of relying on postgres
What are some alternatives?
Opus-MT - Open neural machine translation models and web services
NLP-progress - Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
pygod - A Python Library for Graph Outlier Detection (Anomaly Detection)
Tatoeba-Challenge
HiddenEye-Legacy - Modern Phishing Tool With Advanced Functionality And Multiple Tunnelling Services [ Android-Support-Available ]
tensor2tensor - Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
xsser - Cross Site "Scripter" (aka XSSer) is an automatic -framework- to detect, exploit and report XSS vulnerabilities in web-based applications.
Face-Recognition_Flutter - A sample Face recognition app using Flutter and Firebase ML Kit
nicegui - Create web-based user interfaces with Python. The nice way.
deep-learning-drizzle - Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
FundamentalAnalysis - Transparent and Efficient Financial Analysis [Moved to: https://github.com/JerBouma/FinanceToolkit]