klpt
OPUS-MT-train
| | klpt | OPUS-MT-train |
|---|---|---|
| Mentions | 1 | 1 |
| Stars | 91 | 304 |
| Growth | - | 3.6% |
| Activity | 1.8 | 1.7 |
| Last commit | about 2 years ago | 2 months ago |
| Language | Python | Makefile |
| License | GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
klpt
Insert tsvector data without using to_tsvector()
I did try it, and it's very good, except that I have a Hunspell dictionary with some custom logic (similar to Snowball stemmers). Unfortunately I don't have a Snowball stemmer, so Postgres simply ignores words that are not in the Hunspell dictionary. So I want to use this library to do the stemming: https://github.com/sinaahmadi/klpt, as it has custom rules implemented in Python instead of relying on Postgres.
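What the post describes, stemming outside the database and storing the result without `to_tsvector()`, can be sketched as follows. This is a minimal illustration, not the poster's code: it relies only on PostgreSQL's documented `tsvector` text input format (`'lexeme':pos,pos`, with embedded quotes and backslashes doubled), and `toy_stem` is a hypothetical placeholder for whatever stemmer is actually used, such as one built on klpt.

```python
from collections import defaultdict
from typing import Callable, Iterable

MAX_POSITION = 16383  # PostgreSQL caps tsvector positions at this value


def quote_lexeme(lexeme: str) -> str:
    """Quote a lexeme for tsvector text input, doubling backslashes and quotes."""
    escaped = lexeme.replace("\\", "\\\\").replace("'", "''")
    return f"'{escaped}'"


def build_tsvector_literal(tokens: Iterable[str], stem: Callable[[str], str]) -> str:
    """Stem each token and emit a tsvector literal with 1-based word positions."""
    positions = defaultdict(list)
    for pos, token in enumerate(tokens, start=1):
        lexeme = stem(token)
        if lexeme:  # skip tokens the stemmer maps to an empty string
            positions[lexeme].append(min(pos, MAX_POSITION))
    parts = []
    for lexeme in sorted(positions):
        pos_list = ",".join(str(p) for p in positions[lexeme])
        parts.append(f"{quote_lexeme(lexeme)}:{pos_list}")
    return " ".join(parts)


def toy_stem(token: str) -> str:
    """Hypothetical placeholder stemmer: lowercases and strips a trailing 's'."""
    token = token.lower()
    return token[:-1] if token.endswith("s") and len(token) > 1 else token


literal = build_tsvector_literal("Cats chase cats".split(), toy_stem)
print(literal)  # 'cat':1,3 'chase':2
```

The resulting string can then be passed as an ordinary query parameter and cast on the server, for example `INSERT INTO docs (body_tsv) VALUES (%s::tsvector)`, where the table `docs` and column `body_tsv` are made-up names. A plain cast to `tsvector` does not normalize the lexemes, so they are stored as the external stemmer produced them; search queries need to be stemmed the same way for matches to line up.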
OPUS-MT-train
Amazon releases 51-language dataset for language understanding
https://translatelocally.com/ is a nice GUI around Marian/Bergamot. So far it bundles only a few language pairs, though I would guess any of the models from https://github.com/Helsinki-NLP/Opus-MT-train/tree/master/mo... and https://github.com/Helsinki-NLP/Tatoeba-Challenge/blob/maste... should be usable.
There is also Apertium, a rule-based system. It is very good for some closely related pairs that have had a lot of work put into them (especially translation between Romance languages, e.g. Spanish→Catalan, and Norwegian Bokmål→Nynorsk), and it is the only OK translator for some lesser-resourced languages (e.g. Northern Saami→Norwegian Bokmål). It is very underdeveloped for anything to/from English, though: it feels a bit pointless writing rules for English when there is so much available data. RBMT shines where there's not enough available data, i.e. for most of the languages of the world.
What are some alternatives?
Opus-MT - Open neural machine translation models and web services
pygod - A Python Library for Graph Outlier Detection (Anomaly Detection)
NLP-progress - Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
HiddenEye-Legacy - Modern Phishing Tool With Advanced Functionality And Multiple Tunnelling Services [ Android-Support-Available ]
Tatoeba-Challenge
xsser - Cross Site "Scripter" (aka XSSer) is an automatic -framework- to detect, exploit and report XSS vulnerabilities in web-based applications.
tensor2tensor - Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
nicegui - Create web-based user interfaces with Python. The nice way.
Face-Recognition_Flutter - A sample Face recognition app using Flutter and Firebase ML Kit
FundamentalAnalysis - Transparent and Efficient Financial Analysis [Moved to: https://github.com/JerBouma/FinanceToolkit]
deep-learning-drizzle - Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!