mt5-M2M-comparison
100DaysOfML
Metric | mt5-M2M-comparison | 100DaysOfML |
---|---|---|
Mentions | 1 | 2 |
Stars | 13 | 127 |
Growth | - | - |
Activity | 3.8 | 0.0 |
Last commit | almost 3 years ago | over 1 year ago |
Language | Jupyter Notebook | Jupyter Notebook |
License | - | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
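The legend above does not give the exact formula behind the activity number, so the following is only a minimal sketch of one way such a score could work. It assumes an exponential decay for commit age (the `half_life_days` parameter is hypothetical) and a percentile-rank mapping onto a 0-10 scale, chosen so that a score of 9.0 corresponds to the top 10% of tracked projects. The function names are illustrative and do not come from the site.

```python
from datetime import date


def weighted_commit_score(commit_dates, today, half_life_days=90):
    """Sum commit weights, where a commit's weight halves every
    `half_life_days` days (an assumed decay, not the site's formula)."""
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_scores(raw_scores):
    """Map raw scores to a 0-10 scale by percentile rank.

    Each project's score is 10 times the fraction of tracked projects
    with a strictly lower raw score, rounded to one decimal place.
    """
    n = len(raw_scores)
    result = {}
    for name, score in raw_scores.items():
        below = sum(1 for other in raw_scores.values() if other < score)
        result[name] = round(10 * below / n, 1)
    return result


if __name__ == "__main__":
    # Ten hypothetical projects with raw scores 0..9.
    raw = {f"project{i}": float(i) for i in range(10)}
    print(activity_scores(raw))
```

Under these assumptions, a project that outranks nine out of ten tracked projects gets 9.0, which matches the legend's example, while a project with no recent commits drifts toward 0.0 as its commit weights decay.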
mt5-M2M-comparison
- [D] Comparing M2M to mT5 in low-resource translation (10k dataset, Yoruba - English)
I found neither a clear comparison nor a clear guide on how to fine-tune both models on the translation task, so I decided to write one myself. (code: https://github.com/maroxtn/mt5-M2M-comparison)
100DaysOfML
What are some alternatives?
fastT5 - ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
mlreef - The collaboration workspace for Machine Learning
keytotext - Keywords to Sentences
MetalTranslate - Customizable machine translation in C++
OpenNMT-Tutorial - Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.
MAGIST-Algorithm - Multi-Agent Generally Intelligent Simultaneous Training Algorithm for Project Zeta
fake-news - Building a fake news detector from initial ideation to model deployment
Astock - Astock
question_generation - Neural question generation using transformers
elastic_transformers - Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers
adblockradio - An adblocker for live radio streams and podcasts. Machine learning meets Shazam.