bitextor
OpenNMT-py
Metric | bitextor | OpenNMT-py
---|---|---
Mentions | 2 | 6
Stars | 279 | 6,574
Growth | 0.7% | 0.9%
Activity | 5.9 | 8.7
Last commit | 8 months ago | 11 days ago
Language | Python | Python
License | GNU General Public License v3.0 only | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
bitextor
OpenNMT-py
- Making a custom Google Translate equivalent / web translation filter for my conlang?
  I already tried this with OpenNMT.
- Cutting edge language translation models
  fairseq and OpenNMT are very good starting points if you want to train your NMT model from scratch.
- How Telegram Messenger circumvents Google Translate's API
- WEBNLG challenge 2017 on Google Colab error
  It looks like this uses the version of OpenNMT implemented in Torch, which has been deprecated. You will be much better off using the PyTorch implementation of OpenNMT or the transformers library. In fact, I would recommend taking a look at the GEM benchmark, since it also uses the WebNLG dataset. Here is a tutorial to get started; you can change the dataset to WebNLG instead of CommonGen.
- Help with Neural Machine Translation
  Umm... OpenNMT. This is a library for NMT that has been maintained since 2016.
- Oop concepts for pytorch
  However, you do not need to use much OOP when training models with PyTorch. Most of the time it is just inheriting from a class and overriding methods. You might need more advanced stuff if you were writing a framework on top of it, something like ONMT.
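The "inherit from a class and override methods" pattern mentioned in the last comment above can be illustrated with a minimal sketch. To keep it runnable without any dependencies, it uses a hypothetical stand-in base class named `Module` rather than the real `torch.nn.Module`; the `Scale` class and its `factor` parameter are likewise made up for illustration. In PyTorch, the analogous step is subclassing `torch.nn.Module` and overriding `forward`.

```python
class Module:
    """Hypothetical stand-in for a framework base class (not torch.nn.Module)."""

    def __call__(self, *args, **kwargs):
        # The base class owns the call protocol and delegates to forward().
        return self.forward(*args, **kwargs)

    def forward(self, *args, **kwargs):
        # Subclasses are expected to override this method.
        raise NotImplementedError("subclasses must override forward()")


class Scale(Module):
    """Toy 'model' that multiplies every input value by a fixed factor."""

    def __init__(self, factor):
        self.factor = factor

    def forward(self, values):
        # The only model-specific logic lives in the overridden method.
        return [self.factor * v for v in values]


model = Scale(2.0)
doubled = model([1.0, 2.5])  # calling the object dispatches to Scale.forward
print(doubled)
```

The user-defined class only supplies `__init__` and `forward`; everything else comes from the base class, which is why little OOP beyond inheritance is usually needed.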
What are some alternatives?
ArchiveBox - 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
trankit - Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
pytorch-tutorial - PyTorch Tutorial for Deep Learning Researchers
Hebrew-Tokenizer - A very simple python tokenizer for Hebrew text.
tensor2tensor - Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
grab-site - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Transformer-Models-from-Scratch - implementing various transformer models for various tasks
nematus - Open-Source Neural Machine Translation in Tensorflow
Opus-MT - Open neural machine translation models and web services
sentence-splitter - Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
OpenNMT - Open Source Neural Machine Translation in Torch (deprecated)