Python machine-translation

Open-source Python projects categorized as machine-translation

Top 22 Python machine-translation Projects

  • NLP-progress

    Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

  • NeMo

    A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

  • Project mention: [P] Making a TTS voice, HK-47 from Kotor using Tortoise (Ideally WaveRNN) | /r/MachineLearning | 2023-07-06

    I haven't tested WaveRNN, but of the open-source options I know, the best is FastPitch. It's also easy to use; here is the tutorial for voice cloning.

  • espnet

    End-to-End Speech Processing Toolkit

  • Project mention: WhisperSpeech – An Open Source text-to-speech system built by inverting Whisper | news.ycombinator.com | 2024-01-17

    You might check out this list from espnet. They list the different corpora they use to train their models, sorted by language and task (ASR, TTS, etc.):

    https://github.com/espnet/espnet/blob/master/egs2/README.md

  • OpenNMT-py

    Open Source Neural Machine Translation and (Large) Language Models in PyTorch

  • Project mention: Making a custom Google Translate equivalent / web translation filter for my conlang? | /r/conlangs | 2023-04-26

    I already tried this with OpenNMT.

  • manga-image-translator

    Translate manga/image: one-click translation of text in all kinds of images https://cotrans.touhou.ai/

  • Project mention: [DISC] - The angel who came to pick me up is a Gal (Oneshot by Shiraishi Kouhei) | /r/manga | 2023-09-06

    OCR works pretty well. ocr.space, ocr.best and cotrans.touhou.ai/ are all pretty nice.

  • argos-translate

    Open-source offline translation library written in Python

  • Project mention: Fast and secure translation on your local machine with a GUI | news.ycombinator.com | 2024-04-13

    Interestingly, I think this is actually related to the offline translation features built into Firefox. Both are products of "Project Bergamot", but the Mozilla-maintained version was later merged into the Firefox application:

    https://browser.mt/

    https://blog.mozilla.org/en/mozilla/local-translation-add-on...

    https://hacks.mozilla.org/2022/06/training-efficient-neural-...

    https://github.com/mozilla/firefox-translations

    https://firefox-source-docs.mozilla.org/toolkit/components/t...

    Extra webpage with screenshot and links, impossible to search for normally:

    https://translatelocally.com/downloads/

    Does one thing and does it well.

    Also, for downloading models, it's much easier to pipe/`xargs` `translateLocally --available-models` into `translateLocally -d` than to go through the GUI.

    ---

    Other self-hostable translation tools:

    https://www.apertium.org/index.eng.html

    - Traditional rule-based translation. Seems to work pretty well, but no good desktop frontend.

    https://www.argosopentech.com/

    - Works, but crashy desktop app.

    https://libretranslate.com/

    - API wrapping Argos Translate.

    https://lingva.thedaviddelta.com/

    - Google Translate scraper/privacy frontend.

    https://euroglot.com/

    - Proprietary, subscription trialware.

  • lingvo

    Lingvo

  • RL4LMs

    A modular RL library to fine-tune language models to human preferences

  • Project mention: How To Setup a Model With Guardrails? | /r/LocalLLaMA | 2023-05-12

    I think of guardrails as another dimension of human preferences: whether you are training a model to answer questions more gooder or avoid saying horrifying stuff, you are teaching the model a preference. So I think it's a straightforward RLHF problem, but from a different perspective.

  • nematus

    Open-Source Neural Machine Translation in TensorFlow

  • Opus-MT

    Open neural machine translation models and web services

  • COMET

    A Neural Framework for MT Evaluation (by Unbabel)

  • edenai-apis

    Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines

  • Project mention: We're Building an Open-Source LLM/AI API Wrapper: Here's Why | news.ycombinator.com | 2023-08-28

    HackerNoon featured our latest article in the "Future of AI" category

    We explain how Eden AI contributes to the AI ecosystem in structuring AI and LLM APIs by creating the most accomplished Open-Source wrapper possible.

    You can support us in reaching 1000 stars on GitHub here: https://github.com/edenai/edenai-apis

  • bitextor

    Bitextor generates translation memories from multilingual websites

  • comic-translate

    Desktop app for automatically translating comics (BDs, manga, manhwa, fumetti and more) in a variety of formats (image, PDF, EPUB, CBR, CBZ, etc.) and in multiple languages.

  • Project mention: Ask HN: What have you built with LLMs? | news.ycombinator.com | 2024-02-05
  • stopes

    A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.

  • a-PyTorch-Tutorial-to-Transformers

    Attention Is All You Need | a PyTorch Tutorial to Transformers

  • Project mention: Transformers Tutorial - learn to implement transformers from scratch | /r/learnmachinelearning | 2023-05-01
  • firefox-translations-training

    Training pipelines for Firefox Translations neural machine translation models

  • NLP-Guide

    Natural Language Processing (NLP). Covering topics such as Tokenization, Part Of Speech tagging (POS), Machine translation, Named Entity Recognition (NER), Classification, and Sentiment analysis.

  • tm2tb

    Bilingual term extractor

  • osdg-tool

    OSDG is an open-source tool that maps and connects activities to the UN Sustainable Development Goals (SDGs) by identifying SDG-relevant content in any text. The tool is available online at www.osdg.ai. API access available for research purposes.

  • valhalla-nmt

    Code repository for CVPR 2022 paper "VALHALLA: Visual Hallucination for Machine Translation"

  • tagtransfer

    Evaluations for tag-transfer in bergamot

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source machine-translation projects in Python? This list will help you:

Rank Project Stars
1 NLP-progress 22,296
2 NeMo 10,021
3 espnet 7,872
4 OpenNMT-py 6,558
5 manga-image-translator 4,169
6 argos-translate 3,229
7 lingvo 2,780
8 RL4LMs 2,084
9 nematus 796
10 Opus-MT 522
11 COMET 394
12 edenai-apis 357
13 bitextor 278
14 comic-translate 242
15 stopes 238
16 a-PyTorch-Tutorial-to-Transformers 159
17 firefox-translations-training 132
18 NLP-Guide 64
19 tm2tb 47
20 osdg-tool 31
21 valhalla-nmt 26
22 tagtransfer 1
