Tatoeba-Challenge Alternatives

Similar projects and alternatives to Tatoeba-Challenge

edenai-apis

13 364 9.8 Python Tatoeba-Challenge VS edenai-apis

Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
COMET

3 401 7.7 Python Tatoeba-Challenge VS COMET

A Neural Framework for MT Evaluation (by Unbabel)
InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
OPUS-MT-train

1 304 1.7 Makefile Tatoeba-Challenge VS OPUS-MT-train

Training open neural machine translation models
fastseq

2 425 0.0 Python Tatoeba-Challenge VS fastseq

An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/pdf/2106.04718.pdf
AutomaticKeyphraseExtraction

1 336 10.0 Tatoeba-Challenge VS AutomaticKeyphraseExtraction

Data for Automatic Keyphrase Extraction Task
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better Tatoeba-Challenge alternative or higher similarity.

Suggest an alternative to Tatoeba-Challenge

Tatoeba-Challenge reviews and mentions

Posts with mentions or reviews of Tatoeba-Challenge. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-01-06.

OpenAI GPT-3 vs Other Models [Benchmark] - Should AI companies be really worried ?
4 projects | dev.to | 6 Jan 2023

Automatically translate a text from a language A to a language B. 1/ Dataset : we chose a dataset from the Language Technology Research Group at the University of Helsinki’s Tatoeba Translation Challenge . We took 100 of examples from different latin languages pairs : deu-fra, eng-fra, fra -ita, deu-spa , deu-swe which constitutes a 500 example test dataset.
Amazon releases 51-language dataset for language understanding
2 projects | news.ycombinator.com | 21 Apr 2022

https://translatelocally.com/ is a nice gui around marian/bergamot. So far not very many bundled pairs, though I would guess any of the models from https://github.com/Helsinki-NLP/Opus-MT-train/tree/master/mo... and https://github.com/Helsinki-NLP/Tatoeba-Challenge/blob/maste... should be usable.
There is also Apertium, a rule-based system which is very good for some closely-related pairs that have had a lot of work put into them (especially translation between Romance languages, e.g. Spanish→Catalan, and Norwegian Bokmål→Nynorsk), and the only OK translator for some lesser-resourced languages (e.g. Northern Saami→Norwegian Bokmål), but very underdeveloped for anything to/from English (it feels a bit pointless writing rules for English where there is so much available data; RBMT shines where there's not enough available data, ie. most of the languages of the world)
[P] What we learned by accelerating by 5X Hugging Face generative language models
2 projects | /r/MachineLearning | 10 Feb 2022

#1: University of Helsinki language technology professor Jörg Tiedemann has released a dataset with over 500 million translated sentences in 188 languages | 0 comments #2: The NLP Index: 3,000+ code repos for hackers and researchers. [self-promotion] #3: A Python library to boost T5 models speed up to 5x & reduce the model size by 3x.
Labelling of Text (NLP)
1 project | /r/MLQuestions | 29 Mar 2021

#1: Matching GPT-3's performance with just 0.1% of its parameters #2: University of Helsinki language technology professor Jörg Tiedemann has released a dataset with over 500 million translated sentences in 188 languages | 0 comments #3: Trained a Markov Chain on a bunch of r/WSB posts and comments. Only 2-word conditional probabilities but honestly, that's all that's necessary 🚀🚀
Helsinki professor Jörg Tiedemann – 500M translations in 188 languages
1 project | news.ycombinator.com | 23 Mar 2021
Thought it could be useful to someone
1 project | /r/datasets | 23 Mar 2021
University of Helsinki language technology professor Jörg Tiedemann has released a dataset with over 500 million translated sentences in 188 languages
1 project | /r/Develovers | 22 Mar 2021

1 project | /r/ooj | 22 Mar 2021
Translated language database released by Helsinki scientist
1 project | /r/Cyberdelinaut | 22 Mar 2021
500 million sentences in 188 languages
1 project | /r/languagelearning | 22 Mar 2021
A note from our sponsor - InfluxDB
www.influxdata.com | 9 May 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Stats

Basic Tatoeba-Challenge repo stats

Mentions

Stars

779

Activity

5.7

Last Commit

16 days ago

Helsinki-NLP/Tatoeba-Challenge is an open source project licensed under GNU General Public License v3.0 or later which is an OSI approved license.

The primary programming language of Tatoeba-Challenge is Makefile.