A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.
Why do you think that https://github.com/jjasim/Thirukkural-English-Translation-Dataset is a good alternative to stopes