Skip to content

Machine Translation Datasets

There are 25 machine translation datasets in our directory, 1 of which are benchmarks. Each links to its source, paper, and download — browse the full list below or filter by language.

Machine Translation is the task of automatically converting text between languages while preserving meaning. We catalog 25 datasets for it.

Updated June 2026

What languages do machine translation datasets cover?

Explore other dataset tasks

Frequently asked questions