Skip to content

European Parliament Proceedings (Europarl)

Text CorporaMachine TranslationMulti-Lingual

European Parliament Proceedings (Europarl) is a text corpora-focused dataset in Multi-Lingual that provides 10 labeled examples distributed in XML format.

About European Parliament Proceedings (Europarl)

The Europarl parallel corpus is extracted from the proceedings of the European Parliament. It includes versions in 21 European languages.

Details

Task
Text Corpora, Machine Translation
Language
Multi-Lingual
Format
XML
Rows / instances
10M+
Creator
Koehn et al.
Year
2002
Download Paper

Related Text Corpora, Machine Translation datasets

FAQ