Skip to content

Web Inventory of Transcribed and Translated Talks (WIT3)

Machine TranslationMulti-Lingual

Created by Cettolo et al. at 2012, the Web Inventory of Transcribed and Translated Talks (WIT3) is a machine translation dataset in Multi-Lingual in XML format.

About Web Inventory of Transcribed and Translated Talks (WIT3)

Dataset contains a collection of transcribed and translated talks. The core of the dataset is from Ted Talks corpus. As of 2016, It holds 109 languages.

Details

Task
Machine Translation
Language
Multi-Lingual
Format
XML
Rows / instances
n/a
Creator
Cettolo et al.
Year
2012
Download Paper

Related Machine Translation datasets

FAQ