LibriVoxDeEn

Speech Translation, Machine Translation | German, English

LibriVoxDeEn is a German-English speech translation dataset created by Beilharz et al. in 2019. It contains 50,000+ records in text and TSV format.

About LibriVoxDeEn

The dataset contains sentence-aligned triples of German audio, German text, and English translation, based on German audiobooks. The corpus comprises over 100 hours of audio and over 50k parallel sentences.
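
As a rough illustration of working with sentence-aligned triples stored as TSV, the sketch below reads tab-separated rows into typed records. The column order (audio path, German text, English translation), the `Triple` type, the `read_triples` helper, and the two sample rows are all assumptions made up for this example; they are not taken from the dataset, whose actual file layout should be checked before use.

```python
import csv
import io
from typing import NamedTuple


class Triple(NamedTuple):
    """One aligned record. Field names and order are hypothetical."""
    audio_path: str  # assumed: path or ID of the German audio clip
    german: str      # assumed: German transcript
    english: str     # assumed: English translation


def read_triples(handle):
    """Read tab-separated rows with exactly three fields into Triple records."""
    # QUOTE_NONE keeps quotation marks inside sentences as literal text.
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    triples = []
    for row in reader:
        if len(row) != 3:
            continue  # skip blank or malformed rows
        triples.append(Triple(*row))
    return triples


# Invented sample rows for illustration only; not real corpus data.
SAMPLE = (
    "clip_0001.wav\tGuten Morgen.\tGood morning.\n"
    "clip_0002.wav\tWie geht es dir?\tHow are you?\n"
)

triples = read_triples(io.StringIO(SAMPLE))
for t in triples:
    print(t.audio_path, "|", t.german, "->", t.english)
```

With a real file, `read_triples(open(path, encoding="utf-8", newline=""))` would take the place of the in-memory sample, assuming the file follows the same three-column layout.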

Details

Task: Speech Translation, Machine Translation
Language: German, English
Format: Text, TSV
Rows / instances: 50,000+
Creator: Beilharz et al.
Year: 2019
FAQ