LibriVoxDeEn
Speech TranslationMachine TranslationGerman, English
Created by Beilharz et al. at 2019, the LibriVoxDeEn is a speech translation dataset in German, English containing 50,000+ records in Text, TSV format.
About LibriVoxDeEn
Dataset contains sentence-aligned triples of German audio, German text, and English translation, based on German audio books. The corpus consists of over 100 hours of audio material and over 50k parallel sentences.
Details
- Task
- Speech Translation, Machine Translation
- Language
- German, English
- Format
- Text, TSV
- Rows / instances
- 50,000+
- Creator
- Beilharz et al.
- Year
- 2019