Skip to content

Multilingual Corpus of Sentence-Aligned Spoken Utterances (MaSS)

Speech CorporaMulti-Lingual

The Multilingual Corpus of Sentence-Aligned Spoken Utterances (MaSS) dataset is a Multi-Lingual speech corpora resource from Boito et al. at 2020 comprising 8,13 examples.

About Multilingual Corpus of Sentence-Aligned Spoken Utterances (MaSS)

Dataset of 8,130 parallel spoken utterances across 8 languages (56 language pairs). Languages: Basque, English, Finnish, French. Hungarian, Romanian, Russian, Spanish.

Details

Task
Speech Corpora
Language
Multi-Lingual
Format
n/a
Rows / instances
8,13
Creator
Boito et al.
Year
2020
Download Paper

Related Speech Corpora datasets

FAQ