Skip to content

LEMAS-Project/LEMAS-Dataset-train

Text To SpeechAutomatic Speech RecognitionIT, PT, ES

The LEMAS-Project/LEMAS-Dataset-train dataset is a IT, PT, ES text to speech resource from LEMAS-Project at 2025.

About LEMAS-Project/LEMAS-Dataset-train

Overview This dataset is part of LEMAS-Project (lemas-project.github.io/LEMAS-Project). It contains a large-scale training set (150k+ hours) and a curated evaluation set (500 utterances per language) covering 10 languages, all with word-level a...

Details

Task
Text To Speech, Automatic Speech Recognition
Language
IT, PT, ES
Format
Parquet
Rows / instances
N/A
Creator
LEMAS-Project
Year
2025
Download

Related Text To Speech, Automatic Speech Recognition datasets

FAQ