SparkAudio/voxbox
Text To SpeechZH, EN
SparkAudio/voxbox is a text to speech dataset in ZH, EN from SparkAudio in Parquet format.
About SparkAudio/voxbox
VoxBox
This dataset is a curated collection of bilingual speech corpora annotated clean transcriptions and rich metadata incluing age, gender, and emotion.
Dataset Structure
.
├── audios/
│ └── aishell-3/ # Aud...
Details
- Task
- Text To Speech
- Language
- ZH, EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- SparkAudio
- Year
- 2025