Skip to content

VoxCeleb

Speech RecognitionVisualMulti-Lingual

VoxCeleb is a speech recognition-focused dataset in Multi-Lingual distributed in MD5, URL format.

About VoxCeleb

An audio-visual dataset consisting of short clips of human speech, extracted from interview videos uploaded to YouTube.

Details

Task
Speech Recognition, Visual
Language
Multi-Lingual
Format
MD5, URL
Rows / instances
n/a
Creator
Nagrani et al.
Year
2017
Download Paper

Related Speech Recognition, Visual datasets

FAQ