VoxCeleb
Speech RecognitionVisualMulti-Lingual
VoxCeleb is a speech recognition-focused dataset in Multi-Lingual distributed in MD5, URL format.
About VoxCeleb
An audio-visual dataset consisting of short clips of human speech, extracted from interview videos uploaded to YouTube.
Details
- Task
- Speech Recognition, Visual
- Language
- Multi-Lingual
- Format
- MD5, URL
- Rows / instances
- n/a
- Creator
- Nagrani et al.
- Year
- 2017