CSTR VCTK Corpus

Text-to-Speech, English

CSTR VCTK Corpus is an English speech dataset intended for text-to-speech work.

About CSTR VCTK Corpus

The dataset contains speech from 109 native English speakers with various accents. Each speaker reads about 400 sentences: most were selected from a newspaper, alongside the Rainbow Passage and an elicitation paragraph intended to identify the speaker's accent.

Details

Task: Text-to-Speech
Language: English
Format: n/a
Rows / instances: n/a
Creator: Veaux et al.
Year: 2017