Question 1

What is the CSTR VCTK Corpus dataset?

Accepted Answer

Dataset contains speech data uttered by 109 native speakers of English with various accents. Each speaker reads out about 400 sentences, most of which were selected from a newspaper plus the Rainbow Passage and an elicitation paragraph intended to identify the speaker's accent.

Question 2

Is CSTR VCTK Corpus a benchmark?

Accepted Answer

CSTR VCTK Corpus is a dataset for training or evaluation; it isn't tracked as a standard LLM benchmark in our catalog.

Question 3

Where can I download CSTR VCTK Corpus?

Accepted Answer

CSTR VCTK Corpus is available at its source: https://datashare.is.ed.ac.uk/handle/10283/2651.

CSTR VCTK Corpus

About CSTR VCTK Corpus

Details

Related Text-to-Speech datasets

FAQ