MLCommons/ml_spoken_words

Audio Classification | AR, AS, BR

MLCommons/ml_spoken_words is an audio classification dataset in AR, AS, BR, distributed in Parquet format.

About MLCommons/ml_spoken_words

Multilingual Spoken Words Corpus is a large and growing audio dataset of spoken words in 50 languages collectively spoken by over 5 billion people, for academic research and commercial applications in keyword spotting and spoken term search, licen...

Details

Task: Audio Classification
Language: AR, AS, BR
Format: Parquet
Rows / instances: N/A
Creator: MLCommons
Year: 2022