Skip to content

google/WaxalNLP

Automatic Speech RecognitionText To SpeechACH, AKA, AMHcc-by-sa-4.0

Created by google at 2026, the google/WaxalNLP is a automatic speech recognition dataset in ACH, AKA, AMH in Parquet format. With 29.7K downloads and 235 likes, it is actively used by the community. It is released under the cc-by-sa-4.0 license and is a 1M<n<10M-scale dataset.

About google/WaxalNLP

Waxal Datasets The WAXAL dataset is a large-scale multilingual speech corpus for African languages, introduced in the paper WAXAL: A Large-Scale Multilingual African Language Speech Corpus. Dataset Description The Waxal project p...

Details

Task
Automatic Speech Recognition, Text To Speech
Language
ACH, AKA, AMH
Format
Parquet
Rows / instances
N/A
Size
1M<n<10M
Creator
google
Year
2026
License
cc-by-sa-4.0
Downloads
29715
Likes
235
Download Homepage

Related Automatic Speech Recognition, Text To Speech datasets

FAQ