google/WaxalNLP
Automatic Speech RecognitionText To SpeechACH, AKA, AMHcc-by-sa-4.0
Created by google at 2026, the google/WaxalNLP is a automatic speech recognition dataset in ACH, AKA, AMH in Parquet format. With 29.7K downloads and 235 likes, it is actively used by the community. It is released under the cc-by-sa-4.0 license and is a 1M<n<10M-scale dataset.
About google/WaxalNLP
Waxal Datasets
The WAXAL dataset is a large-scale multilingual speech corpus for African languages, introduced in the paper WAXAL: A Large-Scale Multilingual African Language Speech Corpus.
Dataset Description
The Waxal project p...
Details
- Task
- Automatic Speech Recognition, Text To Speech
- Language
- ACH, AKA, AMH
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 1M<n<10M
- Creator
- Year
- 2026
- License
- cc-by-sa-4.0
- Downloads
- 29715
- Likes
- 235