reazon-research/reazonspeech
Automatic Speech RecognitionJABenchmark
Created by reazon-research at 2023, the reazon-research/reazonspeech is a automatic speech recognition benchmark dataset in JA in Parquet format.
📊 This dataset is used as an LLM benchmark. See model leaderboards →
About reazon-research/reazonspeech
Dataset Card for ReazonSpeech
Dataset Summary
This dataset contains a diverse set of natural Japanese speech, collected
from terrestrial television streams. It contains more than 35000 hours of
audio.
Paper: ReazonSpeech: A Free and ...
Details
- Task
- Automatic Speech Recognition
- Language
- JA
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- reazon-research
- Year
- 2023