Skip to content

ibm-research/duorc

Question AnsweringENBenchmarkmit

Ibm-research/duorc is a question answering benchmark dataset in EN from ibm-research with 187,213 records in Parquet format. It is distributed under the mit license and falls in the 100K<n<1M size category, and has been downloaded 2.2K times.

📊 This dataset is used as an LLM benchmark. See model leaderboards →

About ibm-research/duorc

Dataset Card for duorc Dataset Summary The DuoRC dataset is an English language dataset of questions and answers gathered from crowdsourced AMT workers on Wikipedia and IMDb movie plots. The workers were given freedom to pick answer ...

Details

Task
Question Answering
Language
EN
Format
Parquet
Rows / instances
187213
Size
100K<n<1M
Creator
ibm-research
Year
2022
License
mit
Downloads
2154
Likes
34
Download Homepage

Related Question Answering datasets

FAQ