Question Answering Datasets
There are 130 question answering datasets in our directory, 4 of which are benchmarks. Each links to its source, paper, and download. Browse the full list below or filter by language.
Question Answering is the task of returning a precise answer to a natural-language question, often grounded in a supporting passage.
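To make "grounded in a supporting passage" concrete, here is a minimal Python sketch of a span-based record, loosely modeled on the layout common to SQuAD-style reading comprehension data. The class name `QAExample`, its field names, and the sample passage are assumptions made for this illustration; each dataset in the list defines its own schema.

```python
from dataclasses import dataclass


@dataclass
class QAExample:
    """One extractive QA record: the answer is a span of the passage."""
    question: str
    context: str
    answer_text: str
    answer_start: int  # character offset of the answer inside `context`


def answer_is_grounded(example: QAExample) -> bool:
    """Return True if the recorded offset really points at the answer text."""
    end = example.answer_start + len(example.answer_text)
    return example.context[example.answer_start:end] == example.answer_text


# Hypothetical sample record, written for this sketch only.
context = "The Eiffel Tower was completed in 1889 for the World's Fair in Paris."
example = QAExample(
    question="When was the Eiffel Tower completed?",
    context=context,
    answer_text="1889",
    answer_start=context.index("1889"),
)
print(answer_is_grounded(example))
```

A check like this is a cheap sanity test to run on any span-annotated dataset before training, since misaligned offsets are a common source of silent errors.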
Updated June 2026
- OpenSQZ/AutoMathText-V2 · Text Generation, Question Answering · EN, ZH
- clips/mfaq · Question Answering · CS, DA, DE
- clzoro/GLM-5.1-1000000x · Text Generation, Question Answering · EN, ZH
- SubjQA · Question Answering · English
- aps/super_glue · Text Classification, Token Classification, Question Answering · EN
- MilkQA · Question Answering · Portuguese
- OpenMed/Medical-Reasoning-SFT-Mega · Text Generation, Question Answering · EN
- GrailQA · Question Answering, Knowledge Base · English
- Russian Multi-Sentence Reading Comprehension (MuSeRC) (SuperGlue) · Question Answering · Russian
- JailbreakV-28K/JailBreakV-28k · Text Generation, Question Answering · English
- Vietnamese Question Answering Dataset (ViQuAD) · Question Answering · Vietnamese
- QED · Question Answering, Explainability · English
- Vietnamese Multiple-choice Machine Reading Comprehension Corpus (ViMMRC) · Question Answering, Reading Comprehension · Vietnamese
- PubmedQA · Question Answering · English
- K-and-K/knights-and-knaves · Question Answering · EN
- allenai/openbookqa · Question Answering · EN
- Question Answering in Context (QuAC) · Question Answering, Reading Comprehension · English
- Reading Comprehension with Commonsense Reasoning Dataset (Record) · Question Answering, Reading Comprehension · English
- Reading Comprehension with Multiple Hops (Qangaroo) · Question Answering, Reading Comprehension · English
- OpenGVLab/ShareGPT-4o · Visual Question Answering, Question Answering · EN
- AmbigNQ · Question Answering, Reading Comprehension · English
- Congliu/Chinese-DeepSeek-R1-Distill-data-110k · Text Generation, Question Answering · ZH
- Congliu/Chinese-DeepSeek-R1-Distill-data-110k-SFT · Text Generation, Question Answering · ZH
- Situations With Adversarial Generations (SWAG) · Question Answering, Reading Comprehension · English
- SQuAD v2.0 · Question Answering, Reading Comprehension · English
- TIGER-Lab/WebInstructSub · Question Answering · EN
- corbyrosset/researchy_questions · Question Answering · EN · Benchmark
- DoQa · Question Answering, Dialogue · English
- Who Did What Dataset · Question Answering, Reading Comprehension · English
- Visual Commonsense Reasoning (VCR) · Question Answering, Visual, Commonsense · English
- silk-road/Wizard-LM-Chinese-instruct-evol · Text Generation, Question Answering · ZH, EN
- MATINF · Classification, Question Answering, Summarization · Chinese
- Complex Sequential Question Answering (CSQA) · Question Answering, Knowledge Base · English
- TriviaQA · Question Answering, Reading Comprehension · English
- MathQA · Question Answering, Reading Comprehension · English
- WebQuestions · Question Answering, Knowledge Base · English
- neural-bridge/rag-dataset-12000 · Question Answering · EN
- LDJnr/Puffin · Question Answering, Text Generation · EN
- KorQuAD · Question Answering, Reading Comprehension · Korean
- SberQuAD · Question Answering, Reading Comprehension · Russian
- FQuAD · Question Answering, Reading Comprehension · French
- LC-QuAD 2.0 · Question Answering, Knowledge Graph · English
- QASC · Question Answering, Reading Comprehension · English
- Quoref · Question Answering, Reading Comprehension · English
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning (CLEVR & CoGenT) · Question Answering, Visual · English
- Fact-based Visual Question Answering (FVQA) · Question Answering, Visual · English
- Physical IQA · Question Answering, Commonsense · English
- QA-ZRE · Question Answering, Relation Extraction · English
- Social IQA · Question Answering, Commonsense · English
- QA-SRL Bank · Question Answering, Semantic Role Labeling · English
- lvwerra/stack-exchange-paired · Text Generation, Question Answering · EN
- UCSD26/medical_dialog · Question Answering · EN, ZH
- lmsys/mt_bench_human_judgments · Question Answering · EN
- Mxode/Chinese-Instruct · Text Generation, Question Answering · ZH
- COmmonsense Dataset Adversarially-authored by Humans (CODAH) · Question Answering, Reading Comprehension, Commonsense · English
- AI2 Science Questions v2.1 · Question Answering, Reading Comprehension · English
- Children’s Book Test (CBT) · Question Answering, Reading Comprehension · English
- bAbI 20 Tasks · Question Answering, Reading Comprehension · Hindi, English
- ComplexWebQuestions · Question Answering, Knowledge Base · English
- Human-in-the-loop Dialogue Simulator (HITL) · Question Answering, Reading Comprehension · English
- GQA · Question Answering, Visual, Commonsense · English
- Jeopardy Questions Answers · Question Answering, Reading Comprehension · English
- Natural Questions (NQ) · Question Answering, Reading Comprehension · English
- NarrativeQA · Question Answering, Reading Comprehension · English
- deepset/germanquad · Question Answering, Text Retrieval · DE
- NewsQA · Question Answering, Reading Comprehension · English
- nvidia/OpenMathReasoning · Question Answering, Text Generation · EN
- microsoft/orca-agentinstruct-1M-v1 · Question Answering · EN
- Quasar-S & T · Question Answering, Reading Comprehension · English
- ReAding Comprehension Dataset From Examinations (RACE) · Question Answering, Reading Comprehension · English
- Reading Comprehension over Multiple Sentences (MultiRC) · Question Answering, Reading Comprehension · English
- OpenBookQA · Question Answering, Reading Comprehension · English
- ProPara Dataset · Question Answering, Reading Comprehension · English
- QuaRel Dataset · Question Answering, Reading Comprehension · English
- QuaRTz Dataset · Question Answering, Reading Comprehension · English
- SciQ Dataset · Question Answering, Reading Comprehension · English
- SemEvalCQA · Question Answering, Reading Comprehension · Arabic, English
- Social-IQ Dataset · Question Answering, Visual, Commonsense · English
- Textbook Question Answering · Question Answering, Reading Comprehension, Visual · English
- TextVQA · Question Answering, Visual, Commonsense · English
- The Dialog-based Language Learning Dataset · Question Answering, Reading Comprehension · English
- The Movie Dialog Dataset · Question Answering, Reading Comprehension · English
- The SimpleQuestions Dataset · Question Answering, Knowledge Base · English
- The Story Cloze Test | ROCStories · Question Answering, Reading Comprehension · English
- TrecQA · Question Answering, Reading Comprehension · English
- WikiQA Corpus · Question Answering, Reading Comprehension · English
- angrygiraffe/claude-opus-4.6-4.7-reasoning-8.7k · Text Generation, Question Answering · EN
- WNT3D/Ultimate-Offensive-Red-Team · Text Generation, Question Answering, Text Classification · EN
- math-ai/StackMathQA · Text Generation, Question Answering · EN
- Open-Orca/OpenOrca · Text Classification, Token Classification, Table Question Answering, Question Answering, Zero Shot Classification, Summarization, Feature Extraction, Text Generation · EN
- Salesforce/xlam-function-calling-60k · Question Answering, Text Generation, Reinforcement Learning · EN
- JosephusCheung/GuanacoDataset · Text Generation, Question Answering · ZH, EN, JA
- tonytan48/TempReason · Question Answering · EN
- allenai/WildChat-1M · Text Generation, Question Answering · English
- AdaptLLM/finance-tasks · Text Classification, Question Answering, Zero Shot Classification · EN
- iamtarun/python_code_instructions_18k_alpaca · Question Answering, Text Generation · English
- qiaojin/PubMedQA · Question Answering · EN
- vicgalle/alpaca-gpt4 · Text Generation, Question Answering · EN
- stanfordnlp/SHP · Text Generation, Question Answering · EN
- wangrui6/Zhihu-KOL · Question Answering · ZH
- llamafactory/tiny-supervised-dataset · Text Generation, Question Answering · EN, ZH
- AI2 Reasoning Challenge (ARC) · Question Answering, Reading Comprehension · English · Benchmark
- Open-Orca/SlimOrca · Text Classification, Token Classification, Table Question Answering, Question Answering, Zero Shot Classification, Summarization, Feature Extraction, Text Generation · EN
- MedRAG/pubmed · Question Answering · EN
- shareAI/ShareGPT-Chinese-English-90k · Question Answering, Text Generation · EN, ZH
- sunzeyeah/chinese_chatgpt_corpus · Text Generation, Question Answering, Reinforcement Learning · ZH
- ShadenA/MathNet · Question Answering, Text Generation, Image To Text · EN, PT, ES
- UCSC-VLAA/MedReason · Question Answering · English
- neulab/PangeaInstruct · Visual Question Answering, Question Answering · AM, AR, BG
- stanfordnlp/coqa · Question Answering · EN
- Magpie-Align/Magpie-Qwen2-Pro-200K-Chinese · Question Answering · ZH
- OpenDataArena/MMFineReason-SFT-123K-Qwen3-VL-235B-Thinking · Visual Question Answering, Question Answering, Text Generation · EN
- sujet-ai/Sujet-Finance-Instruct-177k · Text Generation, Question Answering · EN
- opencsg/Fineweb-Edu-Chinese-V2.2 · Text Generation, Question Answering · ZH
- LDJnr/Pure-Dove · Question Answering, Text Generation · EN
- BAAI/Infinity-Preference · Text Generation, Question Answering · EN, ZH
- galaxyMindAiLabs/stem-reasoning-complex · Text Generation, Question Answering · EN, ZH
- lmms-lab/M4-Instruct-Data · Visual Question Answering, Question Answering · EN
- OpenMed/Medical-Reasoning-SFT-Trinity-Mini · Text Generation, Question Answering · EN
- zai-org/LongCite-45k · Text Generation, Question Answering · EN, ZH
- Malikeh1375/medical-question-answering-datasets · Question Answering · EN
- microsoft/wiki_qa · Question Answering · EN
- VLR-CVC/DocVQA-2026 · Visual Question Answering, Document Question Answering, Image Text To Text, Question Answering · EN
- deepmind/aqua_rat · Question Answering · EN
- Modotte/MathX-5M · Question Answering, Text Generation · English
- Flmc/DISC-Med-SFT · Question Answering · ZH
- tasksource/bigbench · Multiple Choice, Question Answering, Text Classification, Text Generation, Zero Shot Classification · EN · Benchmark
- TIGER-Lab/WebInstruct-verified · Question Answering · EN
- maya-research/IndicVault · Question Answering, Text Generation · HI, TE, EN · Benchmark
- facebook/kilt_tasks · Fill Mask, Question Answering, Text Classification, Text Generation, Text Retrieval · EN
What languages do question answering datasets cover?
- English datasets (61)
- EN datasets (50)
- ZH datasets (18)
- DE datasets (2)
- Russian datasets (2)
- Vietnamese datasets (2)
- CS datasets (1)
- DA datasets (1)
- Portuguese datasets (1)
- Chinese datasets (1)
- Korean datasets (1)
- French datasets (1)
- Hindi datasets (1)
- Arabic datasets (1)
- JA datasets (1)
- PT datasets (1)
- ES datasets (1)
- AM datasets (1)