Multi-Xscience

Summarization, English

Multi-Xscience is a summarization-focused dataset in English that provides 40,528 labeled examples distributed in JSON format.

About Multi-Xscience

Multi-Xscience is a multi-document summarization dataset created from scientific articles. It introduces a challenging multi-document summarization task: writing the related-work section of a paper based on its abstract and the articles it references.
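To make the shape of the task concrete, here is a minimal Python sketch of how one example could be turned into a source/target pair. The field names (`abstract`, `ref_abstract`, `related_work`), the citation markers, and the sample record are assumptions made for illustration and may not match the distributed JSON files exactly; check the actual files before relying on them.

```python
import json

# Hypothetical record. The field names and values here are assumptions
# for illustration, not a verified copy of the dataset schema.
SAMPLE_JSON = """
[
  {
    "abstract": "We study X.",
    "ref_abstract": {
      "@cite_1": {"abstract": "Prior work on A."},
      "@cite_2": {"abstract": "Prior work on B."}
    },
    "related_work": "A was explored in @cite_1, and B in @cite_2."
  }
]
"""


def build_source(record):
    """Join the paper abstract and the cited abstracts into one source text."""
    parts = ["[PAPER] " + record["abstract"]]
    # Sort citation keys so the source text is deterministic.
    for cite_id, ref in sorted(record["ref_abstract"].items()):
        parts.append(f"[{cite_id}] {ref['abstract']}")
    return "\n".join(parts)


records = json.loads(SAMPLE_JSON)
source = build_source(records[0])      # input documents for the summarizer
target = records[0]["related_work"]    # reference summary (related-work text)
```

Here `source` is what a summarization model would read and `target` is the text it would be trained or evaluated against. Concatenating everything into one string is only one possible design; a model that encodes each document separately would keep the abstracts as a list instead.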

Details

Task: Summarization
Language: English
Format: JSON
Rows / instances: 40,528
Creator: Lu et al.
Year: 2020