Skip to content

WikiSplit

Sentence SimplificationEnglish

WikiSplit is a sentence simplification dataset in English from Botha et al. with 1 records in TSV format.

About WikiSplit

Dataset contains 1 million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.

Details

Task
Sentence Simplification
Language
English
Format
TSV
Rows / instances
1M
Creator
Botha et al.
Year
2018
Download Paper

Related Sentence Simplification datasets

FAQ