Skip to content

Reuters-21578 Benchmark Corpus

ClassificationEnglish

The Reuters-21578 Benchmark Corpus dataset is a English classification resource from Lewis et al. at 1997 comprising 10,788 examples.

About Reuters-21578 Benchmark Corpus

Dataset is a collection of 10,788 documents from the Reuters financial newswire service, partitioned into a training set with 7769 documents and a test set with 3019 documents.

Details

Task
Classification
Language
English
Format
TSV
Rows / instances
10,788
Creator
Lewis et al.
Year
1997
Download

Related Classification datasets

FAQ