
ccdv/govreport-summarization

Summarization, Text Generation, EN

Created by ccdv in 2022, ccdv/govreport-summarization is an English-language (EN) summarization dataset containing 19,463 records in Parquet format. It has 6.2K downloads and 61 likes, and falls in the 10K<n<100K size category.

About ccdv/govreport-summarization

GovReport dataset for summarization. Dataset for summarization of long documents. Adapted from this repo and this paper. This dataset is compatible with the run_summarization.py script from Transformers if you add this line to the summarization_nam...
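
As a rough illustration of how the dataset might be loaded and inspected, here is a minimal Python sketch. It assumes the Hugging Face `datasets` library is installed and the Hub is reachable. The column names `report` and `summary`, the `train` split name, and the helper names `load_govreport` and `preview` are assumptions for illustration and are not stated on this page.

```python
DATASET_ID = "ccdv/govreport-summarization"


def load_govreport(split="train"):
    """Load one split from the Hugging Face Hub (requires network access).

    The split name "train" is an assumption; check the dataset page
    for the splits that are actually available.
    """
    # Imported lazily so this module can be imported without `datasets`.
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split=split)


def preview(record, text_key="report", summary_key="summary", width=200):
    """Return a truncated view of one record for quick inspection.

    `text_key` and `summary_key` are assumed column names; pass other
    keys if the actual schema differs.
    """
    return {
        "text": record[text_key][:width],
        "summary": record[summary_key][:width],
    }
```

A typical use would be `preview(load_govreport("train")[0])`, which shows the first 200 characters of one long document next to its summary. Truncation is useful here because the source documents are long.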

Details

Task: Summarization, Text Generation
Language: EN
Format: Parquet
Rows / instances: 19,463
Size: 10K<n<100K
Creator: ccdv
Year: 2022
Downloads: 6,163
Likes: 61
