euirim/goodwiki
Text Generation, Summarization, EN, mit
Created by euirim in 2023, euirim/goodwiki is an English (EN) dataset for text generation and summarization, distributed in Parquet format. It has 306 downloads and 54 likes, is released under the MIT license, and falls in the 10K<n<100K size category.
About euirim/goodwiki
GoodWiki Dataset
GoodWiki is a 179 million token dataset of English Wikipedia articles collected on September 4, 2023, that have been marked as Good or Featured by Wikipedia editors. The dataset provides these articles in GitHub-flavored Markdo...
Details
- Task: Text Generation, Summarization
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Size: 10K<n<100K
- Creator: euirim
- Year: 2023
- License: mit
- Downloads: 306
- Likes: 54
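
Since the page lists the repository id (euirim/goodwiki) and the Parquet format, a minimal Python sketch of loading the dataset and summarizing its text may help. It assumes the Hugging Face `datasets` library is installed, that the split is named `train`, and that the article text is stored in a column called `markdown`. The split and column names are assumptions, not confirmed by this page, and the helper names `load_goodwiki` and `text_stats` are hypothetical.

```python
from typing import Any, Iterable


def load_goodwiki(split: str = "train"):
    """Load the dataset from the Hugging Face Hub (needs network access)."""
    # Imported lazily so the rest of this sketch works without the library.
    from datasets import load_dataset

    # "euirim/goodwiki" is the repository id given on this page;
    # the "train" split name is an assumption.
    return load_dataset("euirim/goodwiki", split=split)


def text_stats(
    rows: Iterable[dict[str, Any]], text_field: str = "markdown"
) -> dict[str, float]:
    """Count rows and characters in one text column.

    `text_field` defaults to "markdown", an assumed column name.
    Rows where the field is missing or None count as empty text.
    """
    n_rows = 0
    n_chars = 0
    for row in rows:
        n_rows += 1
        n_chars += len(row.get(text_field) or "")
    mean_chars = n_chars / n_rows if n_rows else 0.0
    return {"rows": n_rows, "chars": n_chars, "mean_chars": mean_chars}
```

With network access, `text_stats(load_goodwiki())` would report the row count and average article length in characters. If the column name differs, pass the actual name as `text_field`.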