wikimedia/wikisource
Text GenerationFill MaskAR, AS, AZ
Wikimedia/wikisource is a text generation dataset in AR, AS, AZ from wikimedia in Parquet format.
About wikimedia/wikisource
Dataset Card for Wikimedia Wikisource
Dataset Summary
Wikisource dataset containing cleaned articles of all languages.
The dataset is built from the Wikisource dumps (https://dumps.wikimedia.org/)
with one subset per language, each c...
Details
- Task
- Text Generation, Fill Mask
- Language
- AR, AS, AZ
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- wikimedia
- Year
- 2022