allenai/dolma3_pool
Text GenerationENodc-by
Allenai/dolma3_pool is a text generation dataset in EN from allenai in Parquet format. It is distributed under the odc-by license, and has been downloaded 31.3K times.
About allenai/dolma3_pool
⚠️ IMPORTANT NOTICE ⚠️
This is the Dolma 3 pool, pre–quality upsampling and mixing.
If you are interested in the data used to train Olmo 3 7B and Olmo 3 32B, visit allenai/dolma3_mix-6T-1025.
Dolma 3 Pool
The Dolma 3 pool is a...
Details
- Task
- Text Generation
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- allenai
- Year
- 2025
- License
- odc-by
- Downloads
- 31288
- Likes
- 36