Skip to content

allenai/dolma3_pool

Text GenerationENodc-by

Allenai/dolma3_pool is a text generation dataset in EN from allenai in Parquet format. It is distributed under the odc-by license, and has been downloaded 31.3K times.

About allenai/dolma3_pool

⚠️ IMPORTANT NOTICE ⚠️ This is the Dolma 3 pool, pre–quality upsampling and mixing. If you are interested in the data used to train Olmo 3 7B and Olmo 3 32B, visit allenai/dolma3_mix-6T-1025. Dolma 3 Pool The Dolma 3 pool is a...

Details

Task
Text Generation
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
allenai
Year
2025
License
odc-by
Downloads
31288
Likes
36
Download Homepage

Related Text Generation datasets

FAQ