allenai/dolma3_dolmino_pool
Text GenerationENodc-by
Created by allenai at 2025, the allenai/dolma3_dolmino_pool is a text generation dataset in EN in Parquet format. With 17.6K downloads and 8 likes, it is actively used by the community. It is released under the odc-by license.
About allenai/dolma3_dolmino_pool
⚠️ IMPORTANT NOTICE ⚠️
This is the Dolma 3 Dolmino pool; it hasn't been mixed.
If you are interested in the data used to train:
Olmo 3 7B: allenai/dolma3_dolmino_mix-100B-1025
Olmo 3 32B: allenai/dolma3_dolmino_mix-100B-1125
Dolma...
Details
- Task
- Text Generation
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- allenai
- Year
- 2025
- License
- odc-by
- Downloads
- 17649
- Likes
- 8