AI-MO/NuminaMath-LEAN

General NLP, English, apache-2.0

Created by AI-MO in 2025, AI-MO/NuminaMath-LEAN is an English-language dataset, cataloged under General NLP, containing 104,155 records in Parquet format. It has 327 downloads and 59 likes. It is released under the apache-2.0 license and falls in the 100K<n<1M size category.

About AI-MO/NuminaMath-LEAN

Dataset Card for NuminaMath-LEAN

Dataset Summary

NuminaMath-LEAN is a large-scale dataset of 100K mathematical competition problems formalized in Lean 4. It is derived from a challenging subset of the NuminaMath 1.5 dataset, focusin...

Details

Task: General NLP
Language: English
Format: Parquet
Rows / instances: 104,155
Size: 100K<n<1M
Creator: AI-MO
Year: 2025
License: apache-2.0
Downloads: 327
Likes: 59
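
The listed row count and size category can be cross-checked with a few lines of Python. The following is a minimal sketch, not part of the dataset card: `size_bucket` and `count_rows` are hypothetical helper names, the bucket labels are a simplified version of the size-category scheme, and `count_rows` assumes the third-party `datasets` library is installed and that the repository is reachable under the ID shown on this page.

```python
def size_bucket(n_rows: int) -> str:
    """Map a row count to a size-category label (simplified, illustrative scheme)."""
    bounds = [
        (1_000, "n<1K"),
        (10_000, "1K<n<10K"),
        (100_000, "10K<n<100K"),
        (1_000_000, "100K<n<1M"),
        (10_000_000, "1M<n<10M"),
    ]
    for upper, label in bounds:
        if n_rows < upper:
            return label
    return "n>10M"  # everything larger is lumped together in this sketch


def count_rows(repo_id: str = "AI-MO/NuminaMath-LEAN") -> int:
    """Download the dataset and sum the rows over all of its splits.

    Assumption: the `datasets` package is installed and network access is
    available. The import is kept local so that size_bucket works without it.
    """
    from datasets import load_dataset

    splits = load_dataset(repo_id)          # DatasetDict keyed by split name
    return sum(splits.num_rows.values())    # num_rows maps split name -> row count


if __name__ == "__main__":
    # Uses the row count stated on this page; no download is needed for this check.
    print(size_bucket(104_155))
```

Calling `count_rows()` downloads the Parquet files, so it is left out of the main block; its result can be passed to `size_bucket` to compare against the category listed here.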