Skip to content

google-research-datasets/mbpp

General NLPENBenchmarkcc-by-4.0

The google-research-datasets/mbpp dataset is a EN General NLP resource from google-research-datasets at 2026 comprising 1,401 examples. With 161.7K downloads and 231 likes, it is actively used by the community. It is released under the cc-by-4.0 license and is a 1K<n<10K-scale dataset.

📊 This dataset is used as an LLM benchmark. See model leaderboards →

About google-research-datasets/mbpp

Dataset Card for Mostly Basic Python Problems (mbpp) Dataset Summary The benchmark consists of around 1,000 crowd-sourced Python programming problems, designed to be solvable by entry level programmers, covering programming f...

Details

Task
General NLP
Language
EN
Format
Parquet
Rows / instances
1401
Size
1K<n<10K
Creator
google-research-datasets
Year
2026
License
cc-by-4.0
Downloads
161740
Likes
231
Download Homepage

Related General NLP datasets

FAQ