Skip to content

SWE-bench

CodeEnglishBenchmark

SWE-bench is a code benchmark dataset in English from Princeton University (Jimenez et al.) with 2,294 records in JSON format.

📊 This dataset is used as an LLM benchmark. See model leaderboards →

Details

Task
Code
Language
English
Format
JSON
Rows / instances
2,294
Creator
Princeton University (Jimenez et al.)
Year
2023
Download Paper

Related Code datasets

FAQ