SWE-bench
CodeEnglishBenchmark
SWE-bench is a code benchmark dataset in English from Princeton University (Jimenez et al.) with 2,294 records in JSON format.
📊 This dataset is used as an LLM benchmark. See model leaderboards →
Details
- Task
- Code
- Language
- English
- Format
- JSON
- Rows / instances
- 2,294
- Creator
- Princeton University (Jimenez et al.)
- Year
- 2023