Question 1

What is the ScaleAI/SWE-bench_Pro dataset?

Accepted Answer

Dataset Summary

SWE-Bench Pro is a challenging, enterprise-level dataset for testing agent ability on long-horizon software engineering tasks.
Paper: https://static.scale.com/uploads/654197dc94d34f66c0f5184e/SWEAP_Eval_Scale%20(9).pdf
See the r...

Question 2

Is ScaleAI/SWE-bench_Pro a benchmark?

Accepted Answer

Yes — ScaleAI/SWE-bench_Pro is used as an LLM benchmark. See model leaderboards in the Benchmarks section.

Question 3

Where can I download ScaleAI/SWE-bench_Pro?

Accepted Answer

ScaleAI/SWE-bench_Pro is available at its source: https://huggingface.co/datasets/ScaleAI/SWE-bench_Pro.

ScaleAI/SWE-bench_Pro

About ScaleAI/SWE-bench_Pro

Details

Related General NLP datasets

FAQ