SWE-bench introduced in: Jimenez et al. 2024 — software engineering benchmark from GitHub issues.
Object
Jimenez et al. 2024 — software engineering benchmark from GitHub issues
Primary source · preprint · 2023-10-10
SWE-bench: Can Language Models Resolve Real-World GitHub Issues? — arXiv (Jimenez, Yang, Wettig, Yao, Pei, Press, Narasimhan / Princeton + Chicago)