
What will be the highest score achieved on SWE-Bench Verified in 2025?
Plus
13
Ṁ12002026
1D
1W
1M
ALL
9%
<70
31%
70-85 inclusive
60%
>85
https://openai.com/index/introducing-swe-bench-verified/
https://www.swebench.com/
Highest performance reported before 2026. Any run on https://www.swebench.com/ counts. Large AI company reported numbers count whether or not they're listed on swebench.com Other claimed scores will generally not be counted unless verified by a third party.
Get Ṁ1,000 play money
Related questions
Related questions
What will be the best normalized score achieved on the original 7 RE-Bench tasks by December 31st 2025?
What will be the best performance on SWE-bench Verified by December 31st 2025?
When will SWE-bench be solved?
AI resolves at least X% on SWE-bench WITH assistance, by 2028?
AI resolves at least X% on SWE-bench without any assistance, by 2028?
Codebuff solves at least 40% of issues on SWE-Bench by March 31, 2025
17% chance
What will be the best score on Cybench by December 31st 2025?
BIG-bench accuracy 75% #3: Will SOTA for a single model on BIG-bench pass 75% by the start of 2026?
86% chance
Will an autonomous agent resolve 90% of tasks on SWE-bench by 2026?
63% chance
What will be true of the SOTA AI on the FrontierMath benchmark, before 2026?