Resolves yes after GPT-5 is first benchmarked on IMO-2025. OpenAI's own reporting counts. Also resolves yes if the model achieves Silver or Gold.
Update 2025-07-25 (PST) (AI summary of creator comment): The creator has clarified the conditions under which the model's performance will be evaluated:
No scaffolds are permitted.
The model must be prompted with the questions exactly as written.
The model must not have access to tools or the internet.
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ206 | |
2 | Ṁ28 | |
3 | Ṁ16 | |
4 | Ṁ15 | |
5 | Ṁ11 |
Seems largely dependent on whether this market permits custom scaffolds by external researchers. If Gemini 2.5 Pro could win Gold with custom elicitation, then GPT-5 could likely get at least Bronze. https://arxiv.org/abs/2507.15855
@bh For this market I'll say no scaffolds. Model must simply be prompted with the questions exactly as written with no tools are internet access