Will OpenAI's next-generation model score 65% or higher on the GPQA benchmark?
Ṁ803 · resolved Sep 16
Resolved YES
Resolves YES if OpenAI's next-generation language model scores 65% or higher on the GPQA benchmark (extended set).
If an existing OpenAI model reaches 65% or higher through post-training enhancements, that also counts.
Scores may improve through prompt engineering after release, but since I don't know how long I should wait, I will resolve this question as soon as OpenAI releases its next model.
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | | Ṁ168 |
2 | | Ṁ68 |
3 | | Ṁ23 |
4 | | Ṁ22 |
5 | | Ṁ11 |
Related questions
Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?
98% chance
Will OpenAI's next major LLM (after GPT-4) surpass 74% accuracy on the GPQA benchmark?
97% chance
Will OpenAI's next major LLM (after GPT-4) achieve over 50% resolution rate on the SWE-bench benchmark?
99% chance
In what year will AI achieve a score of 95% or higher on the GPQA benchmark?
-
Will OpenAI models achieve ≥90% on SimpleBench by the end of 2025?
36% chance
Will any AI model score above 95% on GRAB by the end of 2025?
40% chance
Will any AI model score >80% on Epoch's Frontier Math Benchmark in 2025?
6% chance
Open-Source AI model gets perfect IMO 2026 score? [International Math Olympiad 2026]
31% chance
Will the gap between open-weights and frontier models on GPQA Diamond be at most 7%?
49% chance
Will OpenAI announce a new model that EpochAI estimates is at least as large as GPT-4.5, in 2025?
34% chance