Will Grok 4 achieve over 69% on SimpleBench?
15
Ṁ1128
Resolved NO on Jul 18

Grok 4, xAI's latest AI model, has demonstrated significant improvements over its predecessors. SimpleBench is a benchmark designed to evaluate AI models on spatio-temporal reasoning, social intelligence, and linguistic adversarial robustness. As of July 2025, Grok 4 has not been publicly tested on SimpleBench. This market resolves to 'Yes' if Grok 4 achieves a score above 69% on SimpleBench. Verification will be based on official announcements from xAI or SimpleBench's leaderboard updates.

SimpleBench Leaderboard


🏅 Top traders

#  Total profit
1  Ṁ107
2  Ṁ30
3  Ṁ20
4  Ṁ7
5  Ṁ6
bought Ṁ350 NO

Grok 4 scored 60.5% (non-heavy).

bought Ṁ10 YES

I dumped the 10 public questions from SimpleBench, in JSON format without answers, into Grok 4 in one message on lmarena and got 8/10 correct (it missed no. 6 and no. 10), so that's something.
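
As a rough check on how much an 8/10 result on the public questions can tell us, here is a minimal Python sketch of a Wilson score interval. It assumes the 10 public questions behave like independent draws from the full benchmark, which may not hold; the function name `wilson_interval` and the 95% level (z = 1.96) are illustrative choices, not anything from SimpleBench.

```python
import math

def wilson_interval(successes, trials, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 is roughly 95%)."""
    p = successes / trials
    denom = 1 + z * z / trials
    center = (p + z * z / (2 * trials)) / denom
    half_width = z * math.sqrt(p * (1 - p) / trials + z * z / (4 * trials * trials)) / denom
    return center - half_width, center + half_width

# 8 correct out of the 10 public questions, as reported in the comment above.
low, high = wilson_interval(8, 10)
THRESHOLD = 0.69  # the market's 69% bar, as a fraction

print(f"95% interval: {low:.1%} to {high:.1%}")
print("threshold inside interval:", low < THRESHOLD < high)
```

Under these assumptions the interval runs from roughly 49% to 94%, so a 10-question sample is consistent with scores both well below and well above 69%.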

SOTA is Gemini 2.5 Pro (06-05) at 62.4%, and new models have generally improved by 1-3 percentage points.
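
The arithmetic behind that comment can be sketched in a few lines of Python. This assumes the typical 1-3 point gain would apply on top of the 62.4% SOTA figure; the variable and function names are illustrative, not part of SimpleBench or Manifold.

```python
# Figures quoted in the thread; names are illustrative.
SOTA_SCORE = 62.4          # Gemini 2.5 Pro (06-05), percent
TYPICAL_GAIN = (1.0, 3.0)  # typical improvement of a new model, percentage points
THRESHOLD = 69.0           # market resolves YES only above this score

def projected_range(sota, gain):
    """Return the (low, high) score a new model would reach under the typical gain."""
    gain_low, gain_high = gain
    return sota + gain_low, sota + gain_high

low, high = projected_range(SOTA_SCORE, TYPICAL_GAIN)
required_gain = THRESHOLD - SOTA_SCORE  # points needed just to reach the threshold

print(f"projected range: {low:.1f}% to {high:.1f}%")
print(f"gain needed to reach {THRESHOLD:.0f}%: {required_gain:.1f} points")
print("clears threshold under typical gain:", high > THRESHOLD)
```

Under these assumptions a typical gain tops out at about 65.4%, so Grok 4 would have needed a jump of more than 6.6 points over the prior SOTA to resolve YES.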