Will Grok 4 achieve over 69% on SimpleBench?

Ṁ1128

resolved Jul 18

Resolved

Grok 4, xAI's latest AI model, has demonstrated significant improvements over its predecessors. SimpleBench is a benchmark designed to evaluate AI models on spatio-temporal reasoning, social intelligence, and linguistic adversarial robustness. As of July 2025, Grok 4 has not been publicly tested on SimpleBench. This market resolves to 'Yes' if Grok 4 achieves a score above 69% on SimpleBench. Verification will be based on official announcements from xAI or SimpleBench's leaderboard updates.

SimpleBench Leaderboard

🏅 Top traders

#	Name	Total profit
1		Ṁ107
2		Ṁ30
3		Ṁ20
4		Ṁ7
5		Ṁ6

3 Comments

bought Ṁ350 NO

60.5% (non-heavy).

bought Ṁ10 YES

I dumped the 10 public SimpleBench questions (JSON format, without answers) into Grok 4 in a single message on lmarena and got 8/10 correct (nos. 6 and 10 were wrong), so that's something.

SOTA is Gemini 2.5 Pro (06-05) at 62.4%, and new models have generally improved by 1-3 percentage points.
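For what it's worth, the arithmetic behind that comment can be sketched in a few lines of Python. This is only a rough illustration: the 62.4% and 1-3 point figures are taken from the comment as quoted, the 69% threshold is the market's criterion, and the function names `projected_range` and `resolves_yes` are made up for this example.

```python
# Figures quoted in the thread; treat them as the commenters' claims, not verified data.
SOTA_SCORE = 62.4          # Gemini 2.5 Pro (06-05), per the comment
TYPICAL_GAIN = (1.0, 3.0)  # typical percentage-point gain of a new model, per the comment
THRESHOLD = 69.0           # market resolves YES only for a score strictly above this


def projected_range(sota, gain):
    """Return the (low, high) score a new model would reach with a typical gain."""
    low_gain, high_gain = gain
    return sota + low_gain, sota + high_gain


def resolves_yes(score, threshold=THRESHOLD):
    """Apply the market criterion: the score must be above the threshold."""
    return score > threshold


low, high = projected_range(SOTA_SCORE, TYPICAL_GAIN)
print(f"projected range: {low:.1f}% to {high:.1f}%")
print(f"gain needed to clear the threshold: more than {THRESHOLD - SOTA_SCORE:.1f} points")
print("top of projected range resolves YES:", resolves_yes(high))
```

Under those assumptions, even the top of the typical range stays below 69%, so a YES would need a jump of more than twice the usual improvement.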
