Gemini 3's execution time-horizon?
20
Ṁ1696
Dec 31

Invalid contract

On the task described in https://arxiv.org/abs/2509.09677, what will be the length of tasks that Gemini 3 will be able to complete in one go?

I'm an author, and I will run the same setup above^ to resolve this.

Currently:
GPT-5 Thinking is 1024
Claude 4 Sonnet is 432
Grok-4 is 384

Get Ṁ1,000 play money
Sort by:

added some liquidity

@Bayesian thanks!