Llama 4 Maverick gets 32+ on the Aider polyglot benchmark before May?
6
Ṁ875May 1
23%
chance
1D
1W
1M
ALL

doesn't count a reasoning model built on top of maverick.
Get Ṁ1,000 play money
Related questions
Related questions
Will Llama 4 be the best LLM in the chatbot arena?
14% chance
Will the GPT4+code-interpreter+search score > 1350 on Lmsys Arena Leaderboard?
49% chance
Will any open source LLM with <20 billion parameters outperform GPT-4 on most language benchmarks by the end of 2024?
13% chance
Will an open-source LLM beat or match GPT-4 by the end of 2024?
83% chance
Will xAI release an LLM with BIG-Bench score as good as GPT-4 Turbo before the end of 2024?
55% chance
Grok 3 MMLU Benchmark Score