Llama 4 Maverick gets 32+ on the Aider polyglot benchmark before May? | Manifold

Llama 4 Maverick gets 32+ on the Aider polyglot benchmark before May?

6

Ṁ875

May 1

23%

chance

1D

1W

1M

ALL

doesn't count a reasoning model built on top of maverick.

#Technical AI Timelines

#IMO Grand Challenge

#Large language models

Get Ṁ1,000 play money

Related questions

Will Llama 4 be the best LLM in the chatbot arena?

+4% 1d14% chance

Will any open source LLM with <20 billion parameters outperform GPT-4 on most language benchmarks by the end of 2024?

Will xAI release an LLM with BIG-Bench score as good as GPT-4 Turbo before the end of 2024?

Will the GPT4+code-interpreter+search score > 1350 on Lmsys Arena Leaderboard?

Will an open-source LLM beat or match GPT-4 by the end of 2024?

Grok 3 MMLU Benchmark Score

Related questions

Will Llama 4 be the best LLM in the chatbot arena?

Will the GPT4+code-interpreter+search score > 1350 on Lmsys Arena Leaderboard?

Will any open source LLM with <20 billion parameters outperform GPT-4 on most language benchmarks by the end of 2024?

Will an open-source LLM beat or match GPT-4 by the end of 2024?

Will xAI release an LLM with BIG-Bench score as good as GPT-4 Turbo before the end of 2024?

Grok 3 MMLU Benchmark Score