
Will Claude 3.5 Opus beat OpenAI's best released model on the arena.lmsys.org leaderboard?
Mini
50
แน5482resolved Aug 1
Resolved as
5%1D
1W
1M
ALL
OpenAI's best released model could be GPT-4, GPT-4o, or something else. It does not count as an OpenAI model unless it's made available to the public to try, and is known to be from OpenAI (e.g. the model can not be a secret, pseudonymous release). If arena.lmsys.org is not available at the time, the successor site or most similar leaderboard will be used.
Resolves yes if Claude 3.5 Opus is ranked above all OpenAI models 1 week after it is put on the leaderboard.
Update 2025-01-01 (PST) (AI summary of creator comment): - Models must be listed on lmarena to be counted.
Examples:
o1 pro does not count since it's not on the arena.
Regular o1 does count.
Get แน1,000 play money
๐ Top traders
# | Name | Total profit |
---|---|---|
1 | แน1,066 | |
2 | แน247 | |
3 | แน130 | |
4 | แน96 | |
5 | แน61 |
Sort by:
Related questions
Related questions
Will Claude 3.5 Opus have a higher Chat Arena Elo than GPT-5?
10% chance
Will Claude Opus be ranked in the top 20 on the Chatbot Arena Leaderboard two years from today (3/10/24)?
6% chance
Will Claude 4 achieve over 95% on the MMLU-Pro benchmark by end of 2025?
12% chance
Will Claude 3.5 Opus be available via API by end of 2025?
3% chance
Will the top model by OpenAI rank 3rd (or lower) behind 2 other model families at any point before 2026?
69% chance