Which company has best AI model end of March? (Chatbot Arena Leaderboard)
Which company has best AI model end of March? (Chatbot Arena Leaderboard)
➕
Plus
197
Ṁ740k
resolved Apr 1
100%99.0%
Google
1.0%
OpenAI
0.0%
xAI
0.0%
Anthropic
0.0%
Meta
0.0%
Alibaba
0.0%
DeepSeek
0.0%Other

Resolution is based on the chatbot arena LLM leaderboard (https://lmarena.ai), specifically the company with the highest Arena Score in the Overall category without filters (without style control or show deprecated), at the end of March 31, 2025 11:59PM ET.

See also:

Get Ṁ1,000 play money

🏅 Top traders

#NameTotal profit
1Ṁ7,221
2Ṁ2,331
3Ṁ1,792
4Ṁ954
5Ṁ688


Sort by:
17d

fucking brutal

17d

@MalachiteEagle eh it's cope. There are two labs above OA on lmarena. Sam's days are numbered.

17d

@jim haha guess we'll see. I think OpenAI has some internal results with o4 (or whatever they're calling it) that are quite far beyond anything Google has.

17d

17d

@jim oh and the image stuff is just with gpt-4o, will be interesting to see how fast they ship gpt-4.5 image capabilities

17d

@MalachiteEagle is it confirmed that 4.5 can generate images natively? i thought 4o was unique in its multimodality?

17d

@SaviorofPlant I haven't seen them explicitly confirm it, but I think the likelihood is very high, especially since it was originally intended to be their gpt-5 model

17d

@MalachiteEagle i think 4o was trained more recently than 4.5, which has an october 2023 knowledge cutoff. seems likely to me that 4o was an experimental model testing this type of multimodality and previous models did not have it

17d

@SaviorofPlant they have a fine-tuning/continual-learning technique allowing them to update the knowledge cutoff from a given base model

17d

@SaviorofPlant they got multimodal working with gpt-4o, released it (without the multimodal output) and then started the work / training run for gpt-4.5. This produced lower-quality results than they hoped, so they went in circles for a few months until the episode with Q* happened

17d

@MalachiteEagle oh hang on, that's not right. They released gpt-4o in first half of 2024. Q* shenanigans happened in late 2023

17d

@SaviorofPlant but point being that I'm pretty sure gpt-4.5 training run was a scaled up version of what they got working for gpt-4o

boughtṀ550Google YES

@Bayesian Nice snag. Now go beat Hillary in Masters. 😆

bought Ṁ15 Google NO18d

@Quroe i might make more from losing we'll see

18d

nvm not really significant

answered1mo
Meta
bought Ṁ11,075 Meta NO20d

@jim you’re buying this at 2% when it’s available at 0.1c on polymarket

20d

in other words: not gonna happen

bought Ṁ500 Alibaba YES1mo
bought Ṁ100 Alibaba NO1mo

@Frankas impressive but doesn't seem likely to lead to a rank 1 placing

@jim their "max" model is supposedly coming soon

https://x.com/Alibaba_Qwen/status/1894130603513319842

opened a Ṁ3,000 OpenAI YES at 50% order1mo
1mo

Wouldn't they already be testing these models anonymously on lmsys arena if they were to take the lead there by the end of the month?

1mo

I take that to mean there's a new anonymous model on lmarena 🤔

Haven't seen anyone mention it on twitter yet though

1mo

@MalachiteEagle Not necessarily no, there’s 40 days left

1mo

Oh sorry I misread that as "before March".

Here's a question in case one of these appears tomorrow: https://manifold.markets/MalachiteEagle/will-there-be-a-new-and-smart-anony?play=true

What is this?

What is Manifold?
Manifold is a social prediction market with real-time odds on wide ranging news such as politics, tech, sports and more!
Participate for free in sweepstakes markets to win sweepcash which can be withdrawn for real money!
Are our predictions accurate?
Yes! Manifold is very well calibrated, with forecasts on average within 4 percentage points of the true probability. Our probabilities are created by users buying and selling shares of a market.
In the 2022 US midterm elections, we outperformed all other prediction market platforms and were in line with FiveThirtyEight’s performance. Many people who don't like trading still use Manifold to get reliable news.
Why should I trade?
Trading contributes to accurate answers of important, real-world questions and helps you stay more accountable as you make predictions.
Trade with S Sweepcash (𝕊) for a chance to win withdrawable cash prizes.
Get started for free! No credit card required.
What are sweepstakes markets?
There are two types of markets on Manifold: play money and sweepstakes.
By default all markets are play money and use mana. These markets allow you to win more mana but do not award any prizes which can be cashed out.
Selected markets will have a sweepstakes toggle. These require sweepcash to participate and allow winners to withdraw any sweepcash won to real money.
As play money and sweepstakes markets are independent of each other, they may have different odds even though they share the same question and comments.
Learn more.