What will be the first year that an AI gets gold on an International Mathematical Olympiad?
➕
Plus
51
Ṁ15k
2033
1.6%
2023
6%
2024
58%
2025
8%
2026
7%
2027
5%
2028
4%
2029
2%
2030
2%
2031
1.8%
2032
5%
Not by 2032

The market is meant to reflect the resolution criteria of /Austin/will-an-ai-get-gold-on-any-internat . I will defer to the creator of that market as well as the people who originally made the bet. Barring no further information, here is how I interpret the resolution criteria as they stand at market creation:

  • The AI code must be finalized before the release of the IMO it gets gold on.

  • The AI can be either fully NLP based or fully formal methods based or in between.

  • The AI getting gold must happen before the end of the year in question.

Only options present at market creation are valid.

Get Ṁ1,000 play money
Sort by:

Now that 2023 has come and gone with no IMO claims, I am putting a limit order on NO 2023 at 1% in case there were people whose loans have not vested and want to get out at that price.

In the spirit of sharing one's beliefs:

The 3% on 2023 seems too high to me: it requires that some relevant entity aimed for IMO gold *and* they succeeded (meaning a huge increase in AI math capabilities) *and* such a model has existed for over a month but it's not publicly known.

I think the combination of the first two is quite unlikely, even given that they are not independent. Taking the conjunction with the third statement puts the probability below 3% for me.

Regarding the other years: I think the main market (https://manifold.markets/Austin/will-an-ai-get-gold-on-any-internat), currently at 31%, is too high (and I've been betting NO on it). This market's 7%+17%=24% for 2024-2025 seems reasonable to me. For later years I have no strong opinions - long-term future forecasts are hard, doubly so in AI - and have not bet on them.

@Loppukilpailija I think the last condition of your 2023 decomposition is not necessarily necessary. If an LLM for which dataset collection finished in say May 2023, but training finishes in November 2023, achieves gold on this year's IMO (so on IMO whose problems were not in its training set), my impression is that that would probably count as gold in 2023 by judgment of Christiano/Yudkowsky.

Yudkowsky's quote from the bet post which I think supports this kind of pragmatism:
> I see they want to demand that the AI be open-sourced publicly before the first day of the IMO, which unfortunately sounds like the sort of foolish little real-world obstacle which can prevent a proposition like this from being judged true even where the technical capability exists.  I'll stand by a >16% probability of the technical capability existing by end of 2025

@Lovre The original blog post was sparked by the IMO grand challenge, which is a challenge to write an AI that produces machine-checkable proofs. The IMO grand challenge indeed requires the program to be open source.

However, Paul Christiano has clarified in a comment on the main market that he will concede the bet if the AI produces a natural language proof, no need for it to be machine-checked. Since he is being more liberal with the conditions, I'm almost sure that he will concede the bet even if the AI is not open source.

@Lovre I indeed did not think of the scenario "the code has already been written before the IMO, but the model is still being trained". I don't think that's particularly likely, but it makes me a bit less confident in that the year cannot be 2023.

relevant individual markets:
(note that all the "by"'s are inclusive, except maybe for the 2032 market)

https://manifold.markets/BoltonBailey/will-an-ai-get-gold-on-any-internat-2b53f4a252d8#wRwN6ej6uONGMXD9IjMN and https://manifold.markets/Austin/will-an-ai-get-gold-on-any-internat are currently at 9% and 39%, which means that 2025 should have 30% probability. Who wants to arbitrage with the current 17% on this market?

I made a version of this market specifically for the (formalized) IMO grand challenge:

I'm avoiding trading but note that the market linked in the description is at 38% now, while the probabilities here would imply 26%.

@BoltonBailey I've moved the timelines up for this reason. I think the Metaculus one is slightly unreliable as around 10% of the probability mass is in the past.