What will be true of OpenAI's Orion (GPT-4.5) model?
➕
Plus
82
Ṁ9910
Feb 22
96%
The preparedness scorecard for the model will not be above Medium risk for any category
88%
It will score better on SWE-Bench Verified than Claude 3.5 Sonnet (October version)
83%
It will cost more than 4o on the API ($10/1M output tokens)
82%
It will be able to output audio without calling another model
72%
It will score better on GPQA than o1-preview (73% pass@1)
59%
It will be released before Claude 3.5 Opus or Claude 4
54%
Once it is available to the public, a Manifold poll asking if it is better or worse than expected will find that it is better than expected
34%
It will have a context window of >= 500K tokens
33%
It will cost more than 5x more than 4o ($50/1 million output tokens)
26%
It will be able to take video as input
25%
It will be able to output images without calling another model
15%
It will have a context window of >= 1 million tokens
1.0%
It will be called GPT-5

  • Update 2024-21-12 (PST): - If Orion is not planned for release, most options will be resolved as N/A (AI summary of creator comment)

  • Update 2025-02-13 (PST) (AI summary of creator comment): Resolution Update:

    • Release Confirmation: Orion is now confirmed to be released.

    • Resolution Timing: The market will resolve next month based on this confirmed release.

  • Update 2025-02-16 (PST) (AI summary of creator comment): Anthropic Model Naming Exception

    • If Anthropic releases its reasoning model before Orion, but it is not named Claude 3.5 Opus or Claude 4, the market will resolve YES.

Get Ṁ1,000 play money
Sort by:
It will be released before Claude 3.5 Opus or Claude 4
bought Ṁ168 It will be released ... NO

@SaviorofPlant How would this be resolved if Anthropic had released its reasoning model before Orion, but it would be not named Claude 4?

sold Ṁ20 It will be released ... YES

@JanPydych Would resolve YES. Don't really understand why it's trading so low, seems plausible the new Anthropic model will not be named either of those things?

@SaviorofPlant To be honest, I think that one of the options could be the release of a new checkpoint of Claude 3.5 Sonnet, but with the addition of the "reasoning_effort" parameter (or however Anthropic will name it).

It will be called GPT-5

@SaviorofPlant As I understand, according to the latest from Sam Altman, GPT-5 is planned to be a combination of 4.5 and o3, or something like that? Will probably N/A this option in that case.

bought Ṁ20 It will be able to t... YES

What happens if it is not released by Jan 1?

bought Ṁ40 It will score better... NO

@JoshYou I will extend the close date of this market until the release is announced.

If the release is delayed and it's unclear whether a released model is Orion, I'll wait for high quality reporting on whether or not a new model is Orion or not. If this never comes, every answer N/As (besides the "it will be released before X" options).

@SaviorofPlant Based on The Information articles, Orion is apparently not planned for release. Not sure how long I'll wait before N/Aing most of these

@SaviorofPlant Looks like it's being released after all, this market should resolve next month. Reopening for a week