Will OpenAI Release a Model Capable of Reliably performing Gradeschool Math from Reasoning by Jan 1, 2025? | Manifold

Will OpenAI Release a Model Capable of Reliably performing Gradeschool Math from Reasoning by Jan 1, 2025?

Plus

36

Ṁ1409

Jan 1

79%

chance

1D

1W

1M

ALL

I.e. will a "Q*" model be released?

GPT-5 Capabilities

#️ Technology

#Technical AI Timelines

Get Ṁ1,000 play money

Sort by:

How do you define "gradeschool math"? Does it include gradeschool Geometry?

How do you measure "reliably"?

predicts YES

@0482 sentiment

@Luca3f84 fair, but given this subjective criteria I will challenge any YES resolution is the success rate is below 90%, there are obvious consistent blind spots, or Manifold poll will show majority says it is not reliable

predicts YES

@0482 completely fair. I would do the same. Fwiw, the way the question is worded, it excludes the use of code interpreter, mere interpretation from vision etc.

Related questions

Will an AI achieve >85% performance on the FrontierMath benchmark before 2028?

Will Open AI release a model that can reliably compute a 20 digits multiplication correctly in 2025?

Will any AI model score >80% on Epoch's Frontier Math Benchmark in 2025?

Will OpenAI claim that it has achieved AGI in 2025?

Will OpenAI offer a model that updates its weights while running during 2025?

Will any AI model achieve > 40% on Frontier Math before 2026?

Will OpenAI release a model which generates images using reasoning / inference-time scaling before 2026?

Will OpenAI models achieve ≥90% on SimpleBench by the end of 2025?

Will OpenAI fold in 2025?

Will OpenAI announce a new model that EpochAI estimates is at least as large as GPT-4.5, before August 2026?

Related questions

Will an AI achieve >85% performance on the FrontierMath benchmark before 2028?

Will any AI model achieve > 40% on Frontier Math before 2026?

Will Open AI release a model that can reliably compute a 20 digits multiplication correctly in 2025?

Will OpenAI release a model which generates images using reasoning / inference-time scaling before 2026?

Will any AI model score >80% on Epoch's Frontier Math Benchmark in 2025?

Will OpenAI models achieve ≥90% on SimpleBench by the end of 2025?

Will OpenAI claim that it has achieved AGI in 2025?

Will OpenAI fold in 2025?

Will OpenAI offer a model that updates its weights while running during 2025?

Will OpenAI announce a new model that EpochAI estimates is at least as large as GPT-4.5, before August 2026?