Will OpenAI Release a Model Capable of Reliably performing Gradeschool Math from Reasoning by Jan 1, 2025?
➕
Plus
36
Ṁ1309
Jan 1
77%
chance

I.e. will a "Q*" model be released?

Get Ṁ1,000 play money
Sort by:

How do you define "gradeschool math"? Does it include gradeschool Geometry?

How do you measure "reliably"?

predicts YES

@0482 sentiment

@Luca3f84 fair, but given this subjective criteria I will challenge any YES resolution is the success rate is below 90%, there are obvious consistent blind spots, or Manifold poll will show majority says it is not reliable

predicts YES

@0482 completely fair. I would do the same. Fwiw, the way the question is worded, it excludes the use of code interpreter, mere interpretation from vision etc.