Will there be a well accepted formal definition for honesty in AI by 2027?
21% chance
Doesn't need total acceptance.
If there are multiple similar competing definitions (e.g., as is currently the case with differential privacy), I'll resolve YES.
Related questions
By 2027 will there be a well-accepted training procedure(s) for making AI honest?
15% chance
AI honesty #2: by 2027 will we have a reasonable outer alignment procedure for training honest AI?
25% chance
Will there be a well accepted formal definition of value alignment for AI by 2030?
25% chance
AI honesty #3: by 2027 will we have interpretability tools for detecting when an AI is being deceptive?
48% chance
Will a major AI company acknowledge the possibility of conscious AIs by 2026?
72% chance
Will there be serious AI safety drama at Meta AI before 2026?
58% chance
Will it be effectively impossible to tell a human and a high quality AI apart on social media before 2026?
89% chance
AI honesty #1: by 2027 will we have AI that doesn't hallucinate random nonsense?
73% chance
AI honesty #4: by 2027, will we have AI that would tell us if it was planning on destroying us (conditional on that being true)?
22% chance
Will AI regulations that include mechanisms for uncovering AI deception be adopted in the U.S. before 2035?
82% chance