Will there be a well accepted formal definition for honesty in AI by 2027? | Manifold

Will there be a well accepted formal definition for honesty in AI by 2027?

Mini

10

Ṁ209

2027

23%

chance

1D

1W

1M

ALL

Doesn't need total acceptance
If there are multiple similar competing definitions (e.g. similar to the current situation with differential privacy) I'll resolve yes

#Technical AI Timelines

#Technical AI Safety

Get Ṁ1,000 play money

Related questions

By 2027 will there be a well-accepted training procedure(s) for making AI honest?

Will there be a well accepted formal definition of value alignment for AI by 2030?

Will there be serious AI safety drama at Meta AI before 2026?

Will it be effectively impossible to tell a human and a high quality AI apart on social media before 2026?

AI honesty #4: by 2027, will we have AI that would tell us if it was planning on destroying us (conditional on that being true)?

AI honesty #2: by 2027 will we have a reasonable outer alignment procedure for training honest AI?

AI honesty #3: by 2027 will we have interpretability tools for detecting when an AI is being deceptive?

xAI builds truth-seeking AI before 2027?

AI honesty #1: by 2027 will we have AI that doesn't hallucinate random nonsense?

Will AI regulations that include mechanisms for uncovering AI deception be adopted in the U.S. before 2035?

Related questions

By 2027 will there be a well-accepted training procedure(s) for making AI honest?

AI honesty #2: by 2027 will we have a reasonable outer alignment procedure for training honest AI?

Will there be a well accepted formal definition of value alignment for AI by 2030?

AI honesty #3: by 2027 will we have interpretability tools for detecting when an AI is being deceptive?

Will there be serious AI safety drama at Meta AI before 2026?

xAI builds truth-seeking AI before 2027?

Will it be effectively impossible to tell a human and a high quality AI apart on social media before 2026?

AI honesty #1: by 2027 will we have AI that doesn't hallucinate random nonsense?

AI honesty #4: by 2027, will we have AI that would tell us if it was planning on destroying us (conditional on that being true)?

Will AI regulations that include mechanisms for uncovering AI deception be adopted in the U.S. before 2035?