Will mechanistic/transformer interpretability [eg Neel Nanda] end up affecting p(doom) more than 5%? | Manifold

Will mechanistic/transformer interpretability [eg Neel Nanda] end up affecting p(doom) more than 5%?

Mini

4

Ṁ190

2223

10%

chance

1D

1W

1M

ALL

Get Ṁ1,000 play money

Related questions

What will Manifold's P(doom) be at the end of 2025?

-10% 1d20% chance

Will mechanistic interpretability have more academic impact than representation engineering by the end of 2025?

By 2035, will mechanistic interpretability enable Nobel Prize-winning work?

Will agent foundations [eg Scott Garrabrant] end up affecting p(doom) more than 5%?

Will mechanistic interpretability be essentially solved for GPT-4 before 2030?

What is your P(doom) right now? (used to resolve end of 2025 question)

Are Mixture of Expert (MoE) transformer models generally more human interpretable than dense transformers?

Will manifold markets meaningfully affect p(doom) by more than 3%?

Will davidad meaningfully affect p(doom) by more than 3%?

Will MIRI meaningfully affect p(doom) by more than 5%?

Related questions

What will Manifold's P(doom) be at the end of 2025?

What is your P(doom) right now? (used to resolve end of 2025 question)

Will mechanistic interpretability have more academic impact than representation engineering by the end of 2025?

Are Mixture of Expert (MoE) transformer models generally more human interpretable than dense transformers?

By 2035, will mechanistic interpretability enable Nobel Prize-winning work?

Will manifold markets meaningfully affect p(doom) by more than 3%?

Will agent foundations [eg Scott Garrabrant] end up affecting p(doom) more than 5%?

Will davidad meaningfully affect p(doom) by more than 3%?

Will mechanistic interpretability be essentially solved for GPT-4 before 2030?

Will MIRI meaningfully affect p(doom) by more than 5%?