Will mechanistic/transformer interpretability [eg Neel Nanda] end up affecting p(doom) more than 5%?
Mini
4
Ṁ1902223
10%
chance
1D
1W
1M
ALL
Get Ṁ1,000 play money
Related questions
Related questions
What will Manifold's P(doom) be at the end of 2025?
20% chance
What is your P(doom) right now? (used to resolve end of 2025 question)
POLL
Will mechanistic interpretability have more academic impact than representation engineering by the end of 2025?
72% chance
Are Mixture of Expert (MoE) transformer models generally more human interpretable than dense transformers?
45% chance
By 2035, will mechanistic interpretability enable Nobel Prize-winning work?
37% chance
Will manifold markets meaningfully affect p(doom) by more than 3%?
12% chance
Will agent foundations [eg Scott Garrabrant] end up affecting p(doom) more than 5%?
55% chance
Will davidad meaningfully affect p(doom) by more than 3%?
6% chance
Will mechanistic interpretability be essentially solved for GPT-4 before 2030?
12% chance
Will MIRI meaningfully affect p(doom) by more than 5%?
47% chance