Will we find polysemanticity via superposition in neurons in the brain before 2040?

Mini

Ṁ244

2040

64%

chance

ALL

Polysemantic neurons in a neural network fire on a wide conceptual range of inputs, or in other words do not correspond to a single semantic pattern.

A leading theory behind why Polysemantic artificial neurons form is superposition (https://arxiv.org/abs/2209.10652), roughly the theory that an overcomplete basis of concepts, represented by vectors, are packed into relatively low-dimensional representation space, at the cost of occasional interference effects.

Will we identify neurons which are polysemantic, in the sense that their firing pattern does not closely correspond to a human-interpretable cause or concept? For a YES resolution, we must find such neurons along with clear evidence that the primary cause of this polysemanticity is superposition.

One way to demonstrate superposition might involve showing that the neuron lies in a local cluster which corresponds to an artificial embedding space, and making accurate predictions about how interference will occur according to the theory of superposition.

The terms here are meant to be used loosely, in the sense that I will resolve YES in situations which do not exactly match this description as long as they fit the spirit of this market. For example, I’ll resolve YES if we observe polysemanticity not in neurons but among slightly larger circuits of neurons.

#Futurism

#Biology

#Mechanistic interpretability

#Neuroscience

#Polysemanticity

Get Ṁ1,000 play money

2 Comments

Sort by:

We already know of neurons which are polysemantic in the brain, in fact most of them are. Very rarely do we see a mostly monosemantic neuron. So, whether the cause of polysemanticity is superposition - that is the demonstration that question asks for.

Superposition hypothesis says that there are more features than dimensions available to represent them fully. That's it.

The toy models of superposition paper showed that while superposition might be a cause for polysemanticity, it is not the only one - indeed, they eliminate superposition to still find polysemanticity.

@firstuserhere Yes, at this point the most interesting question I am considering is what standard to use to determine whether superposition is a "primary cause." Likely I'll just adopt whatever standard 2040 treats as a sufficient metric for these sorts of questions.

Related questions

Related questions