[Prediction] We are in an Algorithmic Overhang

[-]gjm4y270

The linked post by Steven Byrnes uses the term "hardware overhang" to describe the situation where we have hardware much more powerful than would be needed but algorithms aren't good enough. Your post here uses the term "algorithmic overhang". One could justify either terminology, but clearly not both. (I think SB's is better. Maybe something more explicit like "hardware-ahead-of-algorithms overhang" would be better than either, since evidently different people have different intuitions about which of two opposite situations an "X overhang" describes.)

[-]lsusr4y30

The linked post by Steven Byrnes uses the term "hardware overhang" to describe the situation where we have hardware much more powerful than would be needed but algorithms aren't good enough. Your post here uses the term "algorithmic overhang". One could justify either terminology, but clearly not both.

Thank you for pointing this out. I think we are in a hardware-ahead-of-algorithms overhang.

[-]p.b.4y20

Maybe worth pointing out that "hardware overhang" is a pretty old (>10years) and well known term that afaik was not coined by Steven Byrnes. So your title must be confusing to quite a lot of people.

[-]gjm4y80

It is not clear whether, for each of your points 1-5, you are saying

"This is definitely true and I have conclusive evidence for it"
"I am confident that this is true but I don't have evidence that would reliably convince others"
"I think this is probably true"
"This might be true, which given the potential consequences is already enough to be interesting"
something else.

(I'm guessing probably "I think this is probably true"?)

[-]lsusr4y20

I'm not sure. I think it's at least 10%.

I don't really think in terms of beliefs. I think in terms of bets. My confidence is high enough to stake some reputation on a public prediction.

[-]Pattern4y20

The core learning algorithm [of] human beings could be written

[-]lsusr4y20

Fixed. Thanks.

[-]ShardPhoenix4y20

Can you expand on #3? Do you mean that this is happening now, that it could happen now (ie the data is already out there and just needs to be interpreted), or that it might happen in the near future (eg if brain science gets a bit better)? How sure are you of this?

[-]lsusr4y20

Can you expand on #3? Do you mean that this is happening now, that it could happen now (ie the data is already out there and just needs to be interpreted), or that it might happen in the near future (eg if brain science gets a bit better)?

Might have happened already. Probably hasn't yet. Einstein's papers were an anomaly. I wouldn't be surprised some of the insights can be found here and there in the haystack but they haven't been completed yet.

I don't think the timeline has much to do with practical brain science. Math tends to advance faster than biology. Brain science just puts an upper bound on how long it'll take to get there. In the slowest possible world, we crack the basic biological cortical algorithm by observing brains with high fidelity in real time.

How sure are you of this?

At least 10%.

[-]Bernhard4y10

First of all, kudos to you for making this public prediction.

To keep this brief: 1 (95%), 2 (60%), 3 (75%), 4(<<5%), 5 (<<1%)

I don't think we are in a hardware overhang, and my argument is the following:

Our brains are composed of ~10^11 neurons, and our computers of just as many transistors, so in a first approximation, we should already be there.

However, our brains have approximately 10^3 to 10^5 synapses per cell, while transistors are much more limited (I would guess maybe 10 on average?).

Even assuming that 1 transistor is "worth" one neuron, we come up short.

I remember learning that a perceptron with a single hidden layer of arbitrary width can approximate any function, and thereby any perceptron with finite width, but with more hidden layer. (I think this is called the "universal approximaten theorem"?)

After reading your post, I kept trying to find some numbers of how many neurons are equivalent to an additional layer, but came up empty.

I think the problem is basically that each additional layer contributes superlinearly to "complexity" (however you care to measure that). Please correct me if I'm wrong, I would say this point is my crux. If we are indeed in a territory where we have available transistor counts comparable to a "single-hidden-layer-perceptron-brain-equivalent", then I would have to revise my opinion.

I'm personally very interested in this highly parallel brain architecture, and if I could, I would work on ways to investigate/build/invent ways to create similar structures. However, besides self-assembly (as in living growing things), I don't yet see how we could build things of a similar complexity in a controlled way.

[-]Bernhard4y30

Just for completeness, I found [this paper](http://dx.doi.org/10.1016/j.neuron.2021.07.002), where they try to simulate the output of a specific type of neuron, and for best results require a DNN of 5-8 layers (with widths of ~128)

[-]lsusr4y30

I think the problem is basically that each additional layer contributes superlinearly to "complexity" (however you care to measure that). Please correct me if I'm wrong, I would say this point is my crux.

Do you mean that each additional layer contributed too much to hypothesis space entropy?

[-]Bernhard4y10

I guess that would be one way to frame it. I think a simpler way to think of it (Or a way that my simpler mind thinks of it) is that for a given number of parameters (neurons), more complex wiring allows for more complex results. The "state-space" is larger if you will.

3+2, 3x2 and 3² are simply not the same.

From my limited knowledge (undergraduate-level CS knowledge), I seem to remember, that typical deep neural networks use a rather small number of hidden layers (maybe 10? certainly less than 100?? (please correct me if I am wrong)). I think this choice is rationalized with "This already does everything we need, and requires less compute"

To me this somewhat resembles a Chesterton's fence (Or rather its inverse). If we were to use neural nets of sufficient depths (>10e3), then we may encounter new things, but before we get there, we will certainly realize that we still have a ways to go in terms of raw compute.

[-][anonymous]4y10

I agree, although I doubt the brain algorithm will end up at the core of AGI.

LESSWRONG
LW

LESSWRONG
LW

31

[Prediction] We are in an Algorithmic Overhang

31

31