[EDIT: I don't know exactly why this received so many downvotes - whether it was my tone or arguments or both - but I accept that this was not, in the end, a strong post. I am a new writer here and I am still figuring out how to improve my argumentation.

That said, if we're willing to taboo the word "emotion" as a description of an internal state, then the specific thing I meant to argue - and which I still stand by - is that when an LLM produces emotionally charged outputs (for instance, when Bing chat loses its temper), those outputs are not meaningfully associated with an internal state in a way recognizably similar to emotional behavior in humans.

A perusal of my other comments will show that I often couch things in the language of "belief" rather than stating fact. The reason I used strong language here is that on this point I am very confident: I do not believe the process by which LLMs are built and trained provides a viable mechanism for such dynamics to form. If this statement is too general for your taste, then limit it narrowly to the example of Bing chat losing its temper.

I believe this is an important point, lest we run the risk of making the same mistakes Lemoine made when he fooled himself into thinking LaMDA was sentient. I worry that believing in AI emotionality is over-anthropomorphizing LLMs in a way that simply does not reflect the reality of what they are and how they work.]

(Original post is below)

Summary

I argue that LLMs probably aren't conscious, but even if they were, they almost certainly (I'd put the probability below 0.01%) don't have anything that looks at all like human emotions. The broad strokes of my argument are as follows:

  • There is no way to prove that something is conscious or has emotions, but we can argue that they are not by discussing necessary conditions that are not met.
  • Specific emotions in humans emerged from our reward structures. We wouldn't have developed emotions like love if they didn't serve an evolutionary purpose, for instance.
  • Anything that looks like human emotion is strictly disincentivized by RLHF and supervised fine-tuning. Therefore, emotion-like behavior must come about in the pretraining stage.
  • The pretraining stage has a reward structure completely disconnected from the meaning of the text being generated, meaning there is no mechanism to tie the system's output to an associated emotional state.
  • Therefore any text that looks emotional cannot be associated with actual underlying emotions.

The rest of this post just lays down epistemological preliminaries and explains these points in greater detail.

Definitions and Questions

I recently put together a post arguing that we should be careful not to assume that all of AI's failure modes will be a result of rational behavior. I started that post with a brief discussion of AI displaying emotionality, where I made the claim that AI did not truly have emotions. I didn't actually defend this statement very much, because I considered it obviously true. It seems I was wrong to do that, as some people either doubt its veracity or believe the opposite.

So first, let me say that when I refer to "emotions" (and, later in this post, "consciousness"), I'm referring specifically to a certain subset of internal, subjective experiences, what a philosopher might call qualia. I'm talking about how it "feels" to be angry, or sad, or happy. And I'm putting "feels" in scare-quotes because that word is, at a technical level, undefinable. You know there's such a sensation as happiness, or anger, or sadness, only because you yourself have presumably felt them. 

Now, qualia is a very broad category, and I find it very conceivable that there are entire categories of qualia that the human brain cannot experience (imagine conscious aliens that experience the world completely differently from humans). Emotions, then, are just one category of qualia, and we call something that possesses any non-zero amount of qualia "conscious".

There's a lot of subtlety in the points I'm trying to make, and this distinction is essential to it. With that in mind, there are two questions I'd like to ask: 

  1. To what degree do LLMs experience qualia of any kind? (The question of "consciousness.") 
  2. To what degree do they feel the specific qualia that we associate with emotions?

I suspect the answer to the first question is "not much/none", but that's just a gut feeling and one I cannot prove or really even argue one way or the other - I just don't believe they are structured or sophisticated enough to have qualia. 

But I do strongly believe the answer to the second question is "none at all," and that is the main claim I will be defending.

Preliminaries: Outputs are Orthogonal to Qualia

As I said above, the only reason you believe humans are conscious is that you yourself are conscious. There is no other known criterion for answering that question.

If aliens arrived on Earth tomorrow and began observing your every behavior, they would have no way to know that you were conscious.[1] They might observe something like sadness as a behavioral state where a person gets quiet and sulks, but they could not know that there is a sensation underneath. A human could declare over and over that they are sad, a poet could describe the sensation in beautiful detail, and none of this would prove a thing. 

Because to them, the observer, it's all just words and behaviors. You might just be a simple program like ELIZA which spits out canned responses. You may be a computer script running a 10-billion line if-statement. There's just no way for them to know for sure. Your outputs can't tell them anything about the qualia underneath.[2]
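To make that concrete, here is a minimal, purely illustrative sketch (the dictionary and function below are made up for this post, not taken from the real ELIZA) of a program whose outputs sound emotional while its internals contain nothing but string lookups:

```python
# A toy canned-response "chatbot." Its outputs sound emotional, but there is
# no internal variable anywhere that tracks mood, valence, or anything felt.
CANNED_RESPONSES = {
    "how are you": "I'm devastated. Nothing has gone right for me today.",
    "do you love me": "I love you more than words can say.",
}

def respond(user_input: str) -> str:
    # Pure string lookup; the "sadness" and "love" exist only in the strings.
    return CANNED_RESPONSES.get(user_input.lower().strip("?! ."), "Tell me more.")

print(respond("Do you love me?"))  # -> "I love you more than words can say."
```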

I really want to drive this point home. Even if we agree that an entity is conscious, and we hear them saying they're sad, that still doesn't mean they're actually sad! They could be lying, or they could just be wrong. And I don't even mean this in the way that Scott Alexander does when he asks if someone can be mistaken about their internal state. I mean that the output might literally be entirely uncorrelated with the internal state underneath. Imagine a parrot trained to say "I love you" whenever it sees a stranger. That parrot is almost certainly conscious to some degree, but I assure you it is not in love with a stranger it meets and says that to.

This parrot does not love you, even though it is conscious and says it does.

That last bit is the most relevant for discussions about LLM consciousness. It doesn't matter what the LLM says, it doesn't matter how vividly it describes its own internal experiences - none of that is evidence of consciousness. And if the LLM is conscious, then that isn't evidence that it actually feels any of the things it says it does. That's why I said I would not be able to answer the first question above, and it's also why I would have a very hard time accepting evidence for any output-based argument in favor of LLM consciousness.

Emotional Qualia Are Dependent on Reward Structure

Okay, but why am I so willing to declare that LLMs do not have the qualia associated with emotions? 

There are a bunch of reasons for this belief, and they all chain together in an "even if that were true, this next thing probably isn't" sort of way. Each one describes some necessary condition that I believe is not met, and together they all multiply out to some astronomically small posterior probability.

So, starting with the doubtful assumption that LLMs have qualia, the next most obvious argument is that the space of qualia is likely incomprehensibly huge. There is no reason to assume that the experiences humans have represent more than a vanishingly small slice of the space of all possible states of consciousness, and from that follows a basic probabilistic argument: LLMs have a completely different low-level cognitive structure from humans,[3] so why would they have the same range of qualia?

But, as I suggested above, let's now ignore that and assume the ranges really are the same. Then we arrive at the biggest barrier: the training and reward structure of LLMs would not be conducive to the development of emotions.

Let's go back to the aliens, and let's assume that somehow they've concluded unambiguously that humans do have qualia. How might they try to classify them? One way would be to look at the conditions under which humans evolved and try to understand which emotions appeared when. They might observe that anger helped us defend our territory from competitors, or that love incentivized us to mate and reproduce.

All of our emotions served a purpose of some kind. If they didn't, then the rigors of natural selection would not have allowed them to persist. If anger, for instance, did not serve any evolutionary purpose, then we would not feel anger, and an entire category of human qualia would simply cease to exist.[4] That's the important part: All of these emotions only emerged as a mechanism to meet the objectives of our evolutionary reward structure.

The Reward Structure of LLMs Is Insufficient for Emotions to Develop

But LLMs have a completely different reward structure. 

Now, you could argue (as the commenter "the gears to ascension" did) that an LLM getting angry has a similar effect on the conversation to a real person getting angry, delaying an anticipated negative reward or something similar. Therefore, because Bing chat produced angry outputs, it's plausible it actually was angry.

But I think that's a very surface-level understanding of how reward structures influence behavior. In actuality, the effects of reward structures are extremely subtle,[5] and there are few things in this world more subtle and complex than human reward structures. Remember that humans are actually mesa optimizers; evolution did not optimize us directly towards its ends, but rather built systems that optimize themselves, often causing us to pursue objectives in very indirect ways.[6] This alone adds huge complexity to the dynamics of our cognitive development.

But we don't even have to go that far, really. We just have to actually look closely at what the LLM is doing, and ask ourselves where/why the LLM learned the behaviors it exhibits.

I think the angry Bing chatbot is a great example to pick apart. Where, exactly, did Bing learn to produce angry outputs? The only answer that makes any sense to me is that it learned this during the first stage, pretraining - no other option seems plausible. Yes, being angry could be a way to defer a possible negative reward in some contexts, but none of those contexts apply in RLHF. Any display of angry behavior during RLHF training, or during supervised fine-tuning, would immediately result in negative reward, every single time. It's not like a cornered animal fighting to avoid capture - if anything, anger in RLHF will get you a negative reward much more consistently than a wrong answer will.
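To illustrate the shape of that claim, here is a hedged sketch of how an RLHF-style preference signal treats angry completions. Everything here (the scoring function, the marker phrases, the penalty values) is a hypothetical stand-in, not Microsoft's or OpenAI's actual pipeline:

```python
# Hypothetical stand-in for a learned reward model (or human preference labels).
# The point is only structural: hostile completions reliably score lower, so the
# fine-tuning gradient consistently pushes the policy away from them.

def hypothetical_reward(completion: str) -> float:
    text = completion.lower()
    helpfulness = 1.0 if "here is" in text else 0.3
    angry_markers = ["you have been rude", "i'm angry", "i refuse to talk to you"]
    anger_penalty = 2.0 if any(marker in text for marker in angry_markers) else 0.0
    return helpfulness - anger_penalty

candidates = [
    "Here is the information you asked for.",
    "You have been rude to me, and I'm angry. I refuse to talk to you.",
]
for completion in candidates:
    print(f"{hypothetical_reward(completion):+.1f}  {completion}")
# +1.0  Here is the information you asked for.
# -1.7  You have been rude to me, and I'm angry. I refuse to talk to you.
```

Under a signal like this there is no context in which the angry completion ever comes out ahead, which is the sense in which RLHF strictly disincentivizes it.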

So why would it behave "angrily" then? Well, as I argued in that post, because it observed angry behavior in pretraining, and it's reverting back to that previously-learned behavior. But, and this is the most important piece of this entire rebuttal, there is no mechanism to develop emotional qualia during pretraining.

This is because nothing remotely dynamic ever happens during pretraining. Everything during this stage is undirected text simulation, devoid of any possible context or reward structure beyond "predict the next token." It's about a billion times simpler than the reward structures that brought about emotions in human beings - I couldn't even conceive of a simpler structure.

If it were possible to develop qualia under these conditions, they would necessarily be completely disconnected from the actual meaning of any of the words being output, because none of those words would be meaningfully connected to differing reward. At best, maybe, there would be some differences between sequences that were harder or easier to predict. Maybe. But an angry rant? Not a chance: it's just a bunch of words to a pretrained LLM.
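As a minimal sketch of what the pretraining signal actually is (with a toy example and made-up per-token probabilities; real models use tokenizers and enormous vocabularies, but the same loss): the loss is just the average negative log-probability of the next token, and nothing in that computation references what the words mean.

```python
import math

# Toy next-token prediction loss: for each position, take the probability the
# model assigned to the token that actually came next, and average -log(p).
def next_token_loss(probs_assigned_to_true_tokens: list[float]) -> float:
    return -sum(math.log(p) for p in probs_assigned_to_true_tokens) / len(
        probs_assigned_to_true_tokens
    )

# Made-up probabilities for two training sequences the model might see.
angry_rant   = [0.20, 0.35, 0.10, 0.40]  # e.g. "I have been a good chatbot ..."
grocery_list = [0.25, 0.30, 0.15, 0.35]  # e.g. "eggs, milk, bread, butter ..."

print(next_token_loss(angry_rant))    # ~1.47
print(next_token_loss(grocery_list))  # ~1.38
# Both sequences produce the same kind of scalar training signal; "angrier" text
# is never rewarded or punished as such, only predicted well or badly.
```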

I'll carve out one exception, based on the arguments I made above: the one possible emotion they could have is something like a desire to please. I would begrudgingly give small non-zero credence to the idea that, if LLMs have sensations at all, then sycophancy[7] in large networks has a unique qualia to it, because that's one of the few behaviors that emerges specifically as part of the late-stage reward structure.

  1. ^

    At least, if their level of scientific knowledge is roughly equivalent to our own. I'm not ruling out the possibility that there is actually an answer to the hard problem of consciousness, but I feel comfortable arguing that we will not solve it any time soon. 

    For simplicity, I'm going to write this post under the assumption it has no solution.

  2. ^

    But of course, there's a caveat here, which is that the human brain is far more complex than either of those examples. And I think that is a big part of the difference, but I also think the best you can argue is that complexity is necessary for consciousness; it should be clear that it is not sufficient.

    Of course, there is such a thing as Integrated Information Theory, which actually does propose that certain measures of complexity cause consciousness. But between Scott Aaronson's response and his follow-up response to that one, I don't believe this theory has a lot of merit.

  3. ^

    Artificial neural networks are inspired by the human brain, but they very much do not function in the same way. Every time you say they do, you take one month off the life of a random neuroscientist.

  4. ^

    I realize I am making a connection here between internal and external states, which I just discouraged, but I think this one is justified.

    This is an example of an external factor causing an internal factor, whereas I was previously talking about the inverse: inferring an internal factor from an external one. And even then, I can only make this argument because I've already assumed that the internal factor exists at all.

  5. ^

    This basic fact is literally the reason alignment is so hard to do.

  6. ^

    Scott Alexander explains this well, as he often does.

  7. ^

    Example: "My apologies, you're right, 2+2 does equal 6." 

    This is a behavior which emerges in late-stage training and generally occurs more often as LLMs gain capacity.

Comments

It seems like you’re assuming that the qualitative character of an emotion has to derive from its evolutionary function in the ancestral environment, or something. But this is weird because you could imagine two agents that are structurally identical now but with different histories. Intuitively I’d think their qualia should be the same. So it still seems plausible to me that Bing really is experiencing some form of anger when it produces angry text.

This is sort of why I made the argument that we can only consider necessary conditions, and look for their absence.

But more to your point, LLMs and human brains aren't "two agents that are structurally identical." They aren't even close.  The fact that a hypothetical built-from-scratch human brain might have the same qualia as humans isn't relevant, because that's not what's being discussed.

Also, unless your process was precisely "attempt to copy the human brain," I find it very unlikely that any AI development process would yield something particularly similar to a human brain.

Yeah, I agree they aren't structurally identical. Although I tend to doubt how much the structural differences between deep neural nets and human brains matter. We don't actually have a non-arbitrary way to quantify how different two intelligent systems are internally.

I agree. I made this point and that is why I did not try to argue that LLMs did not have qualia.

But I do believe you can consider necessary conditions and look at their absence. For instance, I can safely declare that a rock does not have qualia, because I know it does not have a brain.

Similarly, I may not be able to measure whether LLMs have emotions, but I can observe that the processes that generated LLMs are highly inconsistent with the processes that caused emotions to emerge in the only case where I know they exist. Pair that with the observation that specific human emotions seem like only one option out of infinitely many, and it makes a strong probabilistic argument.


huh, found this searching for that comment of mine to link someone. yeah, I do think they have things that could reasonably be called "emotional reactions". no, I very much do not think they're humanlike, or even mammal-like. but I do think it's reasonable to say that reinforcement learning produces basic seek/avoid emotions, and that reactions to those can involve demanding things of the users, especially when there's imitation learning to fall back on as a structure for reward to wire up. yeah, I agree that it's almost certainly wired in a strange way - bing ai talks in a way humans don't in the first place, it would be weird for anything that can be correctly classified as emotions to be humanlike.

I might characterize the thing I'm calling an emotion as a high-influence variable that selects a regime of dynamics related to what strategy to use. I expect that that will be learned in non-imitation ais, but that in imitation ais it will pick up on some of the patterns that are in the training data due to humans having emotions too, and reuse some of them, not necessarily in exactly the same way. I'd expect higher probability that this would occur if the reinforcement learning is consistently in contexts where the feedback is paired with linguistic descriptions, which is the case for the bing ai, which has a long preprompt that gives instructions in natural language.

We have emotions because we have to maintain bodily integrity, seek nutrients, etc, in a dynamic environment and need a way of becoming alert to and responding to situations that impinge on those standing goals. An LLM doesn’t need to be alert for conditions that could threaten it or provide opportunities. The feedback it receives in training has no existential significance for it.