[Valence series] 1. Introduction

[-]Seth Herd1y105Review for 2023 Review

This series explains why we like some things and not others, including ideas. It's cutting edge psychological theory.

To take the by now stereotypical action for me, here's a connection to Buddhism. There are a few passages in the Buddhist Canon where someone comes to the Buddha and asks for a really simple practice that will nonetheless take them far. One of the more interesting answers is that continuous 'mindfulness of vedana' will get you there. Vedana corresponds to the concept of valence in that it is posited as the positive, negative, or neutral quality of mental objects that appears to untrained perception as already bundled together with those mental objects.

[-]Morpheus1y50Review for 2023 Review

I found Steven Byrnes's valence concept really useful for my own thinking about psychology more broadly, and concretely when reading text messages from my contextualizing friend (when a message was ambiguous, guessing the correct interpretation based on valence worked surprisingly well for me).

[-]Seth Herd2y50

I'm excited to have this written up so clearly, nice work! I think this is important for alignment work in two ways: discourse and thinking about alignment are affected by powerful cognitive biases that this hypothesis explains; and, as you point out, we might build AGI that works like this, since it's so effective for human cognition.

I'm very curious if this "rings true" to other readers based on their introspection and observation of others' thinking patterns. I think this is both true and important. I'd arrived at this conclusion over the course of a research career studying dopamine and higher cognition. When we started researching cognitive biases, this came together, and I think this ubiquitous valence effect is the source of the most important cognitive biases. This goes by the names motivated reasoning, confirmation bias, and the halo effect; they have overlapping behavioral definitions. I think they're the major stumbling block to humans behaving rationally.

I think this hypothesis is consistent with a vast array of empirical work on dopamine function and related cognitive function. But the evidence isn't adequate to firmly establish that dopamine signals valence. That's part of why I'd never written this up adequately; the other part is that hypotheses this broad are outside the scope of standard neuroscience funding.

I'm looking forward to the rest of the series, and hoping the posts addressing cognitive biases generate some discussion about how those biases affect alignment discussions. I think the combination of motivated reasoning, confirmation bias, and the halo/horns effects creates powerful polarization that's a big obstacle to rational discussions of alignment.

[-]Gunnar_Zarncke2y40

Great post! And I was wondering what you meant by valence, but now it is clear.

I hope to write a longer comment later, but here is a short question:

Is there some neurobiological evidence of the valence channel also going into the cortex, roughly?

[-]S Benfield2y30

I claim that valence plays an absolutely central role in the brain

I believe you are right. I am working on a comprehensive theory that covers valence, emotional evaluation, and belief sets. I propose that it is fairly easy to predict emotional response when certain information is known: essentially, a binary tree of decision questions will lead to various emotions. The strength of the resulting emotions is based on a person's current valence toward action/inaction as well as the result of normal emotional evaluation. Valences are added to or subtracted from the final emotional evaluation. For example, something may produce happiness, but if your valence is very low (depression), you will discount it. Valence and emotional evaluation are feedback loops resulting in greater and greater inhibition of action or greater and greater drivers of action. At the extremes we call these depression and mania. I'm going to digest your valence series a bit more and will be publishing some of my thoughts soon. Although if you are interested, I would love to talk to you about them and possibly publish together. My knowledge of the mechanics of the brain is limited; I'm more of an algorithm/pattern person and only need enough detail to form my hypothesis. I don't live in details like most. I abstract very quickly and that is where I play and think.

[-]Steven Byrnes2y20

I’m probably not interested in coauthoring but I’ll be interested to read your ideas! :) Let me know when you publish anything so I don’t miss it (steven.byrnes@gmail.com).

[-]S Benfield2y10

Thanks. I take that as encouragement to hurry the f*** up.

Have you considered the fact that emotional evaluation comes at a high cost? It takes energy to evaluate the actual emotion as well as the valence. And it is all situational, of course, because to do an emotional evaluation of a moment, you need to take in beliefs/thoughts as well as sensory input. Your model doesn't point that out enough. The human brain grew from the brainstem/limbic system to the cortex AND the motor cortex. Our CNS is part of our brain, period. And it all works on valence. The actions you take are informed by valence.

So the brain has to take beliefs and current input and evaluate it. Now, how much energy do you think that evaluation takes? And the higher the valence, the higher the urgency of your action/intents.

In the end, yes, the brain is an RL model. However, how is emotional valuation conducted? What brings back the decision for action? You say it is a sum total of micro valences. And it is. Each micro valence is made up of binary decisions about self, other, the topic. But what about the possible actions to take and the predicted benefit of each? That is for my paper.

So I will say that you have the gist of the valence model correct as I see it. And because you published it first, I will ensure that I incorporate what you've put together in my final model. I am working with a neuropsychologist on it and we plan to publish sometime this year. She is working on some experiments we can do to back up the paper's claims.

[-]M. Y. Zuo2y30

The brain has a model—If I go to the toy store, I expect to be able to buy a ball.

This is clearly not true for edge cases, such as when the corpus callosum is severed and the left and right hemispheres cannot reconcile with each other.

At best it can be said each hemisphere 'has a model'.

[-]Steven Byrnes2y20

I think, when I say “model”, I have in mind something very broad like “a model is a thing that can be used for predictions, and is trained specifically to be good at predictions, e.g. by self-supervised learning”, and when you read the word “model”, you have in mind something very narrow, maybe “a model is something that is just like the model in AlphaZero or other such ML papers”.

For example, I can ask you “what will happen if I do X?” and you might say “If you do X, then Y will happen … oh wait, maybe Z will happen … umm, I’m not sure”. That would never happen in the “model” of AlphaZero. The “model” of AlphaZero takes in actions (moves) and spits out a board position, and this answer is clean and unique and (in the case of AlphaZero but not MuZero) guaranteed-to-be-correct. Obviously the kind of “model” built by the brain is not like that. Sometimes it issues somewhat-self-contradictory predictions and so on.

The thing you mention about split-brain patients is an extreme version, but I think it’s on a continuum with more mundane things like “if I think about it in this way, I predict X, and if I think about it in a different way, I predict Y”. Nevertheless, we are obviously able to make good predictions about the future, and we do so a zillion times a day—“I’m going to walk to the light-switch and flip it off” involves a model-based prediction that we are capable of straightforwardly walking to the light-switch and switching it off, and that if we do so, the switch will stay off and the room will be dark.

Those kinds of predictions (I claim) have all the properties that make it “a model” in my book: what we expect is not always what we want, and what we expect is much more likely to actualize than chance, and mistaken expectations tend to lead to model updates in a direction that will reduce the error in similar situations in the future. Yes it’s kinda messy, like sometimes your temporal lobe can’t reach perfect consensus with your parietal lobe, or your left hemisphere with your right hemisphere, and sometimes “what we expect” has other kinds of self-inconsistencies, etc. But it’s still definitely “a model”, in the (broad) way I use the term. :)
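To make the last of those properties concrete, here is a minimal sketch of "mistaken expectations lead to updates that reduce the error next time". It is only an illustration under loud assumptions: the function names, the learning rate, and the use of a simple delta rule are all hypothetical choices for the example, not a claim about how the brain (or the post's model) actually computes anything.

```python
# Illustrative sketch only: a "model" in the broad sense, i.e. a thing that
# predicts and is updated by its own prediction errors.
# All names and the delta-rule update are hypothetical, chosen for this example.

def update(prediction: float, outcome: float, learning_rate: float = 0.5) -> float:
    """Move the prediction part of the way toward the observed outcome."""
    error = outcome - prediction
    return prediction + learning_rate * error


def train(outcomes, initial_prediction: float = 0.0, learning_rate: float = 0.5):
    """Run through a sequence of outcomes, recording the error before each update."""
    prediction = initial_prediction
    errors = []
    for outcome in outcomes:
        errors.append(abs(outcome - prediction))
        prediction = update(prediction, outcome, learning_rate)
    return prediction, errors


# Hypothetical scenario: every time the switch is flipped off, the room ends up dark (1.0).
final_prediction, errors = train([1.0] * 5)
print(errors)            # each error is half the previous one
print(final_prediction)  # the expectation approaches the observed outcome
```

The point of the sketch is just that the expectation is separate from any "wanting", it tracks what actually happens, and each mistake shrinks the next one. A messier model could issue several inconsistent predictions and still fit this description.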

[-]S Benfield2y10

The brain has a model, an overarching one. At best it can be said that the entire brain has a model. Now, that model includes both hemispheres: redundancy for one, but there are also just too many things to do, and the need for many clusters of neurons. It is still true for edge cases like you said: when there is a severed corpus callosum, the model is still there. You've just severed the highest-level connection, a physical act that doesn't change the fact that the brain has a model it is working with.

[-]M. Y. Zuo2y10

It is still true for edge cases like you said--in that case, when there is a severed corpus callosum, the model is still there.

Huh? How is the model 'still there' for someone with a severed corpus callosum?

As far as I'm aware it doesn't grow back within a normal human lifespan...

[-]Paradiddle2y30

Enjoyable post, I'll be reading the rest of them. I especially appreciate the effort that went into warding off the numerous misinterpretations that one could easily have had (but I'm going to go ahead and ask something that may signal I have misinterpreted you anyhow).

Perhaps this question reflects poor reading comprehension, but I'm wondering whether you are thinking of valence as being implemented by something specific at a neurobiological level or not? To try and make the question clearer (in my own head as much as anything), let me lay out two alternatives to having valence implemented by something specific. First, one might imagine that valence is an abstraction over the kind of competitive dynamics that play out among thoughts. On this view, valence is a little like evolutionary fitness (the tautology talk in 1.5.3 brought this comparison to mind). Second, one might imagine that valence is widely distributed across numerous brain systems. On this view, valence is something like an emotion (if you'll grant the hopefully-no-longer-controversial claim that the neural bases of emotions are widely distributed). I don't think either of these alternatives is what you are going for, but I also didn't see the outright claim that valence is something implemented by a specific neurobiological substrate. What do you believe?

[-]Steven Byrnes2y50

Thanks!

I think in much much simpler animals, valence is a literal specific signal in the brain, basically the collective spiking activity of a population of dopamine neurons. In mammals, that’s still sorta-close-to-true, but I would need to add a whole bunch of caveats and footnotes to that, for reasons hinted at in §1.5.6–1.5.7.

(I have a bunch of idiosyncratic opinions about what exactly the basal ganglia is doing and how, but I don’t want to get into it here, sorry!)

I reject both the “first” and the “second” thing you mention. I’m much closer to “valence is pretty straightforwardly encoded by spikes going down specific known axons”.

Separately, I might or might not agree with “the neural bases of emotions are widely distributed”, depending on how we define the word “emotions” (and also how we define “neural bases”, I suppose!), see here.

[-]S Benfield2y10

I don't know if I buy that valence is based on dopamine neurons, but I do believe valence is the delta between current state and possible future state, very much like action potential or potential energy. If one possible outcome could grant you the world, then you will have a very high valence to do the actions needed. Likewise, if your life is on the line, that is very high valence. That turns anger to rage. Unfortunately, my model also says that too many positive thoughts lead to a race condition between dopamine generation and thought analysis that can lead to mania/psychosis. When you want things too much (desire) or too little (doubt/despair), the valences can get too high. And even evaluation of innocuous things can lead you to form emotions or actions out of line with the current evaluation. That is, valence does not go to zero easily. And the valence of now informs the valence of later. And I believe it is more like a 1/x function, so when you get to extremes of valence, the desire to act or desire to not act gets really high and is hard to overcome.

[-][anonymous]2y30

Very cool post! We need a theory of valence that is grounded in real neuroscience, since understanding valence is pretty much required for any alignment agenda that works the first time.


Definition for non-neuroscientists: “Final common pathway” is a term I like. Start with a typical example from the literature of someone using that term: “motor neurons [in the spinal cord]…are the final common pathway for transmitting neural information from a variety of sources to the skeletal muscles.” What that means is: There’s some signal going down the spine to the muscles, and that signal will completely control what the muscles will do. But upstream of that signal, there’s a lot going on! Lots of different systems in the brain are all contributing to that signal, and modulating it, in complicated ways.

By the same token, when I call valence a “final-common-pathway signal”, I’m saying that there’s one brain signal called “valence” (ignoring some fine print, see §1.5.7), and it’s a real signal, encoded by real neurons firing, and this signal has extraordinarily important impacts on the brain. But the fact that it’s just one signal does not imply that it’s calculated in a simple way by a single system. There’s a single point of departure of the signal, but that’s merely the last step of a complex calculation involving systems all over the brain.


Yes, at least in rodents, the hypothalamus seems to have an innate circuit that specifically tracks how many days it’s been since I felt the comforting touch of a friend or family member. See Liu et al. (2023).


1.1 Summary & Table of Contents

1.1.1 Summary & Table of Contents—for the whole Valence series

1.1.2 Summary & Table of Contents—for this first post in particular

1.2 Model-based reinforcement learning (RL)

1.3 Actor-critic RL, and “valence”

1.4 Terms closely related to “valence”

1.5 Clarifications

1.5.1 “Valence” (as I’m using the term) is a property of a thought—not a situation, nor activity, nor course-of-action, etc.

1.5.2 Valence (as I’m using the term) is different from “hedonic valence” / pleasantness

1.5.3 “We do things exactly when they’re positive-valence” should feel almost tautological

1.5.4 Valence is also part of the world model, and hence (confusingly) a valence can be either real or imagined

1.5.5 Valence is just one of many dimensions of conscious interoceptive experience

1.5.6 Fine print: Throughout this series, I’m only talking about the brain’s “main” reinforcement learning system

1.5.7 I’m sweeping some complexity under the rug

1.6 Conclusion