Control Theory Commentary

It is 11:30 by my clock, so published on Wednesday, though perhaps not at the time I had in mind ;)

Thanks again to Ari Rabkin, Peter McCluskey, Christian Kleineidam, Carlos Serrano, Daniel Light, Harsh Pareek, and others for helpful comments on drafts.

An amusing typo I discovered while proofreading one of the new parts of this post: one sentence originally read "when presented with a choice, it is often better to make out than not make one." Words to live by.

[-]kpreid11y20

Another typo: first paragraph, “effect used” → “effort used”.

[-]Vaniver11y10

Fixed, thanks!

[-]pjeby11y70

FWIW, my enthusiasm over PCT has cooled considerably. Not because it's not true, just because it's gone from "OMG this explains everything" to just "how things work". It's a useful intuition pump for lots of things, not the least of which is the reason humans are primarily satisficers, and make pretty crappy maximizers. (To maximize, we generally need external positive feedback loops, like competition.)

(It's also a useful tool for understanding the difference between what's intuitive to a human and intuitive to an AI. When you tell a human, "solve this problem", they implicitly leave all their mental thermostats to "and don't change anything else". Whereas a generic planning API that's not based on a homeostatic control model implicitly considers everything up for grabs, the human has a hererarchy of controlled values that are always being kept within acceptable parameters, so we don't e.g. go around murdering people to use their body parts for computronium to solve the problem with. This behavioral difference falls naturally out of the PCT model.)

At the same time, I have seen that not everything is a negative-feedback control loop, despite the prevalence of them. Sometimes, a human is only one part of a larger set of interactions, that can create either positive or negative feedback loops, even if individual humans are mostly composed of negative-feedback control loops.

Notably, for many biological processes, nature doesn't bother to evolve negative control loops for things that didn't need them in the ancestral environment, due to resource limitations or competition, etc. If this weren't true, superstimuli couldn't exist, because we'd experience error as the stimulus increased past the intended design range. And then we wouldn't e.g. get hooked on fast food.

That being said, here's an example of something "self-help useful" about the PCT model, that is not (AFAIK) predicted by any other psychological or neurological model: PCT says that a stable control system requires that higher level controls operate on longer time scales than lower ones. More precisely, a higher-level perceptual value must always a function of lower-level perceptions sampled over a longer time period than the one those lower-level perceptions are sampled on. (Which means, for example, that if you base your happiness on perceptions that are moment-to-moment rather than measured over longer periods, you're gonna have a bad time.)

Another idea, stated in a lot of "traditional" self-help, is that you can't get something until you can perceive what it is. Some schools treat this as a hierarchical process, and a few even treat this as a formalism, ie., that your goal is not well-formed until you can describe it in terms of what sensory evidence you would observe when the goal is reached. And even my own "desk-cleaning trick", developed before I learned about PCT, is built on a perceptual contrast.

And speaking of contrast, the skill of "mental contrasting" is all the rage these days, and it's also quite similar to what PCT says about perceptual contrast. (Not to mention being similar the desk-cleaning trick.)

However, there's a slight difference between what PCT would predict as optimal contrasting, and what "mental contrasting" is. I believe that PCT would emphasize contrasting not with anticipated difficulties, but rather, with whatever the current state of reality is. As it happens, Robert Fritz's books and creativity training workshops (developed, AFAICT independently of PCT) take this latter approach, and indeed the desk-cleaning trick was the result of me noticing that Fritz's approach could be applied in an instantaneous manner to something rather less creative than making art or a business. (Again, prior to PCT exposure on my part.)

I would be interested to see experiments comparing "mental contrasting" as currently taught, with "structural tension" as taught by Fritz and company. I suspect that they're not terribly different, though, because one byproduct of contrasting the goal state and current state is a sudden awareness of obstacles and/or required subgoals. So, being told to look for problems may in fact require people to implicitly perform this same comparison, and being told to do it the other way around might therefore only make a small difference.

[-]Richard_Kennaway11y60

[PCT]'s gone from "OMG this explains everything" to just "how things work".

This is high praise.

[-]Vaniver11y40

FWIW, my enthusiasm over PCT has cooled considerably. Not because it's not true, just because it's gone from "OMG this explains everything" to just "how things work".

I'm agreed with Kennaway on this.

It's a useful intuition pump for lots of things, not the least of which is the reason humans are primarily satisficers, and make pretty crappy maximizers.

Technically, I disagree, because I want 'satisficer' to keep the original intended sense of "get X to at least this particular threshold value, and then don't worry about getting it any higher." I think controls point at... something I don't have a good word for yet, but 'proportioners' that try to put in effort appropriate to the level of error.

(An aside: I was at the AAAI workshop on AI and Ethics yesterday, and someone shared the story of telling people about their simulated system which proved statements like "if a person is about to fall in the hole, and the robot can find a plan that saves them, then the person never falls into the hole," and had their previous audience respond to this with "well, why doesn't the robot try to save someone even if they know they won't succeed?". This is ridiculous in the 'maximizer' model and the 'satisficer' model, but makes sense in the 'proportioner' model--if something needs to be done, then you need to try, because the effort is more important than the effect.)

[-]pjeby11y00

want 'satisficer' to keep the original intended sense of "get X to at least this particular threshold value, and then don't worry about getting it any higher." I think controls point at... something I don't have a good word for yet, but 'proportioners' that try to put in effort appropriate to the level of error.

And yet, that's what they do. I mean, get X to a threshold value. It's just that X is the "distance to desired value", and we're trying to reduce X rather than increase it. Where things get interesting is that the system is simultaneously doing this for a lot of different perceptions, like keeping effort expenditure proportionate to reward.

if something needs to be done, then you need to try, because the effort is more important than the effect.

I don't understand this. People put forth effort in such a situation for various reasons, such as:

Lack of absolute certainty the attempt will fail
Embarassment at not being seen to try
Belief they would be bad if they don't try

etc. It's not about "effort" or "effect" or maximizing or satisficing per se. It's just acting to reduce disturbances in current and predicted perceptions. Creating a new "proportioner" concept doesn't make sense to me, as there don't seem to be any leftover things to explain. It's enough to consider that living beings are simultaneously seeking homeostasis across a wide variety of present and predicted perceptual variables. (Including very abstract ones like "self-esteem" or "social status".)

[-]Vaniver11y10

Thinking about it more, maybe I should just use "controller" to point at what I want to point at, but the issue is that is a normal English word with many more implications than I want.

Creating a new "proportioner" concept doesn't make sense to me, as there don't seem to be any leftover things to explain.

Mathematically, there definitely is. That is, consider the following descriptions of one-dimensional systems (all of which are a bit too short to be formal, but I don't feel like doing all the TeX necessary to make it formal and pretty):

max x s.t. x=f(u)
min u s.t. x≥x_min, x=f(u)
u=-k*e, e=x-x_ref, y=f(u,x)

The first is a maximizer that tries to get x as high as possible, the second is a lazy satisficer that tries to do as little as possible while getting x above some threshold (in general, a satisficer just cares about hitting the threshold and not effort spent), the third is a simple negative feedback controller and it behaves differently from the maximizer and from the satisficer (approaching the reference asymptotically, reducing the control effort as the disturbance decreases).

My suspicion is that typically, when people talk about satisficers, they have something closer to 3 than 2 in mind. That is...

It's just acting to reduce disturbances in current and predicted perceptions.

Agreed. But that's not what a satisficer does (in the original meaning of the term).

[-]Richard_Kennaway11y30

humans are primarily satisficers, and make pretty crappy maximizers

Is there some reason they should be maximisers?

And then we wouldn't e.g. get hooked on fast food.

"What do you mean by 'we', paleface?" :)

How do you explain why some people do not get hooked on fast food? To me, what McDonalds and similar places serve does not even count as food. It is simply not my inclination to eat such things. I don't play computer games much either, to name another "superstimulus". I do not click on any link entitled "10 things you must..." This isn't the wisdom of age; the same is true of my younger adult selves (mutatis mutandis -- some of those things had not been invented in those days).

Ok, that's just me, but it's an example I'm very familiar with, and it always feels odd to see people going on about superstimuli and losing weight and the ancestral environment and the latest pop sci fads and observe that I am mysteriously absent, despite not being any sort of alien in disguise.

[-]Gunnar_Zarncke11y50

I'm quite happy to see PCT is a real thing. I always had trouble explaining my own mental model of behavior in traditional psychological terms and now I only need to point to PCT.

What I am missing is a treatment of the formation of the control loops. For the lower levels it is quite clear: These evolved. But what about the higher levels. I don't think the whole hierarchy is fixed. It is fixed only on the lower levels (and I hear that even there is some plasticity in weights). The higher you get the more variable the control loops become. Sure there must be some main controls meshing in desires and values but how do these attach to the higher control loops? I mean it's not like the top level control is some one-dimensional reward channel controlling fixed control loops for very abstract behaviors. This is only partly addressed by the treatment of conflict which implies multiple high level control signals.

We can look at a particular person and behavior and try to describe the control loops involved. But that doesn't answer how these came about.

Consider habits. Apparently it is possible to establish very complex habits. Habits are basically control loops on a level above sequential actions. But the control loop comes into being without being pre-wired. It realizes some more or less successful behavior sequence that results in the satisfaction of some higher level control.

For example how does automatically locking the door when leaving the house come into being? Sure it results in a feeling of security which is a higher level control - but that is not the reason the behavior evolved. The behavior (the control loop causing the locking of the door) initially doesn't exist as such. It started off with the chaining of the individual acts. But the brain is good at finding patterns also in its own behaviors and I see this locking the door as an aggregate that is pattern matched against the feeling of security after locking the door and again confirmed upon finding the door locked on return thus reinforcing the control loop as a whole that didn't exist before.

Thus it appears to me that the complex mesh of behaviors may well be a deep hierachy of nested control loops but especially the higher levels of the hierachy are to a large degree ad-hoc instantiations of recognized patterns in ones own behaviors aquired earlier or later. Many primary behaviors originate during child development many of which are very strongly related to neccessary development that come into being in mostly comparable or at least recognizable form for most people. This surely resutls partly from the way some controls are pre-wired (hard-wired curiosity for certain stimuli surely causes lots of early parallel development; I can definitely confirm this from experience of my own children).

But I often see strange behaviors and I then I wonder how these strange control loops came into being and how one might modify them for everyones benefit. The hierarchy really is a big messy graph. Lots of local control loops working hard to reduce their error signal only to be abandoned (getting silent) when their applicability pattern doesn't match any more.

There can be lots of control loops active at a time (conflict) and the steering effect of one loop can cause another loop to become active. Depending on how sensitive these loops are to circumstances the result may be not a chain of successor loops (which could be picked up and become a larger pattern) but cause a random or chaotic sequence of action.

Looking back I see this in arguments in relationships a lot (my own included). In a pair one incompatble habit leads to a (delayed) reaction by the other, say a kind of dissatisfaction response which then leads to one of multiple counter responses (multiple because it is aversive and if one control loop fails higher level control suggests that others might). Some may work in some circumstances. Each may be followed by multiple return responses. If enough compatibility exists the joint system will ultimately converge back (OK, it will almost always converge because in the end joint exhaustion becomes the dominant (and joint) control error).

I think one dark art lurking here is to find the patterns people's control loops match on and use these. This is related to framing. People behave different in different contexts. PCT-wise this means different control loops are active. Change the context and the behavior will follow.

Two examples:

My brother-in-low once applied geek-fu to deflect a threatening guy by saying: "cool sticker on your jacket, where did you buy it?" thus totally changing the frame and immediately relaxing the guy.
I have often trouble to prevent my recalcitrant son from wreaking havoc. Sometimes I succeed in changing the frame by pointing to some new thing, retreating and playing an interesting game with his brothers, asking him a question totally unrelated to the situation but inherently interesting to him.

[-]Kaj_Sotala11y20

Ideally, this is where I would exhibit some example that demonstrates the utility of thinking this way: an ethical problem that utilitarianism can't answer well but a control theory approach can, or a self-help or educational problem that other methods couldn't resolve and this method can.

So I'm not entirely sure whether this is actually correct, and I could be entirely off, but could the control theory approach be relevant for problems like:

If you have an unbounded utility function, it won't converge
If you have a bounded utility function, you may consider a universe with (say) 10^18 tortured people to be equally bad as a universe with any higher number of tortured people
Conversely, if you have a bounded utility function, you may consider a universe with (say) 10^18 units of positive utility to be equally good as a universe with any higher number of good things
If you do have some clear specific goal (e.g. build a single paperclip factory), then after that goal has been fulfilled, you may keep building more paperclip factories just in case there was something wrong with the first factory, or your sense data is mistaken and you haven't actually built a factory, etc.

Intuitively it seems to me that the way that human goal-directed behavior works is by some mechanism bringing either desirable or undesirable things into our mental awareness, with the achievement or elimination of that thing then becoming the reference towards which feedback is applied. This kind of architecture might then help fix problems 2-3, in that if an AI becomes aware of there existing more bad things / there being the potential for more good things, it would begin to move towards fixing that, independent of how many other good things already existed. Problem 4 is trickier, but might be related to there some being set of criteria governing whether or not possibilities are brought into mental awareness.

Does this make sense?

[-]Vaniver11y20

Does this make sense?

This does look like a fruitful place to look, but one of the main problems here with demonstrating superiority is that the systems can emulate each other pretty well. Claims of superiority typically take the form of "X seems more intuitive" or "I can encode X in less space using this structure" rather than "X comes to a different, better conclusion." For example:

If you have a bounded utility function, you may consider a universe with (say) 10^18 tortured people to be equally bad as a universe with any higher number of tortured people

You can have asymptotic bounds that "mostly" solve this problem, or at least they solve this problem about as well as a controller would.

For example, suppose my utility based on the number of people that are alive is the logistic function (with x0 set to, say, 1,000 or 1,000,000). Then I will always prefer a world where X1 people are alive to a world where X2 people are alive iff X1>X2, but the utility is bounded above by 1, and has nice global properties.

Basically, it smooths together the "I would like more people to be alive" desire and the "I would like humanity to continue" desire in a continuous fashion, such that a 50-50 flip that doubles the human population (and wealth and so on) on heads and eliminates them on tails looks like a terrible idea (despite being neutral if your utility function is linear in the number of humans alive). I'm not sure that the logistic has the local behavior that we would want at any particular population size, but something like it probably does.

The solution that a controller would apply to this is typically referring to "upper bound on control effort." That is, the error can be arbitrarily large, but at some point you simply don't have any more ability to adjust the system, and so having 1e18 more people tortured than you want is "just as bad" as having 1e6 more people tortured than you want because both situations are bad enough to employ your maximal effort trying to reduce the number. One thing about this approach is that the bound is determined by your ability to affect the world rather than your capacity to care, but it's not clear to me if that actually makes much of a difference, either mathematically or physically.

[-]Kaj_Sotala11y00

Thanks, that makes sense.

On the topic of comparing controllers to utility functions - how does a controller decide what kinds of probabilistic tradeoffs are worth making? For instance, if you have a utility function, it's straightforward to determine whether you prefer, say, a choice that creates X1 new lives with probability P_x1 and kills Y1 people with probability P_y1, versus a choice that creates X2 new new lives with probability P_x2 and kills Y2 people with probability P_y2. How does one model that choice in a control theory framework?

[-]Vaniver11y20

How does one model that choice in a control theory framework?

I see two main challenges. First, we need to somehow encode distributions, and second, we need to look ahead. Both of those are doable, but it's worth mentioning explicitly that the bread and butter of utility maximization (considering probabilistic gambles, and looking ahead to the future) are things that need to be built in the control theory framework, and can be built in a number of different ways. (If we do have a scenario where it's easy to enumerate the choice set, or at least the rules that generate the choice set, and it's also easy to express the preference function, then utility is the right approach to take.)

The closest to the utility framework is likely to wrap the probability distributions over outcomes as the states, and then the 'error' is basically a measure of how much one distribution differs from the distribution we're shooting for. Possible actions are probably fed into a simulator circuit, that spits out the expected distribution. It looks like we could basically express this problem as "minimize opportunity cost while pursuing many options," as if we ever simulate a plan and think it's better than our current best plan we replace the current best plan, but if we simulate a plan and it's not better than our current best plan we look for a new plan to simulate. (You'd also likely bake in some stopping criterion.)

So it would probably look at choice 1, encode the discrete pmf as the reference state, then look at choice 2, and decide whether or not the error is positive (which it responds to by switching to choice 2) or negative (which it responds to by acting on choice 1). But in order to compare pmfs and get a sense of positive or negative I need to have some mathematical function, which would be the utility function in the utility framework.

We also might notice that this makes it easy for endowment effect problems to creep in- if none of the options are obviously better than any of the other options, it would default to whichever one came first. On the flip side, it makes it easy to start working with the first mediocre plan we come across, and then abandon that plan if a better one shows up. That is, this is more suited to operating in continuous time than a "plan, then act" utility maximization framework.

[-]alienist11y00

Also, controllers are more robust then utility agents. Utility agents tend to go haywire upon discovering that some term in their utility function isn't actually quite well-defined. Keep in mind that it's impossible to predict future discoveries ahead of time and what their implications for the well-definiteness of terms might be.

LESSWRONG
LW

LESSWRONG
LW

27

Control Theory Commentary

27

27

History

Utility Theory

Previous Discussion on LW

Intelligent Action without Environmental Simulation

Preference Modeling and Conflicts

Reasoning About AI