Behavior: The Control of Perception

[-][anonymous]11y60

Excellent post. I've been enjoying your series so far. Control theory feels useful in a "this is the key to everything" sort of way.

[-]Kaj_Sotala11y40

Unfortunately, I'm not an expert in this field, so I can't tell you what the state of the academic discussion looks like now. I get the impression that a number of psychologists have at least partly bought into the BCP paradigm (called Perceptual Control Theory) and have been working on their interests for decades, but it doesn't seem to have swept the field.

At least on a superficial level, the model reminds me somewhat of the hierarchical prediction model, in that both postulate the brain to be composed of nested layers of controllers, each acting on the errors of the earlier layer. (I put together a brief summary of the paper here, though it was mainly intended as notes for myself so it's not as clear as it could be.) Do you have a sense on how similar or different the models are?

[-]Vaniver11y20

Thanks for the paper! It was an interesting read and seems very relevant (and now I've got some reference chains to follow).

Do you have a sense on how similar or different the models are?

My impression is that if they describe someone as a cyberneticist, then they're operating on a model that's similar enough. First three sentences of the paper:

“The whole function of the brain is summed up in: error correction.” So wrote W. Ross Ashby, the British psychiatrist and cyberneticist, some half a century ago. Computational neuroscience has come a very long way since then. There is now increasing reason to believe that Ashby’s (admittedly somewhat vague) statement is correct, and that it captures something crucial about the way that spending metabolic money to build complex brains pays dividends in the search for adaptive success.

From my read of the rest of paper, the similarities go deep. Control theory is explicitly discussed in this section:

A closely related body of work in so-called optimal feedback control theory (e.g., Todorov 2009; Todorov & Jordan 2002) displays the motor control problem as mathematically equivalent to Bayesian inference. Very roughly – see Todorov (2009) for a detailed account – you treat the desired (goal) state as observed and perform Bayesian inference to find the actions that get you there. This mapping between perception and action emerges also in some recent work on planning (e.g., Toussaint 2009). The idea, closely related to these approaches to simple movement control, is that in planning we imagine a future goal state as actual, then use Bayesian inference to find the set of intermediate states (which can now themselves be whole actions) that get us there. There is thus emerging a fundamentally unified set of computational models which, as Toussaint (2009, p. 29) comments, “does not distinguish between the problems of sensor processing, motor control, or planning.” Toussaint’s bold claim is modified, however, by the important caveat (op. cit., p. 29) that we must, in practice, deploy approximations and representations that are specialized for different tasks. But at the very least, it now seems likely that perception and action are in some deep sense computational siblings and that:

The best ways of interpreting incoming information via perception, are deeply the same as the best ways of controlling outgoing information via motor action … so the notion that there are a few specifiable computational principles governing neural function seems plausible. (Eliasmith 2007, p. 380)

Action-oriented predictive processing goes further, however, in suggesting that motor intentions actively elicit, via their unfolding into detailed motor actions, the ongoing streams of sensory (especially proprioceptive) results that our brains predict. This deep unity between perception and action emerges most clearly in the context of so-called active inference, where the agent moves its sensors in ways that amount to actively seeking or generating the sensory consequences that they (or rather, their brains) expect (see Friston 2009; Friston et al. 2010). Perception, cognition, and action – if this unifying perspective proves correct – work closely together to minimize sensory prediction errors by selectively sampling, and actively sculpting, the stimulus array. They thus conspire to move a creature through time and space in ways that fulfil an ever-changing and deeply inter-animating set of (sub-personal) expectations. According to these accounts, then:

Perceptual learning and inference is necessary to induce prior expectations about how the sensorium unfolds. Action is engaged to resample the world to fulfil these expectations. This places perception and action in intimate relation and accounts for both with the same principle. (Friston et al. 2009, p. 12)

Basically, it looks like their view fits in with the hierarchical controls view and possibly adds burdensome details (in the sense that they believe the reference values take on a specific form that the hierarchical control theory view allows but does not require).

[-]majus11y30

The quote on conflict reminds me of Jaak Panksepp's "Affective Neuroscience: The Foundations of Human and Animal Emotions", or a refracted view of it presented in John Gottman's book, "The Relationship Cure". Panksepp identifies mammalian emotional command systems he names FEAR, SEEKING, RAGE, LUST, CARE, PANIC/GRIEF, PLAY; Gottman characterizes these systems as competing cognitive modules: Commander-in-chief, Explorer, Sentry, Energy Czar, Sensualist, Jester or Nest Builder. It is tempting now to think of them as very high-level controllers in the hierarchy.

[-]dvasya11y30

Here's another excellent book roughly from the same time: "The Phenomenon of Science" by Valentin F. Turchin (http://pespmc1.vub.ac.be/posbook.html). It starts from largely similar concepts and proceeds through the evolution of the nervous system to language to math to science. I suspect it may be even more AI-relevant than Powers.

[-]Vaniver11y00

Thanks for the link (which has the free pdf, for anyone else interested)! After a few months at being only at a book or two, my reading queue is up towards a dozen again, so I'm not sure when I'll get to reading it.

[-]msheehan11y20

In terms of robotics, BCP or PCT seems a lot like Rodney Brooks' Subsumption Architecture: Eliezer has written a not particularly favourable post about it. It was such an important idea, it formed the basis for nearly all robots for quite a long time. It is an idea he had in the 1980s when we was bitten by a mosquito in Indonesia while on holiday there I believe. At the time, all robots were programmed using rules, probably close to the lookup table approach you mention, and were very slow, and not particularly useful. Brooks' idea was to mimic the behaviour of very simple animals and work up to humans (bottom-up approach), rather than the other way around, which was try and create robots from logic given certain situations and rules (top-down approach). A summary of his ideas is this 1990 paper, Elephants don't play chess. I thought it would be interesting to introduce this idea here since subsumption in my mind links with BCP (or PCT) to an important sub-set of the AI world, robotics, which shows clear examples of the theory in practice. Just to let you know, the main problems that have been found with implementation of subsumption-style AI is that successive layers or heirachies of control get quite difficult to implement. Its major criticism has been although it has led to the ability to create robots that handle real-world environments very well, they tend to 'think' on the same level as insects; there are difficulties in implementing higher level thinking, i.e. logic and reasoning.

[-]Richard_Kennaway11y30

There is an important difference between hierarchical PCT and subsumption.

In subsumption, higher-level controllers operate instead of lower-level controllers. When a higher-level controller needs to do something, it overrides lower-level controllers in order to do it. The robot senses an obstacle, so the "walk forwards" code is suspended while the "avoid obstacle" code takes over driving the legs.

In HPCT, higher-level controllers operate by means of lower-level controllers. When a higher-level controller needs to do something, it does so by setting reference levels for lower-level controllers. When the robot encounters an obstacle, the reference for desired direction of motion is changed, and the walk controllers continue to do their job with a different set of reference signals. The obstacle-avoidance controller does not even need to know whether the robot is on legs or wheels, only that it can send a signal "go in this direction" and it will happen. Each layer of controllers, in effect, implements a set of virtual actuators for the next level up to use.

[-]SarahNibs11y20

Suppose I am in the presence of a bunch of data going this way and that into and out of a bunch of black boxes. What kind of math or statistics might tell me or suggest to me that boxes 2, 7, and 32 are probably simple control systems with properties x, y, and z? Seems I should be looking for a function of the inputs that is "surprisingly" approximately constant, and if there's a simple map from that function's output to states of some subset of the outputs then we've got a very strong clue, or if we find that some output strongly negatively correlates with a seemingly unrelated time series somewhere else that might be a clue... Anyone have a link to a good paper on this?

[-]Vaniver11y50

Seems I should be looking for a function of the inputs that is "surprisingly" approximately constant

I think in most situations where you don't have internal observations of the various actors, it's more likely that outputs will be constant than a function of the inputs. That is, a control system adjusts the relationship between an input and an output, often by counteracting it completely--thus we would see the absence of a relationship that we would normally expect to see. (But if we don't know what we would normally expect, then we have trouble.)

Anyone have a link to a good paper on this?

I'm leaning pretty heavily on a single professor/concept for this answer, but there's a phrase called "Milton Friedman's Thermostat," perhaps best explained here (which also has a few links for going further down the trail):

If a house has a good thermostat, we should observe a strong negative correlation between the amount of oil burned in the furnace (M), and the outside temperature (V). But we should observe no correlation between the amount of oil burned in the furnace (M) and the inside temperature (P). And we should observe no correlation between the outside temperature (V) and the inside temperature (P).

An econometrician, observing the data, concludes that the amount of oil burned had no effect on the inside temperature. Neither did the outside temperature. The only effect of burning oil seemed to be that it reduced the outside temperature. An increase in M will cause a decline in V, and have no effect on P.

A second econometrician, observing the same data, concludes that causality runs in the opposite direction. The only effect of an increase in outside temperature is to reduce the amount of oil burned. An increase in V will cause a decline in M, and have no effect on P.

But both agree that M and V are irrelevant for P. They switch off the furnace, and stop wasting their money on oil.

They also give another example with a driver adjusting how much to press the gas pedal based on hills here, along with a few ideas on how to discover the underlying relationships.

I feel like it's worth mentioning the general project of discovering causality (my review of Pearl, Eliezer's treatment), but that seems like it's going in the reverse direction. If a controller is deleting correlations from your sense data, that makes discovering causality harder, and it seems difficult to say "aha, causality is harder to discover than normal, therefore there are controllers!", but that might actually be effective.

[-]Richard_Kennaway11y60

If a controller is deleting correlations from your sense data, that makes discovering causality harder, and it seems difficult to say "aha, causality is harder to discover than normal, therefore there are controllers!", but that might actually be effective.

Yes, in the PCT field this is called the Test for the Controlled Variable. Push on a variable, and if it does not change, and it doesn't appear to be nailed down, there's probably a control system there.

I have an unpublished paper relating the phenomenon to causal analysis à la Pearl, but it's been turned down by two journals so far, and I'm not sure I can be bothered to rewrite it again.

[-]V_V11y20

I have an unpublished paper relating the phenomenon to causal analysis à la Pearl, but it's been turned down by two journals so far, and I'm not sure I can be bothered to rewrite it again.

arXiv?

[-]Richard_Kennaway11y00

arXiv?

I looked at arXiv, but there's still a gateway process. It's less onerous than passing referee scrutiny, but still involves getting someone else with sufficient reputation on arXiv to ok it. As far as I know, no-one in my university department or in the research institute I work at has ever published anything there. I have accounts on researchgate and academia.edu, so I could stick it there.

[-]IlyaShpitser11y40

I have never had any issues putting things up on the arXiv (just have to get through their latex process, which has some wrinkles). I think I have seen a draft of your paper, and I don't see how arXiv would have an issue with it. Did arXiv reject your draft somehow?

[-]Richard_Kennaway11y20

I haven't sent it there. I created an account on arXiv a while back, and as far as I recall there was some process requiring a submission from someone new to be endorsed by someone else. This, I think, although on rereading I see that it only says that they "may" post facto require endorsement of submissions by authors new to arXiv, it's not a required part of the submission process. What happened the very first time you put something there?

[-]satt11y20

(I know I'm not IlyaShpitser, but better my reply than no reply.) I have several papers on the arXiv, and the very first time I submitted one I remember it being automatically posted without needing endorsement (and searching my inbox confirms that; there's no extra email there asking me to find an endorser). If you submit a not-obviously-cranky-or-offtopic preprint from a university email address I expect it to sail right through.

[-]Richard_Kennaway11y20

Well, I've just managed to put a paper up on arXiv (a different one that's been in the file drawer for years), so that works.

[-][anonymous]11y20

Because they're so small, I feel like their policies can be really inconsistent from circumstance to circumstance. I've got a couple papers on arXiv, but my third one has been mysteriously on hold for some months now for reasons that are entirely unclear to me.

[-]alienist11y00

(I know I'm not IlyaShpitser, but better my reply than no reply.) I have several papers on the arXiv, and the very first time I submitted one I remember it being automatically posted without needing endorsement

How long ago was this? I believe the endorsement for new submitters requirement was added ~6 years ago.

[-]satt11y00

My first submission was in 2012. I'm fairly sure I read about the potential endorsement-for-new-submitters condition at the time, too.

[-]Lumifer11y20

SSRN?

[-]Vaniver11y00

I have an unpublished paper relating the phenomenon to causal analysis à la Pearl, but it's been turned down by two journals so far, and I'm not sure I can be bothered to rewrite it again.

I'd be interested in seeing it, if you don't mind! (My email is my username at gmail, or you can contact me any of the normal ways.)

[-]Richard_Kennaway11y20

That is, a control system adjusts the relationship between an input and an output, often by counteracting it completely--thus we would see the absence of a relationship that we would normally expect to see.

The words "input" and "output" are not right here. A controller has two signals coming into it and one coming out of it. What you above called the "output" is actually one of the input signals, the perception. This is fundamental to understanding control systems.

The two signals going into the controller are the reference and the perception. The reference is the value at which the control system is trying to bring the perception to. The signal coming out of the controller is the output, action or behaviour of the controller. The action is being emitted in order to bring the perception towards the reference. The controller is controlling the relationship between its two input signals, trying to make that relationship the identity. The italicised words are somewhere between definitions and descriptions. They are the usual words used to name these signals in PCT, but this usage is an instance of their everyday meanings.

In concrete terms, a thermostat's perception is (some measure of) the actual temperature. Its reference signal is the setting of the desired temperature on a dial. Its output or behaviour is the signal it sends to turn the heat source on and off. In a well-functioning control system, one observes that as the reference changes, the perception tracks it very closely, while the output signal has zero correlation with both of them. The purpose of the behaviour is to control the perception -- hence the title of William Powers' book, "Behavior: The Control of Perception". All of the behaviour of living organisms is undertaken for a purpose: to bring some perception close to some reference.

[-]Vaniver11y00

The words "input" and "output" are not right here.

Yeah, that paragraph was sloppy and the previous sentence didn't add much, so I deleted it and reworded the sentence you quoted. I'm used to flipping my perspective around a system, and thus 'output' and 'input' are more like 'left' and 'right' to me than invariant relationships like 'clockwise' and 'counterclockwise'-- with the result that I'll sometimes be looking at something from the opposite direction of someone else. "Left! No, house left!"

(In this particular case, the system output and the controller input are the same thing, and the system input is the disturbance that the controller counteracts, and I assumed you didn't have access to the controller's other input, the reference.)

[-]Arkanj3l11y00

Similar in theme is "Vehicles: Experiments in Synthetic Psychology" by Valentino Braitenberg, in that creating simple systems that aren't goal driven can nonetheless produce behavior that we characterize as emotional or thoughtful, somehow. It's more exploratory and illustrative than principled or conceptual, but should be a good read.

[+]Flextechmgmt11y-50

LESSWRONG
LW

LESSWRONG
LW

55

Behavior: The Control of Perception

55

55