What is control theory, and why do you need to know about it?

I have not studied control theory, but I think a PID controller may be the Bayes-optimal controller if:

the system is a second-order linear system with constant coefficients,
the system is controllable,
all disturbances in the system are additive white noise forcing terms,
there is no noise in perception,
the cost functional is the integral of the square of the error,
the time horizon is infinite in both directions (no transients), and
the prior belief distribution over possible reference signals is the same as if the reference signal was a Brownian motion (which needs first-order control) plus an integral of a Brownian motion (which needs second-order control).

What makes it a full Bayesian decision problem is the prior belief distribution over possible reference signals. At each time, you don't know what the future reference signal is going to be, but you have a marginal posterior belief distribution over possible future reference signals given what the reference signal has been in the past. Part of this knowledge about possible future reference signals is represented in the state of the system you have been controlling, and part of it is represented in the state of the I element of the controller. You also don't know what the delayed effects of past disturbances will be, but you have a marginal posterior belief distribution over possible future delayed effects given what the perception signal has been in the past. Part of this knowledge is also represented in the system and in the I element. (Not all of your knowledge about possible future reference signals and possible future delayed effects of past disturbances is represented, only your knowledge about possible future differences between them.) This representation is related to sufficient statistics ("sufficiency is the property possessed by a statistic . . . when no other statistic which can be calculated from the same sample provides any additional information") and to updating of the parameter for a parametric family of belief distributions.

In a real engineering problem, the true belief about expected possible reference signals would be more specific than a belief of a random Brownian motion. But if a reference signal would not be improbable for Brownian motion, then a PID controller can still do well on that reference signal.

I think these conditions are sufficient but not necessary. If I knew control theory I would tell you more general conditions. If the cost functional has a term for the integral of the squared control signal, then a PID controller may not be optimal without added filters to keep the control signal from having infinite power.

Example 6.3-1 in Optimal Control and Estimation by Robert F. Stengel (1994 edition, pp. 540-541) is about PID controllers as optimal regulators in linear-quadratic-Gaussian control problems.

I see optimal control theory as the shared generalization of Bayesian decision networks and dynamic Bayesian networks in the continuous-time limit. (Dynamic Bayesian networks are Bayes nets which model how variables change over discretized time steps. When the time step size goes to zero and the variables are continuous, the limit is stochastic differential equations such as the equations of Brownian motion. When the time step size goes to zero and the variables are discrete, the limit is almost Uri Nodelman's continuous-time Bayesian networks. Bayesian decision networks are Bayes nets which represent a decision problem and contain decision nodes, utility nodes, and information arcs.)

[-]Steve_Rayhawk17y10

(Not all of your knowledge about possible future reference signals and possible future delayed effects of past disturbances is represented, only your knowledge about possible future differences between them.)

So this isn't a sufficient statistic, it's only a sufficient-for-policy-implications statistic. Is there a name for that?

[-]Eliezer Yudkowsky17y-20

All "sufficient" statistics are only "sufficient" for some particular set of policy or epistemic implications. You could always care about the number of 1 bits, if you're allowed to care about anything.

[-]Steve_Rayhawk17y20

Then every "sufficient-for-policy-implications" statistic can become a "sufficient-for-implications-for-beliefs-about-the-future" statistic, under a coarsening of the sample space by some future-action-preserving and conditional-ratios-of-expected-payoff-differences-preserving equivalence relation?

(Would we expect deliberative thinking and memory to physically approximate such coarsenings, as linear controllers do?)

[-]Richard_Kennaway17y00

Thank you for those references -- exactly the sort of thing I've been looking for.

[-]gjm17y190

I like this perspective.

Control systems win while being arational. Either explain this in terms of Bayescraft, or explain why there is no such explanation.

This (from Richard's post) seems to me very much parallel to this (which I just made up):

Cricketers and baseball players win at ball-catching while knowing nothing about Newtonian mechanics, fluid dynamics, or solving differential equations. Either explain this in terms of physics, or explain why there is no such explanation.

Anyone who says anything close to "Cox's theorem; therefore you must make your decisions by making Bayesian calculations" is broken. But it could still be reasonable to say "However you make your decisions, the results should be as close as you can make them to those of an ideal reasoner doing Bayesian calculations on the information you have". I don't see any contradiction, or even any tension, here. As for an actual specific explanation that matches the facts, that would seem to need to be done afresh for every control system that works; for some cases (like our brains) the answers might be unmanageably complicated.

Do Bayesian processes have anything to do with the mechanism of brains?

In the same sense as differential equations have something to do with the mechanism of people catching balls: when brains function well at maintaining reasonable beliefs, on some level of abstraction they have to act at least a little bit like Bayesian systems. But there needn't be anything in the mechanisms that resembles the form (as opposed to the output) of the idealizations.

Does Bayescraft necessarily have anything to do with the task of building a machine that [...]

Since we might be able to do that by building a very big very low-level model of an entire human brain, without any understanding at all of what's going on, obviously in some sense the answer is no. But if you want to understand what you're doing -- well, how much physics do you need to know if you want to get a space probe to Neptune? My guess is that even if you do it by making something that you launch into space at random and that then goes looking heuristically for something that might be Neptune, the chances are you're going to want quite a lot of physics while you're designing it.

[-]Richard_Kennaway17y50

The ball-catching example is interesting, as it's another control problem, and has been studied as such. The fielder must get to where the ball will land. The predictive method would be to look at the ball, estimate its trajectory, then go to where you predict it will come down. This will not be very effective, because you cannot estimate the trajectory well enough. Instead, one method that will work is to move so as to maintain the direction from yourself to the ball constant in both azimuth and elevation. This is a control task, akin to the cursor-tracking task I discussed in the posting. You just have to move faster or slower and vary your direction, in whatever way will keep the direction constant. (The reason this works is that if the direction is constant, the ball is moving directly towards you in the frame of reference that moves with you. Or directly away, but in that case you won't be able to run fast enough to catch it.)

Devise such a control model, put in some parameters, add the physics of flying balls, solve the differential equations, and compare the results to the performance of actual fielders, and you have explained it in terms of physics.

How would Jeffreyssai analyse a PID loop?

[-]gjm17y70

The ball-catching example is interesting, as it's another control problem [...]

That's why I chose it.

How would Jeffreyssai analyse a PID loop?

I decline to speculate on the internal workings of someone who is (1) fictional and (2) much cleverer, or at least better-trained in relevant areas, than me. But a generic Bayesian rationalist might say something like this:

"My goal is to have my beliefs track the range of possible futures. The mathematics of probabilistic inference founded by Bayes, Lagrange, Jaynes, etc., tells me that if I do this then the dynamics of my belief-updating must satisfy certain equations. Unfortunately, my brain is not fast enough, precise enough, or reliable enough to do that in real time, so I'd better look for some more tractable approximation that will produce results similar enough to those of updating according to the equations. Hmm, let's see ... scribble scribble ... hack hack ... scribble hack think scribble think scribble ... OK, it turns out that in this special case, I can do well enough by just keeping track of the expected value minus half the standard deviation, and (provided things change in roughly the way I expect them to) that quantity satisfies this nice simple differential equation, which I can approximate with a finite-difference equation; so, simplifying a bit, it turns out that I can do a decent job by updating my estimate like so. [At which point he has written down the guts of a PID controller.] Unfortunately, that only gives me a point estimate; fortunately, if all goes according to plan my optimal posteriors are all of roughly the same shape and if I really have to I can get a decent approximation to the other parameter I need by doing this... [He writes down the coefficients for another PID controller.] I've had to make some assumptions that amount to having a prior with zero probabilities all over the place, which is ugly. Perhaps there's some quantity I can keep track of that will stay close to zero as long as my model holds, but that has no reason to do so if the model's wrong. ... scribble scribble think scribble hack ... Well, it's not great, but if I also compute this and this, then while my underlying assumptions hold they should be very close to equal, so a lower bound on Pr(the model is broken) is such-and-such, so let's watch that as well."

Of course all the actual analysis is missing here. That would be because "a PID loop" can describe a vast range of different systems. And I'm assuming that our hypothetical rationalist knows enough about the relevant domain to be able to do the analysis, because otherwise your question seems a bit like asking how Jeffreyssai would do the biological research to know that he could take Pr(evolution) to be very close to 1. (Answer: He doesn't need to; other people have already done it.)

(I have the feeling that one of us is missing the other's point.)

[-]SilasBarta17y50

RichardKennaway, very interesting post. I actually specialized in control theory in graduate school, but didn't finish the program. I must object to what you've said here, in that control theory most certainly does make extensive use of Bayesian inferenence, under the name of the Kalman filter.

The Kalman filter is a way of estimating the paramaters of a system, given your observations and your knowledge of the system's dynamics. While it may not help you pick a good control input algorithm, and while the problems you listed there may not need such accurate estimation of the data, it is an integral part of finding out how much the system deviates from where you want it to be, and is used extensively in controls.

[-]Cyan17y140

A couple of points. First, you've described only feedback control systems -- you've omitted control systems with feedforward components. Feedforward systems have another signal, the perturbation signal, in addition to the current state and target state signals. (Pure feedforward systems which have only the perturbation signal are also possible provided the process is extremely well-modelled -- which is to say, almost never.) Feedforward control systems see a lot of use in chemical engineering, where PID control may not be sufficient to satisfy the design specs. Information about a feedstream is extremely useful for keeping a chemical process at steady state.

Second, the ability of human eyes to track moving objects (while our own heads are also moving!) is a pure control problem with a solution implemented in neurons. Provided I understood correctly what Mimi Galiana taught me, I should point out that our object-tracking abilities aren't based on PID control -- they're based on a (consciously inaccessible but) explicit neural prediction circuit.

In short, the presented view is rather incomplete. There's a limit to how good you can do with PID control; beyond that, you need more information. That said, constraining the future is all about control (or vice verse?), and I think the connection between control theory and rationality is important.

[-]Tom_Talbot17y30

Coincidentally, today I was reading an interesting paper about forward and inverse models in the cerebellum. Here's a quote:

Humans demonstrate a remarkable ability to generate accurate and appropriate motor behaviour under many different and often uncertain environmental conditions. Considering the number of objects and environments, and their possible combinations, that can influence the dynamics of the motor system, the controller must be capable of providing approximate motor commands for a multitude of distinct contexts, such as different tasks and interactions with objects, that are likely to be experienced. Given this multitude of contexts, there are two qualitatively distinct strategies to motor control and learning. The first is to use a single controller that uses all the contextual information in an attempt to produce an appropriate control signal. However, such a controller would require enormous complexity to allow for all possible scenarios. If this controller were unable to encapsulate all the contexts, it would need to adapt every time the context of the movement changed before it could produce approximate motor commands - this would produce transient and possibly large performance errors. Alternatively, a modular approach can be used in which multiple controllers co-exist, with each controller suitable for one or a small set of contexts. Depending on the current context, only those appropriate controllers would be active to generate the motor command. While forward and inverse models could be learned by a single module, there are three potential benefits to employing a modular approach. First, the world is essentially modular, in that we interact with multiple qualitatively different objects and environments. By using multiple inverse models, each of which might capture the motor commands necessary when acting with a particular object or within a particular environment, we could achieve an efficient coding of the world. In other words, the large set of environmental conditions in which we are required to generate movement requires multiple behaviours or sets of motor commands, each embodied within a module. Secondly, the use of a modular system allows individual modules to adapt through motor learning without affecting the motor behaviours already learned by other modules. Thirdly, many situations that we encounter are derived from combinations of previously experienced contexts, such as novel conjoints of previously manipulated objects and environments. By modulating the contribution to the final motor command of of the outputs of the inverse modules, an enormous repertoire of behaviours can be generated. With as few as 32 inverse models, in which the output of each model either contributes or does not contribute to the final motor command, we have 2^32 or 10^10 behaviours - sufficient for a new behaviour for every second of one's life. Therefore, multiple internal models can be regarded conceptually as motor primitives, which are the building blocks used to construct intricate motor behaviours with an enormous vocabulary.

[-]Richard_Kennaway17y20

Thanks for that reference. For anyone who doesn't have access to a library subscribing to Trends in Cognitive Sciences, here's a copy that's free to access.

[-]Strange716y00

However, such a controller would require enormous complexity to allow for all possible scenarios. If this controller were unable to encapsulate all the contexts, it would need to adapt every time the context of the movement changed before it could produce approximate motor commands - this would produce transient and possibly large performance errors.

I've heard it said that when someone slips on a banana, the humor is closely connected to the way that normal walking movement continues into an inappropriate context. That sounds to me like a large performance error, and a brain is certainly complex.

[-]michael17y90

Isn’t a model of the outside world built in – implicit – in the robot’s design? Surely it has no explicit knowledge of the outside world, yet it was built in a certain way so that it can counteract outside forces. Randomly throwing together a robot most certainly will not get you such a behaviour – but design (or evolution!) will give you a robot with a implicit model of the outside world (maybe at some point one who can formulate explicit models). I wouldn’t be so fast and just throw away the notion of a model.

I find the perspective very intriguing, but I think of it more as nature’s (or human designer’s) way of building quick and dirty, simple and efficient machines. To achieve that goal implicit models are very important. There is no magic – you need a model, albeit one that is implicit.

[-]Richard_Kennaway17y50

Certainly I, as the designer, had a model of the robot and its environment when I wrote that program, and the program implements those models. But the robot itself has no model of its environment. It calculates the positions of its feet, relative to itself, by sensing its joint angles, knowing the lengths of its limb segments and calculating, so it does have a fairly limited model of itself: it knows its own dimensions. However, it does not know its own mass, or the characteristics of its sensors and actuators.

The fact that it works does not mean that it has an "implicit" model of the environment: "implicit", in a context like this, means "not". What is a model? A model is a piece of mathematics in which certain quantities correspond to certain properties of the thing modelled, and certain mathematical relationships between these correspond to certain physical relationships. Maxwell's equations model electromagnetic phenomena. The Newton-Raphson (EDIT: I meant Navier-Stokes) equation models fluid flow. "Implicit model" is what one says, when one expects to find a model and finds none. The robot's environment contains a simulated wind pushing on the robot, and a simulated hand giving it a shove. The robot knows nothing of this: there is no variable in the part of the program that deals with the robot's sensors, actuators, and control algorithms that represents the forces acting on it. The robot no more models its environment than a thermostat models the room outside it.

Since it is possible to build systems that achieve goals without models, and also possible, but in general rather more complicated, to build such systems that do use models, I do not think that the blind god of evolution is likely to have put models anywhere. It has come up with something -- us, and probably the higher animals -- that can make models, but nothing currently persuades me that models are how brains must work. I see no need of that hypothesis.

I'd rather like to build that robot. If I did, I would very likely use an onboard computer just to have flexibility in reconfiguring its control algorithms, but the controllers themselves are just PID loops. If, having got it to work robustly, I were to hard-wire it, the control circuitry would consist of a handful of analogue components for each joint, and no computer required. I still find it remarkable, how much it can do with so little.

[-]gjm17y20

The Newton-Raphson equation models fluid flow.

Er, I think you mean Navier-Stokes.

"Implicit model" is what one says, when one expects to find a model and finds none.

I think that's unfair. The notion of an implicit model (meaning something like "a model such that a system making use of it would behave just like this one") is a useful one; for instance, suppose you are presented with a system designed by someone else that isn't working as it should; one way to diagnose its troubles is to work out what assumptions about the world are implicit in its design (they might not amount to anything quite so grand as a "model", I suppose) and how they fail to match reality, and then -- with the help of one's own better model of the world -- to adjust the system's behaviour.

Or, of course, you can just poke at it until it behaves better. But then I'd be inclined to say that you're still using a model of the world -- you're exploiting the world's ability to be used as a model of itself. If a system gets "poked at until it behaves better" often enough and in varied enough ways, it can end up with a whole lot of information about the world built into it. If you don't want to call that an "implicit model", fair enough; but what's wrong with doing so?

[-]Strange716y20

Poking at it until it works isn't revising a model, in the same sense that walking toward the pole star when you want to go North isn't cartography.

[-]gjm16y00

I didn't say that poking at something until it works is revising a model, I said that it's using a model (in, doubtless, a rather trivial sense). And, if I'm understanding your analogy right, surely the analogous claim would be that walking (as nearly as possible given that one remains on the surface of the earth) towards the pole star isn't reading a map (even an "implicit" one), not that it isn't cartography; and I don't think that's quite so obvious. (Also: it seems to me that "maps" have more in common than "models", and I think that's relevant.)

[-]MrShaggy17y10

Could one argue the tuning by the programmer incorporates the relevant aspects of the model? (Which is what I think commenter meant by "implicit.") In my mom's old van, going down a steep hill would mess up the cruise control: as you say, if you push hard enough, you can over come a control loop's programming. So a guess as to relation to Bayescraft: certain real world scenarios operate within a narrow enough set of parameters enough of the time that one can design feedback loops that do not update based on all evidence and still work well enough.

[-]gjm17y10

nothing currently persuades me that models are how brains must work.

Who's saying that they are?

(And: Is what you're expressing skeptical about the idea that brains usually use models, or the idea that they ever do? I know that I use models quite often -- any time I try to imagine how something I do will work out -- and if it isn't my brain doing that, I don't know what it is.)

[-]MrHen17y10

I'd rather like to build that robot.

If you have not seen it yet, check out Ballbot. This video is it responding to a disturbance. I know nothing of its programming, but it acts as if it is using the same control systems you are describing.

Also, Beyond AI has a lot of discussion about how simple control structures may eventually work its way into building a general AI. I do not know if there is an online version hanging around, but if you are interested I can type up a summary article after the General AI topic ban is lifted.

In terms of your original post, another random example of simple control structures providing control over extremely complex systems would be video games. The controllers generally affect one thing and after my mind understands the movements I can guide a little soldier to kill other soldiers. I find that learning these control systems makes me a better driver, makes me better at operating small backhoes, or anything else that can be expressed in terms of simple control structures. An interesting side-topic to your article would be taking a look at how we control control structures and working to improve the feedback and response times. My talent for video games may be related to my intuitive ability to balance when walking on the curb or why I instinctively want to respond to a emotional tragedy by responding with a soft push toward emotional safety. "Fixing it all at once" is likely to overcorrect.

I am rambling now, but this article connected a few unassociated behaviors in my head. Cool.

[-]derekz17y10

For a continuation of the ideas in Beyond AI, relevant to this LW topic, see:

http://agi-09.org/papers/paper_22.pdf

[-]MrHen17y00

Thanks; added to reading list.

[-]JGWeissman17y60

Control systems win while being arational. Either explain this in terms of Bayescraft, or explain why there is no such explanation.

The control system a person uses to steer a car would fail if it were not calibrated by processing evidence in a manner idealized by Bayescraft. Knowing the correct amount to turn the wheel to correct a deviation of the perceived direction from the desired direction depends on one's previous experience turning the wheel, the evidence of how the car reacts to turning the wheel a given amount.

I often help to teach sailing classes, and I observe that inexperienced students have the problems with steering that would be expected for one unfamiliar with steering their boat. They are either too timid on the helm, allowing the boat to stay off course, or too aggressive, overshooting the desired course, and then over correcting again the other way. As they gain experience, that is, as they process the evidence of how the boat reacts to their use of the tiller, their control improves to the point that they can maintain their desired course. This is one reason we like students to start with smaller, more responsive boats, which give the evidence more quickly and obviously than larger boats that take time to react.

Control systems are useful, but they are useful because we use evidence to select the particular control system that wins.

[-]Tyrrell_McAllister17y10

Knowing the correct amount to turn the wheel to correct a deviation of the perceived direction from the desired direction depends on one's previous experience turning the wheel, the evidence of how the car reacts to turning the wheel a given amount.

That isn't the case with the control systems in the OP. A thermostat doesn't know how long it will need to stay on to reach the desired temperature from the current temperature. Even its designers didn't necessarily know that. It just

(1) turns on;

(2) checks the temperature

(3) stays on if still hasn't reached desired temperature; else turns off.

Moreover, it doesn't even learn from this experience. The next time it finds itself with exactly the same disparity between current and desired temperature, it will go through exactly the same procedure, without benefiting from its previous experience at all.

All that matters is that the system responds in a way that (1) approaches the desired state, and (2) won't overshoot---i.e., won't reach the desired state so quickly that the system can't turn off the response in time. These seem to be what were missing with your sailing students.

Edited to correct format

[-]JGWeissman17y10

That isn't the case with the control systems in the OP.

From the OP

If this was only about cruise controls and room thermostats, it would just be a minor conundrum. But it is also about people, and all living organisms.

My point was that features of the thermostat that the OP attempted to generalize to control systems used by people do not actually generalize. A thermostat is a simple system to solve a simple problem (though it still takes some evidence, that a given device cools or heats a room). A more complex problem requires a more complex solution, and more evidence to calibrate.

All that matters is that the system responds in a way that (1) approaches the desired state, and (2) won't overshoot---i.e., won't reach the desired state so quickly that the system can't turn off the response in time. These seem to be what were missing with your sailing students.

While technically true at a certain level of abstraction, that is just not helpful. The reason why the students do not approach the desired state, or overshoot, is important. If I just told them "approach the desired course, but don't overshoot", it would not help. They already know they want to do that, but not how to do that. I need to tell them more precisely how to use the tiller to do that. I tell them, "pull the tiller towards you, a little more ... now back in the center", and get them to observe the effect this has on the boat. It is after going through this exercise a few times that they are able to implement the control system themselves, and process higher level instructions.

[-]JamesAndrix17y00

(2) won't overshoot---i.e., won't reach the desired state so quickly that the system can't turn off the response in time. These seem to be what were missing with your sailing students.

But that's a a result of the high responsiveness of the furnace vs. the low responsiveness of the boat. You couldn't blindly let a thermostat control a boat or a missle, you would have to tune it. It some situations it might need to turn itself back off before it's input (heading) has noticeably changed.

[+]themightypuck16y-70

[-]Psychohistorian17y50

But there is no doubt that it is wrong. Completely, totally wrong. To this audience I can say, as wrong as theism.

I did not really see this being backed up, certainly not to a "wrong as theism" level. Much more importantly, it being wrong has little to do with you being right, any more than if Darwinian evolution is wrong, there must be an all-powerful father figure who cares deeply about what you do in the bedroom and if you eat meat on Fridays.

[-]Richard_Kennaway17y00

I was just trying to convey the degree of wrongness that I was claiming, not drawing any detailed connection between them.

[-]PhilGoetz17y40

This is 180 degrees around from the behavioural stimulus-response view, in which you apply a stimulus (a perception) to the organism, and that causes it to emit a response (a behaviour). I shall come back to why this is wrong below. But there is no doubt that it is wrong. Completely, totally wrong. To this audience I can say, as wrong as theism.

No. Absolutely not. Stimulus-response works, has worked reliably for 70 years, and we now know how some specific brain circuits encode stimulus-response learning.

Servos also work. Both have their uses.

[-]SilasBarta17y40

Richard's post is similar to something I was thinking about a few months ago. I tried to attack the problem of AI by looking at very simple systems that can be said to accomplish "goals" without all the fancy stuff that people typically think they have to put in AI, and and asking how that works.

For example, a mass hanging by a spring: it moves the mass back to its equilibrium position without doing the things listed in 2). But here, Richard is asking an easier question in 4), since he's asking about systems that are specifically designed to track some reference, rather than systems that happen to do it as a consequence of their other properties.

In that case, the answer (about how an arational system accomplishes the goals of rationality) is pretty simple: the system has been physically set up in a way that exploits the laws of nature to create mutual information between the system and its environment. If you view Bayescraft as a way to increase the mutual information between yourself (hopefully meaning the brain part!) and your environment, then the system is in fact doing that, so it is not arational. Its design implements Bayesian inference.

In the case of the thermostat, the temperature sensor, via heat transfer, becomes entangled with its environment, a natural process that happens to have an isomorphism to the Bayes Theorem. Then, something else senses the reading, causing another set of effents that determines what temperature air to blow out.

The next question is why this mutual information is such that it keeps the temperature within a specific range, rather than making it spiral out of control. The answer to that part, as others have mentioned, is that the person who set up the system, chose rules that happened to work. That required another kind of entanglement with the environment, which does not need to be done again during the operation of the thermostat.

Well, as long as the assumptions it's based on don't change too much...

[-]Richard_Kennaway17y30

Thanks for the discussion. At this point I'll write another top-level posting rather than make a dozen point-for-point replies. Also, some of the comments, mine included, have pressed a little too hard on the embargo against a certain topic.

Besides, the topic we do not speak of yet is one I'm not much interested in talking about at all, and will be avoiding as far as I can. It has its own forums. I wonder if the original reason for the embargo might justify keeping it permanently? This is intended to be a forum about human rationality, and talk of any other sort should only be incidental.

[-]MrShaggy17y30

I liked the Alien Space Bat description of a control system. The idea that our psychology is a collection of control systems, originated by a control engineer sounds like the cliche "if you're holding a hammer, everything look like a nail" and I don't know how the belief itself controls anticipation (http://www.overcomingbias.com/2007/07/making-beliefs-.html). So as of now, I still don't know why I need to know about control theory.

[-]i7717y20

Very interesting article. Yes, the controller is not intelligent but you have to factor in the designer. (I think this is something like a response to the Chinese Room argument). Just a few comments:

It has no model of its surroundings.

It has, a very simple one: the sign of the gain of the plant (steady-state).

It has no model of itself.

No, but its maker does: the transfer function of the controller.

It makes no predictions.

As in the first point: implicit in the design of the system is that temperature goes up with +1 output. If you flip the sign you get positive feedback and the system does not work as intended.

It has no priors.

Its designer knows some a priori things, like the typical time constant of the temperature trajectory and its range.

It has no utility function.

Maybe not a formal one, but you could build one with things like integrated squared error.

[-]Nelson_Flood12y00

Concerning your first point, that the designer has to hand-insert that all-important sign bit. So how do humans come up with these sign bits? I imagine a trial-and-error process of interacting with the controlled system. During this, the person's brain is generating an error signal derived directly or indirectly from an evolutionarily-fixed set point. While trying to control the system manually using an initially random sign bit, I suppose the brain can analyze at a low level in the hardware that the error is 1) changing exponentially, and 2) has a positive or negative slope, as the case may be. If the situation is exponential and the slope is positive, you synaptically weld the cortical representation of the controlled variable to the antagonist muscle of the one currently energized, and if negative, to the energized muscle itself. Bayesian inference would enter as a Kalman filter used to calculate the controlled variable. I suppose the process of acquiring the sign bit of the slope could not be separated from acquiring the model needed by the Kalman filter, so some kind of bootstrapping process could be involved. In his book "Neural Engineering..." (2004), Chris Eliasmith makes a case that the brain contains Kalman filters.

Is the evolutionary process responsible for the original hard-wired set point itself a controller? I doubt it, because, to use Douglas Adams' analogy, control principles to not seem to be involved in getting the shape of a puddle to match that of the hole it's in.

[-][anonymous]12y00

Concerning your first point, the designer has to hand-insert that all-important sign bit. So how do humans come up with these sign bits? I imagine a trial-and-error process of interacting with the controlled system. During this, the person's brain is generating an error signal derived over learning time by classical conditioning from an evolutionarily-derived hypothalamic error signal. While trying to control the system manually using an initially random sign bit, I suppose the brain can analyze at a low level in the hardware that the error is 1) changing exponentially, and 2) has a positive or negative slope, as the case may be. If the slope is positive, you synaptically weld the cortical representation of the controlled variable to the antagonist muscle of the one currently moving, and if negative, to the moving muscle itself. Bayesian inference would enter as a Kalman filter used to calculate the controlled variable. I suppose the process of acquiring the sign bit of the slope could not be separated from acquiring the model needed by the Kalman filter. In his book "Neural Engineering..." (2004), Chris Eliasmith makes a case that the brain contains Kalman filters.

Is the evolutionary process responsible for the original hard wired error signal itself a controller? I doubt it, because, to use Douglas Adams' analogy, control principles to not seem to be involved in getting the shape of a puddle to match that of the hole it's in.

[This comment is no longer endorsed by its author]Reply

[-]JulianMorrison17y10

Isn't a control system using feedback basically analogous to a look-up table? Feedbacks by themselves aren't optimizers, they're happenstance. Feedbacks that usefully seek a goal constitute the output of an optimization process that ran beforehand.

[-]Cyan17y00

Isn't a control system using feedback basically analogous to a look-up table?

Only in the sense that, say, the Lotka-Volterra equations are basically analogous to a look-up table. You'd be missing out if you thought that's all it was.

[-]JulianMorrison17y00

Those ones are happenstance. They just feed back, they're not going anywhere.

The analogy I mean is that like a LUT, the answer to any particular question is embodied in the pre-existing structure. And this correlation of response to result is optimized, it's not luck.

[-]Kaj_Sotala17y10

Interesting. I'm immediately reminded of the set points for happiness and weight. Also, Eliezer described the phenomenon in "It's okay to be a little irrational" as "removal of pressure -> removal of counterpressure -> collapse of irrationality". The pressure -> counterpressure -mechanism sounds like it may be related to this.

Extending it a bit further... often we seem to seek evidence that confirms our beliefs, not evidence that would actually challenge them. Wouldn't that, too, be an instance of trying to set one's perceptions at desired values?

[-]pjeby17y10

I'm looking forward to seeing more from you on this. NLP has a couple of bits and bobs of control theory in it, most notably the foundational ideas that the way to get a person to change (or any other result) is to be more flexible in your behavior than any other part of the system, and that you need to be able to measure yourself relative to a well-defined outcome. Even Robert Fritz's "creative process" books emphasize a concept of structural tension, which is the distance between a goal state and reality. My thoughts-into-action video is based on initiating internal measurement of the distance between a clean desk and a messy one, then standing back and letting the control system do its job.

Btw, while it isn't necessary for a control system to predict, remember, or model anything, in humans predictive modeling is an important part of the control system nonetheless. (See e..g the experiments that show humans can detect probability patterns without even having conscious awareness.)

Actually, risk homeostasis is another good example of a human control system that requires a predictive model in order to establish a set-point... heck, I imagine you can't even catch a ball unless you can predict where it's going to be.

Interesting anecdote: I recently read a Wired article about perception that mentions a professional pickpocket (entertainer/magician) who found that the way to have your hands be quicker than someone's eyes is to move your hands in a curve -- because if you move in a straight line, the person's eyes go to where your hands are going to be, rather than tracking where they are.

You could view all of these things as simply setting goals for a control system, but I find Hawkins' HTM model of the cortex more compelling from an evolutionary point of view. A design based on predictive memory control systems being "all the way down" is easier to evolve than one that has to have a bunch of collaborating components to produce the same behaviors, whereas a HTM-based cortex can just get bigger and add more layers. And at the early end of the evolutionary chain, incrementally adding memory/prediction to existing control systems is an equally incremental win -- i.e., "easy to evolve".

[-]MrHen17y10

I imagine you can't even catch a ball unless you can predict where it's going to be.

Mmm... does "you" mean a person or does "you" mean anything? Catching a ball can easily be done without predicting its final location and was discussed in a different thread.

[-]pjeby17y10

Mmm... does "you" mean a person or does "you" mean anything? Catching a ball can easily be done without predicting its final location and was discussed in a different thread.

That depends on what you mean by "predict". I don't mean a conscious prediction, I just mean a model that tells you how to get there. Even if that model is an algorithm, it's still a prediction.

Consider the ball player who runs to catch the ball, and then realizes he's not going to make it and stops trying. How is that not a prediction?

[-]MrHen17y00

I just mean a model that tells you how to get there.

Oh, okay. I misunderstood what you meant.

Consider the ball player who runs to catch the ball, and then realizes he's not going to make it and stops trying. How is that not a prediction?

That has little to do with what I was talking about. Something that "predicts" by thinking "If I am not holding the ball, move closer" has no concept of being able to "make it" to the landing spot. It couldn't care less where the ball ends up. All it needs to know is if it is currently holding the ball and how to get closer. The "how to get closer" is the predictor.

[-]pjeby17y10

That has little to do with what I was talking about. Something that "predicts" by thinking "If I am not holding the ball, move closer" has no concept of being able to "make it" to the landing spot. It couldn't care less where the ball ends up. All it needs to know is if it is currently holding the ball and how to get closer. The "how to get closer" is the predictor.

As I said, I understand you can make a control system that works that way. I'm just saying that humans don't appear to work that way, and possibly cortically-driven behaviors in general (across different species) don't work that way either.

Edit to add: see also the Memory-prediction Framework page on Wikipedia, for more info on feed-forward predictive modeling in the neocortex, e.g.:

The central concept of the memory-prediction framework is that bottom-up inputs are matched in a hierarchy of recognition, and evoke a series of top-down expectations encoded as potentiations. These expectations interact with the bottom-up signals to both analyse those inputs and generate predictions of subsequent expected inputs.

[-]MrHen17y00

I'm just saying that humans don't appear to work that way, and possibly cortically-driven behaviors in general (across different species) don't work that way either.

Yeah, this makes sense and that is why I asked the question about who "you" was.

Mmm... does "you" mean a person or does "you" mean anything?

[-]scientism17y00

If your approach isn't representationalist then why in the world would you maintain that we're "brains in vats" and have no direct access to the world? What do we have "access" to in lieu of the real world if there's no intervening model, representation, sense data, etc? Why not say it's the relative position of the car we're controlling rather than the "neural signals"? (How does one "see" a neural signal anyway?) It seems like your approach would be much more at home with direct realism.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

60

What is control theory, and why do you need to know about it?

60

60