Wishful thinking - believing things that make you happy - may be a result of adapting an old cognitive mechanism to new content.

Obvious, well-known stuff

The world is a complicated place.  When we first arrive, we don't understand it at all; we can't even recognize objects or move our arms and legs reliably.  Gradually, we make sense of it by building categories of perceptions and objects and events and feelings that resemble each other.  Then, instead of processing every detail of a new situation, we just have to decide which category it's closest to, and what we do with things in that category.  Most, possibly all, categories can be built using unsupervised learning, just by noting statistical regularities and clustering.
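
(For concreteness, here is a minimal sketch of the sort of unsupervised clustering I mean, assuming nothing but a pile of unlabeled feature vectors; the k-means-style algorithm and the toy data are just an illustration, not a claim about how the brain does it.)

    # Illustrative only: cluster unlabeled "perceptions" (feature vectors) by
    # similarity, with no teacher saying what the categories are.
    import random

    def kmeans(points, k, iters=20):
        """Toy k-means: return k centroids found by repeated assign/average."""
        centroids = random.sample(points, k)
        for _ in range(iters):
            clusters = [[] for _ in range(k)]
            for p in points:
                dists = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids]
                clusters[dists.index(min(dists))].append(p)
            for i, cluster in enumerate(clusters):
                if cluster:  # keep the old centroid if a cluster goes empty
                    centroids[i] = tuple(sum(dim) / len(cluster) for dim in zip(*cluster))
        return centroids

    # Two fuzzy clusters of 2-D "perceptions"; the algorithm recovers them unlabeled.
    points = [(random.gauss(0, 0.5), random.gauss(0, 0.5)) for _ in range(50)] \
           + [(random.gauss(5, 0.5), random.gauss(5, 0.5)) for _ in range(50)]
    print(kmeans(points, k=2))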

If we want to be more than finite-state automata, we also need to learn how to notice which things and events might be useful or dangerous, and make predictions, and form plans.  There are logic-based ways of doing this, and there are also statistical methods.  There's good evidence that the human dopaminergic system uses one of these statistical methods, temporal difference learning (TD).  TD is a backchaining method: first it learns which state or action G(n-1) usually comes just before reaching a goal G(n), then which G(n-2) usually comes just before G(n-1), and so on.  Many other learning methods use backchaining, including backpropagation, bucket brigade, and spreading activation.  These learning methods need a label or signal, during or after some series of events, saying whether the result was good or bad.
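
(A minimal sketch of tabular TD(0) on a toy chain of states, where only the last state is rewarded, shows the backchaining character: the value estimate for the state just before the goal converges first, then the one before that, and so on. The environment and parameters are invented for illustration.)

    # Illustrative TD(0): states 0..4 form a chain, and only reaching state 4
    # (the goal) is rewarded. Value estimates propagate backward from the goal:
    # state 3 learns first, then state 2, and so on.
    ALPHA, GAMMA, N_STATES = 0.1, 0.9, 5
    value = [0.0] * N_STATES

    for episode in range(200):
        state = 0
        while state != N_STATES - 1:
            next_state = state + 1            # the only available move: step forward
            reward = 1.0 if next_state == N_STATES - 1 else 0.0
            # TD(0) update: nudge V(s) toward reward + discounted V(s')
            value[state] += ALPHA * (reward + GAMMA * value[next_state] - value[state])
            state = next_state

    print([round(v, 2) for v in value])       # values fall off with distance from the goal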

I don't know why we have consciousness, and I don't know what determines which kinds of learning require conscious attention.  For those that do, the signals produce some variety of pleasure or pain.  We learn to pay attention to things associated with pleasure or pain, and for planning, we may use TD to build something analogous to a Markov process (sorry, I found no good link; and Wikipedia's entry on Markov chain is not what you want) where, given a sequence of the previous n states or actions (A1, A2, ... An), the probability of taking action A is proportional to the expected (pleasure - pain) for the sequence (A1, ... An, A).  In short, we learn to do things that make us feel better.
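
(Here is a toy sketch of that selection rule, with an invented history window and an invented table of expected net values; since a probability can't be negative, the sketch clamps negative expectations to zero.)

    # Illustrative only: choose the next action with probability proportional to
    # the expected (pleasure - pain) of appending it to the recent history.
    # The history window, candidate actions, and value table are all invented.
    import random

    def choose_action(history, candidates, expected_net_value):
        """history: tuple of recent states/actions; expected_net_value maps (history, a) -> float."""
        # Probabilities can't be negative, so clamp negative expectations to zero.
        weights = [max(expected_net_value.get((history, a), 0.0), 0.0) for a in candidates]
        if sum(weights) == 0:
            return random.choice(candidates)  # nothing looks good: pick at random
        return random.choices(candidates, weights=weights, k=1)[0]

    expected_net_value = {(("wake", "smell_food"), "eat"): 0.8,
                          (("wake", "smell_food"), "sleep"): 0.1}
    print(choose_action(("wake", "smell_food"), ["eat", "sleep"], expected_net_value))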

Less-obvious stuff

Here's a key point which is overlooked (or specifically denied) by most AI architectures:  Believing is an action.  Building an inference chain is not just like constructing a plan; it's the same thing, probably done by the same algorithm.  Constructing a plan includes inferential steps, and inference often inserts action steps to make observations and reduce our uncertainty.

Actions, including the "believe" action, have preconditions.  When building a plan, you need to find actions that achieve those preconditions.  You don't need to look for things that defeat them.  With actions, this isn't much of a problem, because actions are pretty reliable.  If you put a rock in the fire, you don't need to weigh the evidence for and against the proposition that the rock is now in the fire.  If you put a stick in a termite mound, it may or may not come out covered in termites.  You don't need to compute the odds that the stick was inserted correctly, or the expected number of termites; you pull it out and look at the stick.  If you find something that causes it not to be covered in termites, such as using the wrong sort of stick, the cause is probably simple enough that you can add it to your preconditions for next time.
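
(A toy backward-chaining planner makes the one-sidedness explicit: to achieve a goal, find any operator whose effect matches it, then recursively achieve that operator's preconditions. No step in the loop looks for evidence against anything. The operators and facts below are invented for the example.)

    # Illustrative backward-chaining planner: to achieve a goal, find an operator
    # whose effect matches it, then recursively achieve that operator's
    # preconditions. Nothing here ever looks for evidence *against* a step.
    OPERATORS = {
        "rock_is_hot":  {"preconds": ["rock_in_fire"],           "action": "wait"},
        "rock_in_fire": {"preconds": ["have_rock", "have_fire"], "action": "put rock in fire"},
        "have_rock":    {"preconds": [],                         "action": "pick up rock"},
        "have_fire":    {"preconds": [],                         "action": "light fire"},
    }

    def plan_for(goal, known_facts, steps=None):
        """Return a list of actions achieving `goal`, or None if no operator applies."""
        steps = [] if steps is None else steps
        if goal in known_facts:
            return steps
        op = OPERATORS.get(goal)
        if op is None:
            return None
        for precond in op["preconds"]:
            if plan_for(precond, known_facts, steps) is None:
                return None
        steps.append(op["action"])
        return steps

    print(plan_for("rock_is_hot", known_facts=set()))
    # -> ['pick up rock', 'light fire', 'put rock in fire', 'wait']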

You don't need to consider all the ways that your actions could be thwarted until you start doing adversarial planning, which can't happen until you've already started incorporating belief actions into your planning.  (A tiger needs to consider which ways a wildebeest might run to avoid it, but probably doesn't need to model the wildebeest's beliefs and use min-max - at least, not to any significant depth.  Some mammals do some adversarial planning and modelling of belief states; I wouldn't be surprised if squirrels avoid burying their nuts when other squirrels are looking.  But the domains and actors are simpler, so the process shouldn't break down as often as it does in humans.)

When we evolved the ability to make extensive use of belief actions, we probably took our existing plan-construction mechanism, and added belief actions.  But an inference is a lot less certain than an action.  You're allowed to insert a "believe" act into your plan if you can find just one thing, belief or action, that plausibly satisfies its preconditions.  You're not required to spend any time looking for things that refute that belief.  Your mind doesn't know that beliefs are fundamentally different from actions: the truth-values of the propositions describing the expected effects of your possible actions are strongly and causally correlated with whether you execute those actions, while the truth-values of your possible belief-actions are not, and can be made true or false by many other factors.
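
(Reusing the toy planner above, a "believe" act is just another operator: its precondition is a single supporting consideration, and nothing in the search asks whether there is evidence against it. The propositions are invented.)

    # Same algorithm, but the operators are "believe" acts. Each belief's
    # precondition is a single supporting consideration; there is no step that
    # searches for refuting evidence.
    BELIEF_OPERATORS = {
        "believe(deal_is_profitable)": {
            "preconds": ["believe(partner_is_honest)"],   # one support suffices
            "action":   "adopt belief: the deal is profitable",
        },
        "believe(partner_is_honest)": {
            "preconds": ["partner_smiled_at_me"],         # flimsy, but it matches
            "action":   "adopt belief: my partner is honest",
        },
    }

    def believe_chain(goal, observations, steps=None):
        steps = [] if steps is None else steps
        if goal in observations:
            return steps
        op = BELIEF_OPERATORS.get(goal)
        if op is None:
            return None
        for precond in op["preconds"]:
            if believe_chain(precond, observations, steps) is None:
                return None                               # never: "what argues against this?"
        steps.append(op["action"])
        return steps

    print(believe_chain("believe(deal_is_profitable)",
                        observations={"partner_smiled_at_me"}))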

You can string a long series of actions together into a plan.  If an action fails, you'll usually notice, and you can stop and retry or replan.  Similarly, you can string a long series of belief actions together, even if the probability of each one is only a little above .5, and your planning algorithm won't complain, because stringing a long series of actions together has worked pretty well in your evolutionary past.  But you don't usually get immediate feedback after believing something that tells you whether believing "succeeded" (deposited something in your mind that successfully matches the real world); so it doesn't work.
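
(A quick back-of-the-envelope calculation shows how badly this scales for beliefs: if each link is independent and only slightly better than a coin flip, the chain as a whole is almost certainly wrong, yet a planner that only checks preconditions never notices. The numbers below are just for illustration.)

    # Probability that a chain of independent beliefs holds end-to-end, when each
    # individual belief is only a bit better than chance. Compare actions, whose
    # per-step reliability is typically far closer to 1.
    for p in (0.55, 0.7, 0.99):
        for n in (5, 10, 20):
            print(f"per-step p = {p}:  {n:2d}-step chain holds with p = {p ** n:.3f}")
    # At p = 0.55 a 10-step chain is right about 0.3% of the time;
    # at p = 0.99 it still holds about 90% of the time.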

The old way of backchaining, by just trying to satisfy preconditions, doesn't work well with our new mental content.  But we haven't evolved anything better yet.  If we had, chess would seem easy.

Summary

Wishful thinking is a state-space-reduction heuristic.  Your ancestors' minds searched for actions that would enable actions that would make them feel good.  Your mind, therefore, searches for beliefs that will enable beliefs that will make you feel good.  It doesn't search for beliefs that will refute them.

(A forward-chaining planner wouldn't suffer this bias.  It probably wouldn't get anything done, either, as its search space would be vast.)

Comments

The final goal of a plan is a belief, i.e. the belief that state X currently holds. In your representation, this might appear as "X", but semantically it's always "believe(state(X))".

If that means what I think it does, I disagree. If you employ enough sense of intentionality to call something a "goal", then a self-referencing intelligence can refer to the difference between X obtaining and it believing X obtains, and choose not to wirehead itself into a useless stupor. This is what JGWeissman was getting at in Maximise Expected Utility, not Expected Perception of Utility.

I stated it poorly. Guess I better rewrite it. In the meantime, see my reply to Yvain below.

... time passes ...

I didn't rewrite it. I deleted it. That whole paragraph about believe(state(X)) contributed nothing to the argument. And, as you noted, it was wrong.

With that paragraph deleted, it was difficult for me (just reading it now) to make the inference connecting your argument to wishful thinking. You might want to spell it out.

I don't think it's because I deleted that paragraph. I think it was just unclear. I rewrote the second half.

Much improved, and accordingly upvoted.

I read this article after you deleted that paragraph, but I had basically the same objection reading "between the lines" of the rest of what you said.

Obviously, any animal that did something like this all the time would die. It's possible that doing it to a limited degree might really happen. Is there a way to test your hypothesis?

What does the "something like this" in your sentence refer to?

Replacing a belief about what actually obtains, i.e. food, with a belief that the actions it is already taking (sitting in place) will obtain it food.

"When we evolved the ability to make extensive use of belief actions, we probably took our existing plan-construction mechanism, and added belief actions."

This is an intriguing and plausible idea. Do you have any proposed mechanism for how we could test this?

I don't know how to test it in humans. I developed a variant of SNePS with SNActor that used a single inference engine both to direct inference and to make plans, about 24 years ago. But it was poor at identifying the right actions and propositions to think about (and was running on a 66MHz CPU with maybe 8MB of RAM), so it did neither inference nor plan construction well, and I couldn't conclude anything from it. It was never worked into the main SNePS code branch, and the code is lost now, unless I have it on an old hard drive.

Definitely makes some sense.

But I didn't understand what you meant in the paragraph starting with "What stops us from just saying..." What does stop us from just saying this, and how come some desires successfully result in action and others result in wishful thinking? Can you predict when wishful thinking would be more likely to occur?

On a similar note, if "the final goal of a plan is a belief", would you expect me to be indifferent between saving the world and taking a pill that caused me to believe that the world was saved, or is that confusing levels?

The algorithm you use to build your plan won't let you believe a step in the plan is successful until you can satisfy its preconditions. The problem is that "satisfy its preconditions" can be done in a one-sided, non-Bayesian manner, which doesn't work as well for inference as for action.

Re. the pill - that's a good question. To avoid taking the pill, you'd need to have a representation that distinguishes between causing X and causing believes(X), from the viewpoint of an outside observer. What I said in the post needs to be revised or clarified to account for this.

Your goal is X, a truth in the external world. When constructing a plan, you operate in a belief space representing the external world's viewpoint. You predict a plan will be successful if, in simulating it, you find it leads to the assertion X within that simulated belief space; not if it leads to finding believes(you, X) there. believes(you, X) in that simulated belief space maps to X in your "root" belief space (which I'll call your mind); X in the simulated belief space maps to X being true in the external world.

Successfully executing that plan would result in finding X in your mind. To an external observer, the X in your mind means believes(you, X), not X. That's because, to that observer, your mind is a belief space, just like the belief space you use when simulating a plan.

To represent the pill-taking action this way:

A = action(eat(me, pill)), precondition(A, have(me, pill)), consequence(A, goal).

is not right, because that represents that you believe that eating the pill makes X true in the external world.

At first, it appears that it would also be wrong to represent it as

consequence(A, believes(me, goal))

because it appears that eating the pill would cause you to add believes(me, X) instead of X to your knowledge base, whereas you actually will add X.

However! Your inference engine is not the world. The representation in your mind, "consequence(A, believes(me, goal))", is not what the world actually uses to compute the results if you eat the pill. It's easy to forget this, because so often we write simulated worlds where we use one and the same rule set both for our agents to reason with, and also for the simulator to compute the next world state. So it's fine to use this representation.
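
(To make the distinction concrete, here is a toy sketch in which the agent's own rule for the pill only claims to produce a belief, while the world's transition function, which the agent never reasons with, is what actually plants X in the agent's head. The names and structure are invented for the example.)

    # Illustrative only: the agent's model of the pill vs. what the world does.
    agent_beliefs = set()

    # The agent's own rule for the pill claims only to produce a belief, so
    # simulating it never makes X true in the simulated external world.
    agent_model = {"eat_pill": {"adds_to_world": set(), "adds_to_my_beliefs": {"X"}}}

    def world_execute(action):
        """The world's transition function -- not the rule set the agent reasons with."""
        if action == "eat_pill":
            agent_beliefs.add("X")   # the world really does plant X in the agent's head

    def plan_achieves_goal(action, goal="X"):
        # Plan evaluation checks the simulated *external* world, not the simulated mind.
        return goal in agent_model[action]["adds_to_world"]

    print(plan_achieves_goal("eat_pill"))   # False: the pill plan is rejected
    world_execute("eat_pill")
    print(agent_beliefs)                    # {'X'}: executing it would plant the belief anyway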

"Supervised learning" means sensory inputs are presented and paired with indications of the desired associated motor outputs. Just using a reward signal is usually unsupervised: reinforcement learning.

http://en.wikipedia.org/wiki/Supervised_learning

The term "supervised learning" doesn't have to do just with things for which there are motor outputs. If you want to train a system to recognize numbers, and you provide it with 100,000 photographs of handwritten numbers, and each photo is labelled with the number it pictures, that's supervised learning.

The reward signal is like a label. You need an oracle that provides the proper reward signal. Therefore, supervised learning.
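
(For concreteness, the structural comparison looks something like this; whether the bare reward in the second case counts as a "label" is exactly the terminological question here. The data are made up.)

    # Illustrative data shapes only. Supervised learning consumes (input, correct output)
    # pairs; reinforcement learning consumes (state, action, reward) triples. Whether a
    # bare reward counts as a "label" is the terminological question in this thread.
    supervised_example    = {"input": "photo_of_digit_0042.png", "correct_output": 7}
    reinforcement_example = {"state": "s_17", "action": "a_3", "reward": -1.0}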

You should treat "motor outputs" as a synonym for "actuator signals" in the above comment if it is causing confusion.

Your definition of supervised learning doesn't seem to be the conventional one. Supervised learning is normally contrasted with reinforcement learning:

"Reinforcement learning differs from the supervised learning problem in that correct input/output pairs are never presented, nor sub-optimal actions explicitly corrected."

As I tried to explain in the post, a complete system that uses some function to generate its own reward signal is unsupervised. If you don't know how that reward signal is generated, and are just looking at the learning done with it, you're looking at a supervised system, which is part of a more-mysterious unsupervised system.

'Unsupervised' is sexier, and people are motivated to bend the term to cover whatever they're working on. But for the purposes of this post, it doesn't matter one bit which term you use.

This all sounds very strange to me. If there is a supervisor - but all they do is use a carrot and a stick - then I think that would generally be classified as reinforcement learning. Supervised learning is where the learner gets given the correct outputs - or is told the right answers.

http://en.wikipedia.org/wiki/Supervised_learning

http://en.wikipedia.org/wiki/Unsupervised_learning

http://en.wikipedia.org/wiki/Semi-supervised_learning

I'm saying that applying carrot/stick is equivalent to saying yes/no.

I deleted the whole paragraph about supervised/unsupervised, since it contributed nothing and was obviously a distraction.