Decision theories as heuristics

[-][anonymous]11y90

Meta comment: upvoted because right or wrong I'd very much like to see more posts like this.

I upvoted this comment because I agree with Mark about upvoting discussion posts that I would like to see more of.

I down-voted this comment because it is a clever ploy for karma that rests on exploiting LessWrongers' sometimes unnecessary enthusiasm for increasingly abstract and self-referential forms of reasoning but otherwise adds nothing to the conversation.

Twist: By "this comment" I actually mean my comment, thereby making this a paraprosdokian.

[-]So8res11y40

(Mostly pasted from a conversation I had with esrogs)

While there's some sense in which we're eventually going to need to use decision making heuristics, and in which using CDT on a graphical model of a world is just a heuristic, there's also a sense in which we don't know what we're approximating yet or how well our existing DTs approximate it.

My interest is in figuring out what the idealized process we want to approximate is first, and then figuring out the heuristics. The whole "Newcomblike problems are the norm" thing is building towards the motivation of "this is why we need to better understand what we're approximating" (although it could also be used to motivate "this is why we need better heuristics", but that was not my point).

Your objection seems similar to Vaniver's, in the main thread, that CDT could find a causal connection between its choice and the contents of the boxes in the Newcomb problem. This appeals to the intuition that there is some connection between the choice and the boxes (which there is), but fails to notice that the connection is acausal.

Or, in other words, it's a good intuition that "something like the CDT algorithm" can solve Newcomb's problem if you just give it a "good enough" world-model that allows it to identify these connections. But this involves using a non-causal world model. And, indeed, it is these non-causal world models that we must use to capture the intuition that you can win at Newcomb's problem using a non-causal decision theory.

Whenever there are non-causal connections (as in Newcomb problems) you need to have a world model containing non-causal connections between nodes.

(Side note: EDT is underspecified, and various attempts to fully specify it can make it equivalent to CDT or TDT or UDT, but we only found the latter two specifications after discovering TDT/UDT. It doesn't seem very useful to me to say that EDT works well/poorly unless you better specify EDT.)

I feel like there's this problem where when I say "look at this clear-cut case of there being non-causal connections", some people respond "but Newcomb problems are unrealistic", and then when I say "look at these realistic cases where there are realistically acausal connections", others say "ah, but this is not clear cut" -- and that's what you're doing when you say

If you allow the decision-maker to think carefully through all the unconscious signals sent by her decisions, it’s less clear that there’s anything Newcomblike

I'm sympathetic to this claim, but hopefully you can see the thing that I'm trying to point to here, which is this: there really are scenarios where there are acausal logical connections (that we care about) in the world.

Surely you agree that information can propagate acausally, e.g. if I roll a die and write down the result in two envelopes and send one to alpha centauri and read the other after it gets there -- I can learn what is in the envelope on alpha centauri "faster than light"; the physical causal separation does not affect information propagation. These things are often, but not always, related.

Similarly, the connections in the world that I care about are related to the information that I have, not to the causal connections between them. These things often correspond, but not always.

It is in this sense that CDT is doing the wrong thing: it's not the "evaluate counterfactuals and pick the best option" part that's the problem, it's the "how do you construct the counterfactuals (and on what world-model)" that is the problem.

We will inevitably eventually need to use decision making heuristics, but at this point we don't even know what we're approximating, and We're decidedly not looking specifically for "good decision-making heuristics" right now. We're trying to figure out decision theory in an idealized/deterministic setting first, so that by the time we do resort to heuristics we'll have some idea about what it is we're trying to approximate.

[-]owencb11y20

I'm sympathetic to this claim, but hopefully you can see the thing that I'm trying to point to here, which is this: there really are scenarios where there are acausal logical connections (that we care about) in the world.

I agree with this -- I think the absentminded driver is a particularly clean-cut case.

I was partly trying to offer an explanation of what was going on in e.g. discussions of Newcomb's problem where people contrast CDT with EDT. Given that you say EDT isn't even fully specified, it seems pretty clear that they're interpreting it as a heuristic, but I'm not sure they're always aware of that.

Surely you agree that information can propagate acausally

Yes -- nice example.

We will inevitably eventually need to use decision making heuristics, but at this point we don't even know what we're approximating, and We're decidedly not looking specifically for "good decision-making heuristics" right now.

I'm not entirely convinced by this. We can evaluate heuristics by saying "how well does implementing them perform?" (which just needs us to have models of the world and of value). I certainly think we can make meaningful judgements that some heuristics are better than others without knowing what the idealised form is.

That said, I'm sympathetic to the idea that studying the idealised form might be more valuable (although I'm not certain about that). The thrust of my post arguing that understanding the heuristics is valuable was to make it clear that I was trying to clarify the fact that some people end up discussing heuristics without realising it, rather than to attack such people.

[-]cousin_it11y40

I'm more interested in idealized decision theories than in heuristics, because until we figure out the idealized part, we don't know what we're trying to approximate or how well we're approximating it. All my decision theory posts on LW, as well as many of other people's posts, have followed this approach.

Also I think that discussing things at the level of "heuristics" might lead people to misconceptions. For example, UDT is not just for interacting with other agents. It's necessary in scenarios where you are the only agent, like the Absent-Minded Driver problem.

[-]owencb11y10

I'm open to the idea that the idealised form is more worth studying. I still think that a substantial fraction of discussion relates to heuristics (for instance the linked post saying that Newcomblike problems are common), and that having a way to notice this and separate it off would improve dialogue.

The Absent-Minded Driver problem is a good example of a problem which doesn't seem to succumb to confusion about idealised/heuristics.

I should clarify that I didn't want to claim that UDT was just for interacting with agents. I wanted to show the space of statements we could start discussing. From a heuristic level, there at least seem to be some cases where it will be unnecessary to use UDT, and more complex than is needed. It would be nice if there were a simple characterisation of how to recognise if you were in a scenario where CDT might fail.

[-]cousin_it11y40

I'm not sure that "Newcomblike problems are common" was intended as an argument about heuristics. To me it's more of an argument about which idealization we want to be studying in the long run.

A simpler version of the argument could go like this. When we build an AI, it will be able to make indistinguishable copies of itself, and the existence of these copies might depend on the AI's decisions and on facts about the world. VNM utility maximization doesn't cover such situations, so we need a more general theory that's as mathematically nice and reduces to VNM utility maximization when you have only one copy.

To be more precise, by "copies" I mean multiple decision nodes belonging to the same information set, as described in this post. These can also arise from mechanisms other than copying, such as forgetting things (AMD problem), having one agent simulate another (Newcomb's problem), or just putting multiple agents in the same situation (anthropic problems).

Does that answer your question about when UDT should be used?

[-]buybuydandavis11y10

Nice post.

Fundamentally, the Newcomb problem is about weighing off our confidence in CDT versus the evidence about Omega's predictive skill. If we have confidence in both, but they point in different directions, then it's which we have more confidence in. This kind of trade off happens all the time, just not to this degree.

We could write it out in Jaynes' notation, and that might make it clear to those to whom it isn't already.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

21

Decision theories as heuristics

21

21

Existing discussion is often about heuristics

Understanding decision-making heuristics is valuable

We should acknowledge when we’re talking about heuristics