Should B2 be "$10 > $5 (probability 0.9999)?". If so, you find yourself in the situation where you have 0.99+ for two contradictory hypothesis, and it's not clear to me what the step "ignore the proportion of probability mass assigned to worlds where 1 and 2 are both true" actually looks like.

Reply

[-]J Bostock4y10

Should B2 be "$10 > $5 (probability 0.9999)?"

Yes it should be, thanks for the catch.

We only ignore the proportion of that probability mass while thinking about the counterfactual world in which $5 is taken. It's just treated as we would ignore the probability mass previously assigned to anything we now know to be impossible.

I used "ignore" to emphasize that the agent is not updating either of it's beliefs about B1 or B2 based on C1. It's just reasoning in a "sandboxed" counterfactual world where it now assigns ~99% probability to it taking the lower of $5 and $10 and ~1% chance to $5 being larger than $10. From within the C1 universe it looks like a standard (albeit very strong) bayesian update.

When it stops considering C1, it "goes back to" having strong beliefs that both B1 and B2 are true.

Reply

[-]NunoSempere4y10

Can you give the probabilities that the agent assigns to B1 through D4 in the "sandboxed" counterfactual?

Reply

[-]J Bostock4y20

Yeah, so there are four options, . These will have the ratios $0.99 \times 0.9999 : 0.01 \times 0.9999 : 0.99 \times 0.0001 : 0.01 \times 0.0001$ . By D4 we'd eliminate the first one. The remaining odds ratios are normalized to be something around $0 : 0.9901 : 0.0098 : 0.0001$ . I.e. given that the agent takes $5 instead of $10, it is pretty sure that it's taken the smaller one for some reason, gives a tiny probability of it having miscalculated which of $5 and $10 are larger, and a really really small probability that both are true.

In fact were it to reason further it would see that the fourth option is also impossible, we have an XOR type situation on our hands. Then it would end up with odds around $0 : 0.9902 : 0.0098 : 0$ .

That last bit was assuming that it doesn't have uncertainty about its own reasoning capability.

Ideally it would also consider that D4 might be incorrect , and still assign some tiny $ϵ$ of probability ( $10^{- 10}$ for example, the point is it should be pretty small to both the first and fourth options giving $10^{- 10} : 0.9902 : 0.0098 : 10^{- 10}$ . It wouldn't really consider them for the purposes of making predictions, but to avoid logical explosions, we never assign a "true" zero.