Deriving Conditional Expected Utility from Pareto-Efficient Decisions

Thoughts on the process of writing this post:

It took a lot of effort to write, something like 3 days of my time. Distillation is hard.
Most of this effort was not in understanding the original post (took me 2-3 hours to understand the math)
I sent drafts to johnwentworth several times and had several conversations with him to refine this piece. This probably spent ~2 hours of his time.
I'm not satisfied with the final result. It seems like the point the original post made was fairly obvious and I used way too many words to explain it properly. Maybe John thought the interpretation of the math was fairly deep and I thought it wasn't very deep?
I think that since John is a good and prolific writer already compared to most alignment researchers, there is higher value in distilling ideas of other researchers. It's hard to produce a lot of value from content already on LW.
- Paul Christiano blogposts are somewhat famously opaque; distillations of these have worked in the past and still seem pretty valuable. The highest-relevance academic papers might be better. But many of the highest-value distillations probably involve talking to researchers to get things they're too busy to write down at all.

^{^}

John made the following comment:

We are showing that the agent performs Bayesian updates, in some sense. That's basically what conditioning is. It's just not necessarily performing a series of updates over time, with each retaining the information from the previous, the way we usually imagine.

^{^}

When f depends on past decisions, the agent just maximizes $E [u (A, X) | f_{i} (A_{< i}, X) = o_{i}]$ . To see the math for the multi-decision case, read the original post by John Wentworth.

^{^}

If the world has $b_{X}$ bits of state, and the observations reveal $b_{o}$ bits of information each, the pigeonhole principle says this surely happens when there are $b_{x} / b_{o}$ observations. Our universe has about $10^{125}$ bits of state, so this won't happen unless our agent can operate coherently in ~ $10^{125}$ different decisions; this number can maybe be reduced if we suppose that our agent can only actually observe, say, $10^{10}$ bits of state.

f(X)	A(f(X))
{("Mad Seoul", 4.5), ("Sushinista", 4.8)}	eat at Sushinista
{("Kimchi Garden", 4.3), ("Great China", 4.4)}	eat at Kimchi Garden
…	…

LESSWRONG
LW

LESSWRONG
LW

24

Deriving Conditional Expected Utility from Pareto-Efficient Decisions

24

24

Introduction

Summary

Pareto efficiency over possible worlds implies EUM

EUM implies conditional expected value

Multiple decisions might imply conditional EV is meaningful