Time in Cartesian Frames

Scott Garrabrant

Crossposted from the AI Alignment Forum. May contain more technical jargon than usual.

This is the twelfth and final post in the Cartesian Frames sequence. Read the first post here.

Up until now, we have (in the examples) mostly considered agents making a single choice, rather than acting repeatedly over time.

The actions, environments, and worlds we've considered might be extended over time. For example, imagine a prisoner's dilemma where "cooperating" requires pushing a button every day for five years.

However, our way of discussing Cartesian frames so far would treat "push the button every day for five years" as an atomic action, a single element .

Now, will begin discussing how to use Cartesian frames to explicitly represent agents passing through time. Let us start with a basic example.

1. Partial Observability

Consider a process where two players, Yosef and Zoe, collaboratively choose a three-digit binary number. Yosef first chooses the first digit, then Zoe chooses the second digit, then Yosef chooses the third digit. The world will be represented by the three-digit number. The Cartesian frame from the perspective of Yosef looks like this:

$C_{0} = ⎛ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎝ \begin{matrix} 000 & 010 & 000 & 010 001 & 011 & 001 & 011 000 & 011 & 000 & 011 001 & 010 & 001 & 010 100 & 110 & 110 & 100 101 & 111 & 111 & 101 100 & 111 & 111 & 100 101 & 110 & 110 & 101 \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎠$ .

Here, $C_{0} = (A_{0}, E_{0}, \cdot_{0})$ is a Cartesian frame over $W_{0} = {000, 001, 010, 011, 100, 101, 110, 111}$ .

The four possible environments from left to right represent Zoe choosing 0, Zoe choosing 1, Zoe copying the first digit, and Zoe negating the first digit.

The eight possible agents can be broken up into two groups of four. In the top four possible agents, Yosef chooses 0 for the first digit, while in the bottom four, he chooses 1. Within each group, the four possible agents represent Yosef choosing 0 for the third digit, choosing 1 for the third digit, copying the second digit, and negating the second digit.

Consider the three partitions $W_{1}$ , $W_{2}$ , and $W_{3}$ of $W_{0}$ representing the first, second and third digits respectively. $W_{i} = {w_{i}^{0}, w_{i}^{1}}$ , where $w_{1}^{0} = {000, 001, 010, 011}$ , $w_{1}^{1} = {100, 101, 110, 111}$ , $w_{2}^{0} = {000, 001, 100, 101}$ , $w_{2}^{1} = {010, 011, 110, 111}$ , $w_{3}^{0} = {000, 010, 100, 110}$ , and $w_{3}^{1} = {001, 011, 101, 111}$ .

Clearly, by the definition of observables, $W_{2}$ is not observable in $C_{0}$ . But there is still a sense in which this does not tell the whole story. Yosef can observe $W_{2}$ for the purpose of deciding the third digit, but can't observe $W_{2}$ for the purpose of deciding the first digit.

There are actually many ways to express this fact, but I want to draw attention to one specific way to express this partial observability: ${External}_{W_{1}} (C_{0})$ can observe $W_{2}$ .

Indeed, we have

${External}_{W_{1}} (C_{0}) ≃ C_{1} = ⎛ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎝ \begin{matrix} 000 & 010 & 100 & 110 000 & 010 & 100 & 111 000 & 010 & 101 & 110 000 & 010 & 101 & 111 000 & 011 & 100 & 110 000 & 011 & 100 & 111 000 & 011 & 101 & 110 000 & 011 & 101 & 111 001 & 010 & 100 & 110 001 & 010 & 100 & 111 001 & 010 & 101 & 110 001 & 010 & 101 & 111 001 & 011 & 100 & 110 001 & 011 & 100 & 111 001 & 011 & 101 & 110 001 & 011 & 101 & 111 \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎠$ .

It may seem counter-intuitive that when you externalize $W_{1}$ , and thus take some control out of the hands of the agent, you actually end up with more possible agents. This is because the agent now has to specify what the third digit is, not only as a function of the second digit, but also as a function of the first digit. The agent could have specified the third digit as a function of the first digit before, but some of the policies would have been identical to each other.

The four possible environments of $C_{1}$ specify the first two digits, while the 16 possible agents represent all of the ways to have the third digit be a function of those first two digits. It is clear that $W_{2}$ is observable in $C_{1}$ .

This gives us a generic way to define a type of partial observability:

Definition: Given a Cartesian frame $C$ over $W$ , and partitions $V$ and $T$ of $W$ , we say $V$ is observable in $C$ after time $T$ if $V$ is observable in ${External}_{T} (C)$ .

2. Partitions as Time

Built into the above definition is the fact that we are thinking of (at least some) partitions of $W$ as representing time. This makes a lot of sense when we think of $W$ as a set of possible complete world histories. For any given time, this gives a partition where world histories are in the same subset if they agree on the world history up to that point in time.

For example, the above partition $W_{1}$ was the partition that we got by considering a time after Yosef chooses the first digit, but before Zoe chooses the second digit.

Further, this gives us a sequence of nested partitions, since the partition associated with one time is always a refinement of the partition associated with an earlier time.

Note that this is a multiplicative/updateless view of time. There is also an additive/updateful view of time, in which time is a nested sequence of subsets. In the additive view, possible worlds are eliminated as you pass through time. In the multiplicative view, possible worlds are distinguished from each other as you pass through time. We will focus on the multiplicative view, which I consider better-motivated.

3. Nested Subagents

Let $C = (A, E, \cdot)$ be a fixed Cartesian frame over a world $W$ . Let $T_{0}, \dots, T_{n}$ be a sequence of nested partitions of $W$ , with $T_{0} = {W}$ , $T_{n} = {{w} | w \in W}$ , and $T_{i + 1}$ a refinement of $T_{i}$ .

This gives a nested sequence of multiplicative superagents $C_{T_{n}} ◃_{\times} \dots ◃_{\times} C_{T_{0}}$ , where $C_{T_{i}} = {External}_{T_{i}} (C)$ , which follows from the lemma below.

Lemma: Given a Cartesian frame $C$ over $W$ , if $U$ and $V$ are partitions of $W$ and $U$ is a refinement of $V$ , then ${External}_{U} (C) ◃_{\times} {External}_{V} (C)$ .

Proof: Let $C = (A, E, \cdot)$ , and let $u : W \to U$ and $v : W \to V$ send each element of $W$ to their part in $U$ and $V$ respectively. Let ${External}_{U} (C) = (A / B_{U}, B_{U} \times E, \cdot_{U})$ , where $B_{U} = {{a^{'} \in A | \forall e \in E, u (a^{'} \cdot e) = u (a \cdot e)} | a \in A}$ . Similarly, let ${External}_{V} (C) = (A / B_{V}, B_{V} \times E, \cdot_{V})$ , where $B_{V} = {{a^{'} \in A | \forall e \in E, v (a^{'} \cdot e) = v (a \cdot e)} | a \in A}$ . Let $b_{U} : A \to B_{U}$ and $b_{V} : A \to B_{V}$ send each element of $A$ to its part in $B_{U}$ and $B_{V}$ respectively.

Since $U$ is a refinement of $V$ , there exists a $v^{'} : U \to V$ , such that $v^{'} \circ u = v$ . Further, we have that $B_{U}$ is a refinement of $B_{V}$ , so there exists a $b_{V}^{'} : B_{U} \to B_{V}$ such that $b_{V}^{'} \circ b_{U} = b_{V} .$

It suffices to show there exist three sets $X$ , $Y$ , and $Z$ , and a function $f : X \times Y \times Z \to W$ such that ${External}_{U} (C) ≃ (X, Y \times Z, ⋄)$ and ${External}_{V} (C) ≃ (X \times Y, Z, ∙)$ , where $⋄$ and $∙$ are given by $x ⋄ (y, z) = f (x, y, z)$ and $(x, y) ∙ z = f (x, y, z)$ .

We will take $X$ to be $A / B_{U}$ and $Z$ to be $B_{V} \times E$ . We define $Y$ to be the set of all right inverses to $b_{V}^{'}$ , $Y = {y : B_{V} \to B_{U} | \forall b \in B_{U}, b_{V}^{'} (y (b)) = b}$ . We will let $f (x, y, (b, e)) = x (y (b)) \cdot e$ .

First, we show

\begin{matrix} {External}_{U} (C) & = (A / B_{U}, B_{U} \times E, \cdot_{U}) ≃ (X, Y \times Z, ⋄) . \end{matrix}

We define

\begin{matrix} (g_{0}, h_{0}) : (A / B_{U}, B_{U} \times E, \cdot_{U}) \to (X, Y \times Z, ⋄) \end{matrix}

and

\begin{matrix} (g_{1}, h_{1}) : (X, Y \times Z, ⋄) \to (A / B_{U}, B_{U} \times E, \cdot_{U}) \end{matrix}

as follows. Let $g_{0}$ and $g_{1}$ be the identity on $X = A / B_{U}$ , and let $h_{0} : Y \times Z \to B_{U} \times E$ be given by $h_{0} (y, (b, e)) = (y (b), e)$ . Finally, let $h_{1} : B_{U} \times E \to Y \times Z$ be chosen to satisfy $h_{1} (b, e) = (y, (b_{V}^{'} (b), e))$ , where $y$ is such that $y (b_{V}^{'} (b)) = b$ , and for $b^{'} \neq b_{V}^{'} (b)$ , $y (b^{'})$ is chosen arbitrarily to be any preimage of $b^{'}$ under $b_{V}^{'}$ .

We have that $(g_{0}, h_{0})$ is a morphism, because for all $x \in A / B_{U}$ and $(y, (b, e)) \in Y \times Z$ ,

\begin{matrix} g_{0} (x) ⋄ (y, (b, e)) & = f (x, y, (b, e)) = x (y (b)) \cdot e = x \cdot_{U} (y (b), e) = x \cdot_{U} h_{0} (y, (b, e)) . \end{matrix}

Similarly, $(g_{1}, h_{1})$ is a morphism, because for all $x \in X$ and $(b, e) \in B_{U} \times E$ , we have

\begin{matrix} g_{1} (x) \cdot_{U} (b, e) & = x \cdot_{U} (b, e) = x (b) \cdot e = x (y (b_{V}^{'} (b))) \cdot e = f (x, y, (b_{V}^{'} (b), e)) = x ⋄ (y, (b_{V}^{'} (b), e)) = x ⋄ h_{1} (b, e), \end{matrix}

where $y$ is as given in the definition of $h_{1}$ . Since $g_{0} \circ g_{1}$ and $g_{1} \circ g_{0}$ are both the identity, we have that $(g_{0}, h_{0}) \circ (g_{1}, h_{1})$ and $(g_{1}, h_{1}) \circ (g_{0}, h_{0})$ are both homotopic to the identity, so ${External}_{U} (C) ≃ (X, Y \times Z, ⋄)$ .

Next, we show

\begin{matrix} {External}_{V} (C) & = (A / B_{V}, B_{V} \times E, \cdot_{V}) ≃ (X \times Y, Z, ∙) . \end{matrix}

We define

\begin{matrix} (g_{2}, h_{2}) : (A / B_{V}, B_{V} \times E, \cdot_{V}) \to (X \times Y, Z, ∙) \end{matrix}

and

\begin{matrix} (g_{3}, h_{3}) : (X \times Y, Z, ∙) \to (A / B_{V}, B_{V} \times E, \cdot_{V}) \end{matrix}

as follows. Let $h_{2}$ and $h_{3}$ be the identity on $Z = B_{V} \times E$ , and let $g_{3} : X \times Y \to A / B_{V}$ be given by $g_{3} (x, y) = x \circ y$ . To see that $x \circ y$ is in $A / B_{V}$ , we need to verify that $b_{V} \circ x \circ y$ is the identity on $B_{V}$ . Indeed,

\begin{matrix} b_{V} \circ x \circ y & = b_{V}^{'} \circ b_{U} \circ x \circ y = b_{V}^{'} \circ y, \end{matrix}

which is the identity on $B_{V}$ . Let $g_{2} : A / B_{V} \to X \times Y$ be given by $g_{2} (q) = (q^{'}, b_{U} \circ q)$ , where $q^{'} \in A / B_{U}$ is chosen such that for all $b \in B_{V}$ , $q^{'} (b_{U} (q (b))) = q (b)$ , and for $b^{'}$ not in the image of $b_{U} \circ q$ , $q^{'} (b^{'}) \in b^{'}$ . We can do this simultaneously for all inputs of the form $b_{U} (q (b))$ , since $b_{U} \circ q$ is injective, since it has a left inverse, $b_{V}^{'}$ .

We have that $(g_{2}, h_{2})$ is a morphism, because for all $q \in A / B_{V}$ and $(b, e) \in Z$ , we have

\begin{matrix} g_{2} (q) ∙ (b, e) & = (q^{'}, b_{U} \circ q) ∙ (b, e) = f (q^{'}, b_{U} \circ q, (b, e)) = q^{'} (b_{U} (q (b))) \cdot e = q (b) \cdot e = q \cdot_{V} (b, e) = h_{2} (q) \cdot_{V} (b, e), \end{matrix}

where $q^{'}$ is as in the definition of $g_{2}$ . Similarly, $(g_{3}, h_{3})$ is a morphism, because for all $(x, y) \in X \times Y$ and $(b, e) \in B_{V} \times E$ , we have

\begin{matrix} g_{3} (x, y) \cdot_{V} (b, e) & = x \circ y \cdot_{V} (b, e) = x (y (b)) \cdot e = f (x, y, (b, e)) = (x, y) ∙ (b, e) = (x, y) ∙ h_{3} (b, e) . \end{matrix}

Since $h_{3} \circ h_{2}$ and $h_{2} \circ h_{3}$ are both the identity, we have that $(g_{2}, h_{2}) \circ (g_{3}, h_{3})$ and $(g_{3}, h_{3}) \circ (g_{2}, h_{2})$ are both homotopic to the identity, so ${External}_{V} (C) ≃ (X \times Y, Z, ∙)$ , completing the proof. $□$

The sequence $C_{T_{0}}, \dots, C_{T_{n}}$ represents the agent persisting across time, but each subagent $C_{T_{i}}$ does not really represent a single time-slice of the agent. Instead, $C_{T_{i}}$ represents an agent persisting across time starting at the time $T_{i}$ .

I think that this is actually the more natural notion. However, if we want to think about an agent persisting across times as a sequence of single times-slices of the agent, we could also do that. Since $C_{T_{i + 1}}$ is a multiplicative subagent of $C_{T_{i}}$ , $C_{T_{i + 1}}$ must have a sister $D_{T_{i} + 1}$ in $C_{T_{i}}$ , so we could consider the sequence $D_{T_{1}}, \dots, D_{T_{n}}$ .

4. Controllables Decrease and Observables Increase Over Time

An interesting fact about these sequences $C_{T_{0}}, \dots, C_{T_{n}}$ is that controllables decrease and observables increase over time, so for $i \leq j$ we have $Obs (C_{T_{i}}) \subseteq Obs (C_{T_{j}})$ and $Ctrl (C_{T_{i}}) \supseteq Ctrl (C_{T_{j}})$ (and $Ensure (C_{T_{i}}) \supseteq Ensure (C_{T_{j}})$ and $Prevent (C_{T_{i}}) \supseteq Prevent (C_{T_{j}})$ ), which follows directly from the following two lemmas.

Lemma: Given a Cartesian frame $C$ over $W$ , if $U$ and $V$ are partitions of $W$ and $U$ is a refinement of $V$ , then $Ctrl ({External}_{V} (C)) \supseteq Ctrl ({External}_{U} (C))$ .

Proof: Let $C_{V} = {External}_{V} (C)$ , and let $C_{U} = {External}_{V} (C)$ . We will actually only need to use the fact that $C_{U} ◃_{\times} C_{V}$ , and that both $C_{U}$ and $C_{V}$ have nonempty agents. $C_{U}$ and $C_{V}$ do in fact have nonempty agent, because, as we have shown, externalizing a partition of $W$ always produces nonempty agents.

It suffices to establish that $Ensure (C_{T_{i}}) \supseteq Ensure (C_{T_{j}})$ , and the result for $Ctrl$ follows trivially.

Since $C_{U} ◃_{\times} C_{V}$ , there exist $X$ , $Y$ , $Z$ , and $f : X \times Y \times Z \to W$ such that $C_{U} ≃ (X, Y \times Z, ⋄)$ and $C_{V} ≃ (X \times Y, Z, ∙)$ , where $⋄$ and $∙$ are given by $x ⋄ (y, z) = f (x, y, z)$ and $(x, y) ∙ z = f (x, y, z)$ . Let $C_{U}^{'} = (X, Y \times Z, ⋄)$ , and let $C_{V}^{'} ≃ (X \times Y, Z, ∙)$ . Observe that $X$ and $Y$ are nonempty.

Since $Ensure$ is preserved by biextensional equivalence, it suffices to show that $Ensure (C_{V}^{'}) \supseteq Ensure (C_{U}^{'})$ . Let $S \in Ensure (C_{U}^{'})$ . Thus, there exists some $x_{0} \in X$ , such that for all $(y, z) \in Y \times Z$ , $x_{0} ⋄ (y, z) = f (x_{0}, y, z) \in S$ . Since $Y$ is nonempty, we can take an arbitrary $y_{0} \in Y$ , and observe that for all $z \in S$ , $(x_{0}, y_{0}) ∙ z = f (x_{0}, y_{0}, z) \in S$ . Thus, $S \in Ensure (C_{V}^{'})$ . $□$

Lemma: Given a Cartesian frame $C$ over $W$ , if $U$ and $V$ are partitions of $W$ and $U$ is a refinement of $V$ , then $Obs ({External}_{V} (C)) \subseteq Obs ({External}_{U} (C))$ .

Proof: Let $C = (A, E, \cdot)$ , and let $u : W \to U$ and $v : W \to V$ send each element of $W$ to their part in $U$ and $V$ respectively. Let ${External}_{U} (C) = (A / B_{U}, B_{U} \times E, \cdot_{U})$ , where $B_{U} = {{a^{'} \in A | \forall e \in E, u (a^{'} \cdot e) = u (a \cdot e)} | a \in A}$ . Similarly, let ${External}_{U} (C) = (A / B_{V}, B_{V} \times E, \cdot_{V})$ , where $B_{V} = {{a^{'} \in A | \forall e \in E, v (a^{'} \cdot e) = v (a \cdot e)} | a \in A}$ . Let $b_{U} : A \to B_{U}$ and $b_{V} : A \to B_{V}$ send each element of $A$ to its part in $B_{U}$ and $B_{V}$ respectively.

Let $S \in Obs ({External}_{V} (C)) .$ Thus, for every pair $q_{0}, q_{1} \in A / B_{V}$ , there exists a $q_{2} \in A / B_{V}$ such that $q_{2} \in if (S, q_{0}, q_{1})$ . Thus, we can define an $f : A / B_{V} \times A / B_{V} \to A / B_{V}$ such that for all $q_{0}, q_{1} \in A / B_{V}$ , $f (q_{0}, q_{1}) \in if (S, q_{0}, q_{1})$ .

Our goal is to show that $S \in Obs ({External}_{U} (C))$ . For this, it suffices to show that for any $q_{0}, q_{1} \in A / B_{U}$ , there exists a $q_{2} \in A / B_{U}$ such that $q_{2} \in if (S, q_{0}, q_{1})$ .

Let $q_{0}, q_{1} \in A / B_{U}$ be arbitrary. Given an arbitrary $b \in B_{U}$ , let $q_{i}^{b} \in A / B_{V}$ be any element that satisfies $q_{i}^{b} (b_{V}^{'} (b)) = q_{i} (b)$ . This is possible because $q_{i} (b) \in b \subseteq b_{V}^{'} (b)$ . It does not matter what $q_{i}^{b}$ does on other inputs. Let $q_{2} : B_{U} \to A$ be such that for all $b \in B_{U}$ , $q_{2} (b) = f (q_{0}^{b}, q_{1}^{b}) (b_{V}^{'} (b))$ .

To complete the proof, we need to show that $q_{2} \in A / B_{U}$ and $q_{2} \in if (S, q_{0}, q_{1})$ .

To show that $q_{2} \in A / B_{U}$ , we need that for all $b \in B_{U}$ , $q_{2} (b) \in b$ . Let $b \in B_{U}$ be arbitrary. Since $q_{0} (b) \in b$ , by the definition of $B_{U}$ , it suffices to show that for all $e \in E$ , $u (q_{2} (b) \cdot e) = u (q_{0} (b) \cdot e)$ . Further, since $q_{1} (b) \in b$ , we already have that for all $e \in E$ , $u (q_{1} (b) \cdot e) = u (q_{0} (b) \cdot e)$ . Thus, it suffices to show that for all $e \in E$ , either $q_{2} (b) \cdot e = q_{0} (b) \cdot e$ or $q_{2} (b) \cdot e = q_{1} (b) \cdot e$ . Indeed, if $q_{2} (b) \cdot e \in S$ , then

\begin{matrix} q_{2} (b) \cdot e & = f (q_{0}^{b}, q_{1}^{b}) (b_{V}^{'} (b)) \cdot e = q_{0}^{b} (b_{V}^{'} (b)) \cdot e = q_{0} (b) \cdot e, \end{matrix}

and similarly, if $q_{2} (b) \cdot e \notin S$ , then $q_{2} (b) \cdot e = q_{1} (b) \cdot e$ . Thus, we have that for all $e \in E$ , $u (q_{2} (b) \cdot e) = u (q_{0} (b) \cdot e)$ , so for our arbitrary $b \in B_{U}$ , $q_{0} (b) \in b$ , so $q_{2} \in A / B_{U}$ .

Let $(b, e) \in B_{U} \times E$ be such that $q_{2} \cdot_{U} (b, e) \in S$ . We want to show that $q_{2} \cdot_{U} (b, e) = q_{0} \cdot_{U} (b, e)$ . Indeed,

\begin{matrix} q_{2} \cdot_{U} (b, e) & = q_{2} (b) \cdot e = f (q_{0}^{b}, q_{1}^{b}) (b_{V}^{'} (b)) \cdot e = f (q_{0}^{b}, q_{1}^{b}) \cdot_{V} (b_{V}^{'} (b), e) = q_{0}^{b} \cdot_{V} (b_{V}^{'} (b), e) = q_{0}^{b} (b_{V}^{'} (b)) \cdot e = q_{0} (b) \cdot e = q_{0} \cdot_{U} (b, e) . \end{matrix}

Symmetrically, if $(b, e) \in B_{U} \times E$ is such that $q_{2} \cdot_{U} (b, e) \notin S$ , we have $q_{2} \cdot_{U} (b, e) = q_{1} \cdot_{U} (b, e)$ . Thus $q_{2} \in if (S, q_{0}, q_{1})$ .

Thus, since $q_{0}$ and $q_{1}$ were arbitrary, we have that $S \in Obs ({External}_{U} (C))$ , completing the proof. $□$

This result allows us to think of time as a sort of ritual in which control of the world is sacrificed in exchange for ability to condition on the world.

5. Directions for Future Work

As I noted at the start of this sequence, Cartesian frames take their motivation from Hutter, attempting to improve on the cybernetic agent model; they take their angle of attack from Pearl, using combinatorics to infer functional structure from relational structure; and they take their structure from game theory, working with base objects that look similar to normal-form games.

Building up from very simple foundations, we have found that Cartesian frames yield elegant notions of agents making choices and observations, of agents acting over time, and of subagent relations. At the same time, Cartesian frames allow us to switch between different levels of description of the world and consider many different ways of factorizing the world into variables.

I suspect that this is the last post I will write on Cartesian frames for a while, but I am excited about the framework, and would really like to get more people working on it.

To help with that, I've commented below with various directions for future work: ways that I think the framework could be extended, made better, or applied.

I've erred on the side of inclusion in these comments: some may point to dead ends, or may be based on false assumptions.

If you have questions or want to discuss Cartesian frames, I'll be hosting a fourth and final office hours / discussion section this Sunday at 2pm PT on GatherTown.

[-]Scott Garrabrant4yΩ7130

Preferences and goals

It might be interesting to put on top of this theory something that is dealing more with utilities, or something similar. Since this theory is basically a calculus of what agents could do, it seems likely that we could say interesting things by putting on top of it analysis of what agents should do.

[-]adamShimi4yΩ110

I don't think that's what you had in mind, but one reason I am interested in learning more about Cartesian Frames is that I think that they might prove useful for formalizing the locality of goals. Basically, the idea is to capture whether the goal followed by a system is really about its inputs, or if it is about the state of the world.

One way to understand this distinction is through wireheading. For example, I consider my own goals as about the world, because I wouldn't want to wirehead to believe that I accomplished them. Whereas having the goal of always being happy means being completely okay with wireheading, and so having a goal about my input instead of what truly happens in the world.

Intuitively, this distinction seem to depend on how the boundaries are drawn between the system/agent and the environment, as well as the interface. Which is where I draw a possible connection with Cartesian Frames. But I'm not sure if it is possible to use them for that purpose.

[-]Scott Garrabrant4yΩ7120

Category-theory-first approaches

I am in general not especially proficient in category theory, and I think that the whole framework could be rewritten from the ground up by someone who is more proficient in category theory than me, and be made much better in the process.

[-]Scott Garrabrant4yΩ6120

Logical uncertainty

There is a sense in which Cartesian frames is a very updateless ontology, and thus I am concerned about how to make it play nicely with logical uncertainty. Indeed, Cartesian frames are basically assuming that we have a set of possible worlds, which is assuming that we have objects that are the possible world that are not realized. Logical uncertainty does not do well with this assumption. Extending Cartesian frames to connect up with logical uncertainty is a major open problem.

[-]Scott Garrabrant4yΩ6110

Formalizing time

I think that much of the meat of what I want Cartesian frames to do is connected to time, and I have only really touched the surface of that. I think that there is a lot more to say about time, and I think there are options we have about how to think about time in Cartesian frames. The one I presented is my favorite at the moment, but I am uncertain.

For example, one might want to think about an agent, and the collection of pairs of partitions and $V$ of $W$ , such that the agent has a (multiplicative?) subagent that could choose $U$ , while observing $V$ . This collection of pairs is closed under coarsening in both arguments, and so one could talk about a sort of Pareto frontier of how refined you can make $U$ given $V$ or vice versa. I think this Pareto frontier looks a lot like time.

[-]Ramana Kumar3yΩ110

"subagent [] that could choose $U$ " -- do you mean $U \subseteq C t r l (C)$ or $U \subseteq E n s u r e (C)$ or neither of these? Since $C t r l$ is not closed under unions, I don't think the controllables version of "could choose" is closed under coarsening the partition. (I can prove that the ensurables version is closed; but it would have been nice if the controllables version worked.)

ETA: Actually controllables do work out if I ignore the degenerate case of a singleton partition of the world. This is because, when considering partitions of the world, ensurables and controllables are almost the same thing.

[-]Scott Garrabrant4yΩ590

Time and coarse world models

I feel like the partial observability I get from taking a coarsening of the world and saying an agent has observations in that coarsening is similar to the partial observability I get when saying an agent learns something at a specific time. In particular, these two things seem similar enough to me that one might be able to unify the two definitions, and in the process reveal new things about them.

[-]Ramana Kumar3yΩ490

I have something suggestive of a negative result in this direction:

Let be the prime-detector situation from Section 2.1 of the coarse worlds post, and let $p : W \to W$ be the (non-surjective) function that "heats" the outcome (changes any "C" to an "H"). The frame $p^{\circ} (C)$ is clearly in some sense equivalent to the one from the example (which deletes the temperature from the outcome) -- I am using my version just to stay within the same category when comparing frames. As a reminder, primality is not observable in $C$ but is observable in $p^{\circ} (C)$ .
Claim: No frame of the form ${E x t e r n a l}_{V} (C)$ is biextensionally equivalent to $p^{\circ} (C)$
Proof Idea: $I m a g e ({E x t e r n a l}_{V} (C)) = I m a g e (C) \neq I m a g e (p^{\circ} (C))$

The kind of additional observability we get from coarsening the world seems in this case to be very different from the kind that comes from externalising part of the agent's decision.

[-]Scott Garrabrant4yΩ480

Computational complexity

A random open question I am curious about, but doesn't seem that important: Is the existence of a morphism between Cartesian frames NP-complete?

[-]Scott Garrabrant4yΩ470

Logical time

In agent simulates predictor, I am given a proof that I output a certain action, and then I must make a choice. In making this choice, I am determining whether or not I am given that proof in the first place. Further, the proof must in some sense compress my deliberation, or I would not be able to comprehend it. Thus, I feel that there are some details of the proof that are not "true inputs" for me.

I want to say that my deciding what I would do if I saw a proof is "earlier" than the proof according to some generalized notion of causality, or earlier in "logical time." I want to say that the only way to make the agent-simulates-predictor set-up make sense is to have the full proof itself not be a true input for me. I think that Cartesian frames is a step towards making continuous the notion of inputs and outputs, and so could help our thinking around this problem.

[-]Scott Garrabrant4yΩ370

Subagents

I think that our current ability to talk about agents contained within other agents is pretty limited, and Cartesian frames is a significant step forward on that. It would not surprise me if this could help with fixing our ontology around subsystem alignment. It could also help with our ontology around reasoning about committees, subcommittees, and members.

[-]Scott Garrabrant4yΩ360

Generalizing observability

Observables can clearly be extended to infinite partitions, and maybe further to a sigma algebra or something similar. One might want to also think of and $E$ as sigma algebras.

Observables can also be extended to talk about separating two subsets of $W$ , rather than separating a subset of $W$ from its complement. One could also talk about observables that don't allow for arbitrary functions from the observed to $A$ , but instead allow for some restricted class such as continuous or Kakutani functions.

Such restricted classes might make more sense when using this more general notion of observables, or it might be possible to entirely construct these classes from this notion of observables.

This could allow the theory to encompass game theory, since you could have two agents which choose a probabilistic strategy, while knowing the probabilistic strategy chosen by the other player.

Frames that are partitions into rectangles

I think that there might be significantly more that can be said about Cartesian frames that are a "partition into rectangles" than can be said about Cartesian frames in general.

By a "partition into rectangles," I mean a Cartesian frame such that if $a_{0} \cdot e_{0} = a_{1} \cdot e_{1}$ , then $a_{0} \cdot e_{0} = a_{0} \cdot e_{1}$ . In particular, this assumption is saying something to the effect of "the level of description of this world is refined enough to play nicely with the factorization into $A$ and $E$ ."

[-]DanielFilan4yΩ230

Yosef can observe for the purpose of deciding the first digit, but can't observe $W_{2}$ for the purpose of deciding the third digit.

Am I missing something, or should this be the other way around? Intuitively, I'd think that it makes sense that Yosef can observe the second digit when choosing the third, but not when choosing the first.

[-]Scott Garrabrant4yΩ230

Fixed, thanks.

LESSWRONG
LW