The mathematical object (but not the philosophical interpretation) of a Cartesian Frame is studied under the name "Chu space."
(In category theory, Chu spaces are usually studied in the special case of W=2. To learn more about Chu spaces, see Vaughan Pratt's guide to papers and nLab's page on the Chu construction.)
In this post and the next one, I'll mostly be discussing standard facts about Chu spaces. I'll also discuss how to interpret the standard definitions as statements about agency.
Chu spaces form a category as a special case of the Chu construction. You may notice a strong similarity between operations on Cartesian frames and operations in linear logic, coming from the fact that the Chu construction is also intimately related to linear logic, and is used in the semantics for linear logic.
Linear logic has a large number of operations—additive conjunction (&), multiplicative conjunction (⊗), and so on—and many of those symbols will turn out to have interpretations for Cartesian frames, and they're actually going to be meaningful interpretations in this setting. For that reason, we'll be stealing much of our notation from linear logic, though this sequence won't assume familiarity with linear logic.
Definition:Chu(W) is the category whose objects are Cartesian frames over W, whose morphisms from C=(A,E,⋅) to D=(B,F,⋆) are pairs of functions (g:A→B,h:F→E), such that a⋅h(f)=g(a)⋆f for all a∈A and f∈F, and whose composition of morphisms is given by (g1,h1)∘(g0,h0)=(g1∘g0,h0∘h1).
The composition of two morphisms C0→C1 and C1→C2, then, sends the agent of C0 to C2 and sends the environment of C2 to C0.
Claim:Chu(W) is a category.
Proof: It suffices to show that composition is well-defined and associative and there exist identity morphisms. For identity, (idA,idE) is clearly an identity on C=(A,E,⋅), where idX is the identity map from X to itself.
The composition (g1,h1)∘(g0,h0) of (g0,h0):C0→C1 with (g1,h1):C1→C2 is (g1∘g0,h0∘h1):C0→C2. To verify that this is a morphism, we just need that a0⋅0h0(h1(e2))=g1(g0(a0))⋅2e2 for all a0∈Agent(C0), and e2∈Env(C2), where ⋅i=Eval(Ci). Indeed,
a0⋅0h0(h1(e2))=g0(a0)⋅1h1(e2)=g1(g0(a0))⋅2e2,
since each component is a morphism.
Associativity of the composition follows from the fact that it is just a pair of compositions of functions on sets, and composition is associative for sets. □
1. What Do These Morphisms Represent?
1.1. Morphisms as Interfaces
A Cartesian frame is a first-person perspective. The agent A finds itself in a certain situation or game, where it expects to encounter an environment E. The morphisms in Chu(W) allow the agent of one Cartesian frame to play the game of another Cartesian frame.
We can think of the morphisms from C=(A,E,⋅) to D=(B,F,⋆) as ways of fitting the agent of C into the environment of D. Indeed, for every morphism (g,h):C→D, one can construct a Cartesian frame (A,F,⋄), whose agent matches C's agent, and whose environment matches D's environment, with ⋄ given by a⋄f=a⋅h(f)=g(a)⋆f. (The morphism from C to D can actually be viewed as the composition of (idA,h):C→(A,F,⋄) with (g,idF):(A,F,⋄)→D.)
Two random large Cartesian frames will typically have no morphisms between them. When there is a morphism, the morphism functions as an interface that allows the agent A to interact with some other environment F. However, we aren't just randomly throwing A and F together. A's interaction with F factors through the function h:F→E, so A can in a sense still be thought of as using an interface where it interacts with E. It just interacts with an e∈E that is of the form h(f) for some f∈F. But this is happening simultaneously with the dual view in which F can be thought of as still interacting with B!
Since a Cartesian frame is a first-person perspective, you can imagine A having the internal experience of interacting with E, while F has the "experience" of interacting with B. The morphism's job is to be the translation interface that allows this A and F to interact with each other, while preserving their respective internal experiences in such a way that they feel like they're interacting with E and B respectively. A gets to play B's game, while still thinking that it is playing its own game.
1.2. Morphisms as Differences in Agents' Strength
We can also interpret the existence of a morphism from C=(A,E,⋅) to D=(B,F,⋆) as saying something like "D's agent is at least as strong as C's agent."
This is easiest to see for a morphism (g,h):C→D where g and h are both injective. In this case, it is as though A⊆B and F⊆E, so D's agent has more options to choose between and fewer environments it has to worry about.
Since some of the environments in E∖F might have been good for the agent, the agent isn't necessarily strictly better off in D; but in a zero-sum game, the agent will indeed be strictly better off. I think this justifies saying that C's agent is in some sense weaker than D's agent.
If g or h is not injective, we could duplicate elements of B and E to make it injective, so the interpretation "C's agent is no stronger than D's agent" is reasonable in that case as well. In particular, the existence of a morphism from C to D implies that Ensure(C)⊆Ensure(D) (and thus Ctrl(C)⊆Ctrl(D)).
However, the existence of a morphism is stronger than just saying the set of ensurables is larger. The morphism from C to D can be thought of as telling D's agent how to strategy-steal from C's agent, and thus do anything that C's agent can do.
We now provide a few examples to illustrate morphisms between Cartesian frames. (If you're ready to forge ahead, skip to §2 instead.)
1.3. Simple Examples of Morphisms
Imagine a student who is deciding between staying up late studying for a test (as) or ignoring the test (ai). We will represent the student with a Cartesian frame over letter grades, where W={A+,A,A-,B+,B,B-,C+,C,C-,D+,D,D-,F}.
If the student doesn't study, her final grade is always a C+, represented by the possible world C+. If she does study, she may oversleep and get a bad grade (represented by the environment selecting eo and putting her in D-). If she studies and doesn't oversleep, she is uncertain about whether her teacher is typical (et, resulting in A-) or unusually demanding (ed, resulting in B+). We represent this with the frame
CT=etedeoasai(A-B+D-C+C+C+).
Let us also suppose that yesterday, the student had the extra option of copying another student's answers on test day to get a sure A+. However, she decided not to cheat. We represent the student's options yesterday, prior to precommitting, with the frame
CY=ftfdfobsbibc⎛⎜⎝A-B+D-C+C+C+A+A+A+⎞⎟⎠.
There is a morphism from the student's frame today to her frame yesterday, representing the fact that Agent(CT) can be plugged into Agent(CY)'s game, or that the student was "stronger" yesterday than she is today.
Let us also suppose that the student's teacher is in fact demanding. If the student today knew this fact, we would instead represent her perspective with the frame
CT′=e′de′oa′sa′i(B+D-C+C+).
Here, we have a morphism from the student today (CT) to her perspective if she had an additional promise from the environment (CT′). This represents the fact that CT′ can strategy-steal from a version of herself who knows strictly less.
Given two Cartesian frames C0 and C1, I am not aware of an efficient universal method for determining whether there exists a morphism from C0 to C1. Indeed, I conjecture that this problem might be NP-complete. In the above cases, however, we can see that there exist morphisms from CT to the other two frames by observing that CT is effectively CY with a row deleted, or CT′ with a column added.
While Agent(CY) and Agent(CT′) are both stronger than Agent(CT), we have no morphisms between CY and CT′; their options are different enough that we can't compare their strength directly.
1.4. Examples of Morphisms Going Both Ways
Every Cartesian frame has an identity morphism pointing to itself; and as we'll discuss in the next post, whenever two Cartesian frames C and D are equivalent (in a sense to be defined), there will be a morphism going from C to D and another going from D to C. But not all pairs of Cartesian frames with morphisms going both ways are equivalent. Consider, for example,
C1=e0e1a0a1(w0w0w0w1) and D1=f0b0(w0).
In C1=(A,E,⋅), the default outcome is w0, but the agent and environment can handshake to produce w1. In D1=(B,F,⋆), there are no choices, and there's only one possible world, w0.
It turns out that there is a morphism (g,h):C1→D1, where g is the constant function b0 and h is the constant function e0; and there is a second morphism (g′,h′):D1→C1, where g′ is the constant function a0 and h′ is the constant function f0. We can interpret these like so:
There is a morphism C1→D1 because D1 is effectively C1 plus a promise from the environment "I'll choose e0." The agent in D1 is "stronger" in the sense that it has fewer possible environments to worry about. There is less the environment can do to interfere with the agent's choices.
There is a morphism D1→C1 because C1's agent has strictly more options than D1's agent: moving from D1 to C1 lets you retain the option to produce w0 if you want, but it also lets you try for w1.
So we can view the smaller matrix as the larger matrix plus a promise from the environment "I'll choose e0," or we can view it as the larger matrix plus a commitment from the agent "I'll choose a0."
This example demonstrates that my intuitive statement "wherever there's a morphism from C to D, D is at least as strong as C" conflates two different notions of "stronger." These notions often go together, but come apart in situations such as the handshake example. Like the hypothetical student in CT′, the agent of D1 is "stronger" in the sense that the environment can't do as much to get in the way. But like the not-yet-precommitted student in CY, the agent of C1 is "stronger" in the sense that it has more options.
2. Self-Duality
A key property of Chu(W) is that it is self-dual.
Definition: Let −∗:Chu(W)→Chu(W)op be the functor given by (A,E,⋅)∗=(E,A,⋆), where e⋆a=a⋅e, and (g,h)∗=(h,g).
The more standard notation for dual in linear logic would be −⊥, but this is horrible notation.1
Claim:−∗ is an isomorphism between Chu(W) and Chu(W)op.
Proof: First, we show −∗ is a functor. The objects in Chu(W)op are the same as in Chu(W), the morphisms from D to C in Chu(W)op are the morphisms from C to D in Chu(W), and composition is the same, but with the order reversed. −∗ clearly preserves identity morphisms. To show that −∗ preserves composition, we have
To see that it is an isomorphism, we need a left and right inverse. We will abuse notation and also write −∗ for the functor from Chu(W)op to Chu(W) given by (E,A,⋆)∗=(A,E,⋅), where a⋅e=e⋆a, and (h,g)∗=(g,h). Clearly, we have −∗:Chu(W)→Chu(W)op and −∗:Chu(W)op→Chu(W) composing to the identity in both orders, so −∗ is an isomorphism. □
Going back to our visualization of Cartesian frames as matrices, −∗ just takes the transpose of the matrix, swapping agent with environment. "Chu(W) is self-dual" is another way of saying that transposing a Cartesian frame always gives you another Cartesian frame.
Philosophically, depending on our interpretation, this may be doing something weird. We talk about possible agents and possible environments, but we may mean something different by "possible" in those two cases.
Since we are imagining events from the point of view of the agents, "possible agents" is referring to all of the ways the agent can choose to be by exercising its "free will." We could think of "possible environments" similarly, or we could think of possible environments as representing the agent's uncertainty.
Under the view where possible environments represent uncertainty, −∗ is pointing to an interesting duality that swaps choices with uncertainty, swaps the "could" of "I could do X" with the "could" of "The world could have property Y," and (if we add probability to the mix) swaps mixed strategies with probabilistic uncertainty. "What will I do?" becomes "What game am I playing?", or "What is the world-as-a-function-of-my-action like?"
I will introduce many operations on Cartesian frames, so it will help to highlight even the basic properties as I go. Here, I'll note:
The mathematical object (but not the philosophical interpretation) of a Cartesian Frame is studied under the name "Chu space."
(In category theory, Chu spaces are usually studied in the special case of W=2. To learn more about Chu spaces, see Vaughan Pratt's guide to papers and nLab's page on the Chu construction.)
In this post and the next one, I'll mostly be discussing standard facts about Chu spaces. I'll also discuss how to interpret the standard definitions as statements about agency.
Chu spaces form a category as a special case of the Chu construction. You may notice a strong similarity between operations on Cartesian frames and operations in linear logic, coming from the fact that the Chu construction is also intimately related to linear logic, and is used in the semantics for linear logic.
Linear logic has a large number of operations—additive conjunction (&), multiplicative conjunction (⊗), and so on—and many of those symbols will turn out to have interpretations for Cartesian frames, and they're actually going to be meaningful interpretations in this setting. For that reason, we'll be stealing much of our notation from linear logic, though this sequence won't assume familiarity with linear logic.
Definition: Chu(W) is the category whose objects are Cartesian frames over W, whose morphisms from C=(A,E,⋅) to D=(B,F,⋆) are pairs of functions (g:A→B,h:F→E), such that a⋅h(f)=g(a)⋆f for all a∈A and f∈F, and whose composition of morphisms is given by (g1,h1)∘(g0,h0)=(g1∘g0,h0∘h1).
The composition of two morphisms C0→C1 and C1→C2, then, sends the agent of C0 to C2 and sends the environment of C2 to C0.
Claim: Chu(W) is a category.
Proof: It suffices to show that composition is well-defined and associative and there exist identity morphisms. For identity, (idA,idE) is clearly an identity on C=(A,E,⋅), where idX is the identity map from X to itself.
The composition (g1,h1)∘(g0,h0) of (g0,h0):C0→C1 with (g1,h1):C1→C2 is (g1∘g0,h0∘h1):C0→C2. To verify that this is a morphism, we just need that a0⋅0h0(h1(e2))=g1(g0(a0))⋅2e2 for all a0∈Agent(C0), and e2∈Env(C2), where ⋅i=Eval(Ci). Indeed,
a0⋅0h0(h1(e2))=g0(a0)⋅1h1(e2)=g1(g0(a0))⋅2e2,since each component is a morphism.
Associativity of the composition follows from the fact that it is just a pair of compositions of functions on sets, and composition is associative for sets. □
1. What Do These Morphisms Represent?
1.1. Morphisms as Interfaces
A Cartesian frame is a first-person perspective. The agent A finds itself in a certain situation or game, where it expects to encounter an environment E. The morphisms in Chu(W) allow the agent of one Cartesian frame to play the game of another Cartesian frame.
We can think of the morphisms from C=(A,E,⋅) to D=(B,F,⋆) as ways of fitting the agent of C into the environment of D. Indeed, for every morphism (g,h):C→D, one can construct a Cartesian frame (A,F,⋄), whose agent matches C's agent, and whose environment matches D's environment, with ⋄ given by a⋄f=a⋅h(f)=g(a)⋆f. (The morphism from C to D can actually be viewed as the composition of (idA,h):C→(A,F,⋄) with (g,idF):(A,F,⋄)→D.)
Two random large Cartesian frames will typically have no morphisms between them. When there is a morphism, the morphism functions as an interface that allows the agent A to interact with some other environment F. However, we aren't just randomly throwing A and F together. A's interaction with F factors through the function h:F→E, so A can in a sense still be thought of as using an interface where it interacts with E. It just interacts with an e∈E that is of the form h(f) for some f∈F. But this is happening simultaneously with the dual view in which F can be thought of as still interacting with B!
Since a Cartesian frame is a first-person perspective, you can imagine A having the internal experience of interacting with E, while F has the "experience" of interacting with B. The morphism's job is to be the translation interface that allows this A and F to interact with each other, while preserving their respective internal experiences in such a way that they feel like they're interacting with E and B respectively. A gets to play B's game, while still thinking that it is playing its own game.
1.2. Morphisms as Differences in Agents' Strength
We can also interpret the existence of a morphism from C=(A,E,⋅) to D=(B,F,⋆) as saying something like "D's agent is at least as strong as C's agent."
This is easiest to see for a morphism (g,h):C→D where g and h are both injective. In this case, it is as though A⊆B and F⊆E, so D's agent has more options to choose between and fewer environments it has to worry about.
Since some of the environments in E∖F might have been good for the agent, the agent isn't necessarily strictly better off in D; but in a zero-sum game, the agent will indeed be strictly better off. I think this justifies saying that C's agent is in some sense weaker than D's agent.
If g or h is not injective, we could duplicate elements of B and E to make it injective, so the interpretation "C's agent is no stronger than D's agent" is reasonable in that case as well. In particular, the existence of a morphism from C to D implies that Ensure(C)⊆Ensure(D) (and thus Ctrl(C)⊆Ctrl(D)).
However, the existence of a morphism is stronger than just saying the set of ensurables is larger. The morphism from C to D can be thought of as telling D's agent how to strategy-steal from C's agent, and thus do anything that C's agent can do.
We now provide a few examples to illustrate morphisms between Cartesian frames. (If you're ready to forge ahead, skip to §2 instead.)
1.3. Simple Examples of Morphisms
Imagine a student who is deciding between staying up late studying for a test (as) or ignoring the test (ai). We will represent the student with a Cartesian frame over letter grades, where W={A+,A,A-,B+,B,B-,C+,C,C-,D+,D,D-,F}.
If the student doesn't study, her final grade is always a C+, represented by the possible world C+. If she does study, she may oversleep and get a bad grade (represented by the environment selecting eo and putting her in D-). If she studies and doesn't oversleep, she is uncertain about whether her teacher is typical (et, resulting in A-) or unusually demanding (ed, resulting in B+). We represent this with the frame
CT=et ed eoasai(A-B+D-C+C+C+).
Let us also suppose that yesterday, the student had the extra option of copying another student's answers on test day to get a sure A+. However, she decided not to cheat. We represent the student's options yesterday, prior to precommitting, with the frame
CY=ft fd fobsbibc⎛⎜⎝A-B+D-C+C+C+A+A+A+⎞⎟⎠.
There is a morphism from the student's frame today to her frame yesterday, representing the fact that Agent(CT) can be plugged into Agent(CY)'s game, or that the student was "stronger" yesterday than she is today.
Let us also suppose that the student's teacher is in fact demanding. If the student today knew this fact, we would instead represent her perspective with the frame
CT′=e′d e′oa′sa′i(B+D-C+C+).
Here, we have a morphism from the student today (CT) to her perspective if she had an additional promise from the environment (CT′). This represents the fact that CT′ can strategy-steal from a version of herself who knows strictly less.
Given two Cartesian frames C0 and C1, I am not aware of an efficient universal method for determining whether there exists a morphism from C0 to C1. Indeed, I conjecture that this problem might be NP-complete. In the above cases, however, we can see that there exist morphisms from CT to the other two frames by observing that CT is effectively CY with a row deleted, or CT′ with a column added.
While Agent(CY) and Agent(CT′) are both stronger than Agent(CT), we have no morphisms between CY and CT′; their options are different enough that we can't compare their strength directly.
1.4. Examples of Morphisms Going Both Ways
Every Cartesian frame has an identity morphism pointing to itself; and as we'll discuss in the next post, whenever two Cartesian frames C and D are equivalent (in a sense to be defined), there will be a morphism going from C to D and another going from D to C. But not all pairs of Cartesian frames with morphisms going both ways are equivalent. Consider, for example,
C1=e0e1a0a1(w0w0w0w1) and D1=f0b0(w0).
In C1=(A,E,⋅), the default outcome is w0, but the agent and environment can handshake to produce w1. In D1=(B,F,⋆), there are no choices, and there's only one possible world, w0.
It turns out that there is a morphism (g,h):C1→D1, where g is the constant function b0 and h is the constant function e0; and there is a second morphism (g′,h′):D1→C1, where g′ is the constant function a0 and h′ is the constant function f0. We can interpret these like so:
So we can view the smaller matrix as the larger matrix plus a promise from the environment "I'll choose e0," or we can view it as the larger matrix plus a commitment from the agent "I'll choose a0."
This example demonstrates that my intuitive statement "wherever there's a morphism from C to D, D is at least as strong as C" conflates two different notions of "stronger." These notions often go together, but come apart in situations such as the handshake example. Like the hypothetical student in CT′, the agent of D1 is "stronger" in the sense that the environment can't do as much to get in the way. But like the not-yet-precommitted student in CY, the agent of C1 is "stronger" in the sense that it has more options.
2. Self-Duality
A key property of Chu(W) is that it is self-dual.
Definition: Let −∗:Chu(W)→Chu(W)op be the functor given by (A,E,⋅)∗=(E,A,⋆), where e⋆a=a⋅e, and (g,h)∗=(h,g).
The more standard notation for dual in linear logic would be −⊥, but this is horrible notation.1
Claim: −∗ is an isomorphism between Chu(W) and Chu(W)op.
Proof: First, we show −∗ is a functor. The objects in Chu(W)op are the same as in Chu(W), the morphisms from D to C in Chu(W)op are the morphisms from C to D in Chu(W), and composition is the same, but with the order reversed. −∗ clearly preserves identity morphisms. To show that −∗ preserves composition, we have
(g0,h0)∗∘op(g1,h1)∗=(h1,g1)∘(h0,g0)=(h1∘h0,g0∘g1)=((g0,h0)∘(g1,h1))∗.To see that it is an isomorphism, we need a left and right inverse. We will abuse notation and also write −∗ for the functor from Chu(W)op to Chu(W) given by (E,A,⋆)∗=(A,E,⋅), where a⋅e=e⋆a, and (h,g)∗=(g,h). Clearly, we have −∗:Chu(W)→Chu(W)op and −∗:Chu(W)op→Chu(W) composing to the identity in both orders, so −∗ is an isomorphism. □
Going back to our visualization of Cartesian frames as matrices, −∗ just takes the transpose of the matrix, swapping agent with environment. "Chu(W) is self-dual" is another way of saying that transposing a Cartesian frame always gives you another Cartesian frame.
Philosophically, depending on our interpretation, this may be doing something weird. We talk about possible agents and possible environments, but we may mean something different by "possible" in those two cases.
Since we are imagining events from the point of view of the agents, "possible agents" is referring to all of the ways the agent can choose to be by exercising its "free will." We could think of "possible environments" similarly, or we could think of possible environments as representing the agent's uncertainty.
Under the view where possible environments represent uncertainty, −∗ is pointing to an interesting duality that swaps choices with uncertainty, swaps the "could" of "I could do X" with the "could" of "The world could have property Y," and (if we add probability to the mix) swaps mixed strategies with probabilistic uncertainty. "What will I do?" becomes "What game am I playing?", or "What is the world-as-a-function-of-my-action like?"
I will introduce many operations on Cartesian frames, so it will help to highlight even the basic properties as I go. Here, I'll note:
Claim: For any Cartesian