«Boundaries», Part 3a: Defining boundaries as directed Markov blankets

Overall, this is my favorite thing I have read on lesswrong in the last year.

Agreements:

I agree very strongly with most of this post, both in the way you are thinking about boundaries, and in the scale and scope of applications of boundaries to important problems.

In particular on the applications, I think that boundaries as you are defining them are crucial to developing decision theory and bargaining theory (and indeed are already helpful for thinking about bargaining and fairness in real life), but I also agree with your other potential applications.

I particular on the theory, I agree that the boundary of an agent (or agent-like-thing) should be thought of as that which screens off the viscera of the agent from its environment. I agree that the agent should think of decisions as intervening on its boundary. I agree that the boundary (as the agent sees it) only partially does the screening-off thing. I agree that the agent should in part be focused on actively maintaining its boundary, as this is crucial to its integrity as an agent.

I believe the above mostly independently of this post, but the place where I think this post is doing better than my default way of thinking is in the directionality of the arrows. I have been thinking about this in a pretty symmetric way: the notion of B screening off V from E is symmetric in swapping V and E. I was aware this was a mistake (because logical mutual information is not symmetric), but this post makes it clear to me how important that mistake was. Thanks!

Disagreements:

Philosophical nitpick: I think the boundary should be thought of as part of the agent/organism and simultaneously as part of the environment. Indeed, the screening off property can be thought of as an informational (as opposed to physical) way of saying that the boundary is the intersection of the agent and environment.

I think the boundary factorization into active and passive is wrong. I am not sure what is right here. My default proposal is to think of the active as the minimal part that contains all information flow from the viscera, and the perceptive as the minimal part that contains all information flow from the environment. By definition, these cover the boundary, but they might intersect. (An alternative proposal is to define the active as the part that the agent thinks of its interventions as living, and the perceptive as where the agent thinks of its perceptions as living, and now they don't cover the boundary)

In both of the above, I am pushing for the claim that we are not yet in the part of the theory where we need to break agent-environment symmetry in the theory. (Although we do need to track the directions of information flow separately!)

I think that thinking of there as being physical nodes is wrong. Unfortunately Finite Factored Sets is not yet able to handle directionality of information flow, so I see how it is the only way you can express an important part of the model. We need to fix that, so we can think of viscera, environments, boundaries, etc. as features of the world rather than sets of nodes.

I also think that the time-embedded picture is wrong. I often complain about models that have a thing persisting across linear time like this, but I think it is especially important here. As far as I can tell, time is mostly about screening-off, and boundaries are also mostly about screening-off, so I think that this is a domain in which it is especially important to get time right.

[-]Andrew_Critch3yΩ450

Thanks, Scott!

I think the boundary factorization into active and passive is wrong.

Are you sure? The informal description I gave for A and P allow for the active boundary to be a bit passive and the passive boundary to be a bit active. From the post:

the active boundary, A — the features or parts of the boundary primarily controlled by the viscera, interpretable as "actions" of the system— and the passive boundary, P — the features or parts of the boundary primarily controlled by the environment, interpretable as "perceptions" of the system.

There's a question of how to factor B into a zillion fine-grained features in the first place, but given such a factorization, I think we can define A and P fairly straightforwardly using Shapley value to decide how much V versus E is controlling each feature, and then A and P won't overlap and will cover everything.

[-]Scott Garrabrant3yΩ451

Oh yeah, oops, that is what it says. Wasn’t careful, and was responding to reading an old draft. I agree that the post is already saying roughly what I want there. Instead, I should have said that the B=AxP bijection is especially unrealistic. Sorry.

[-]Andrew_Critch3yΩ120

Why is it unrealistic? Do you actually mean it's unrealistic that the set I've defined as "A" will be interpretable at "actions" in the usual coarse-grained sense? If so I think that's a topic for another post when I get into talking about the coarsened variables ...

[-]Scott Garrabrant3yΩ220

I mean, the definition is a little vague. If your meaning is something like "It goes in A if it is more accurately described as controlled by the viscera, and it goes in P if it is more accurately described as controlled by the environment," then I guess you can get a bijection by definition, but it is not obvious these are natural categories. I think there will be parts of the boundary that feel like they are controlled by both or neither, depending on how strictly you mean "controlled by."

[-]Scott Garrabrant3yΩ221

Forcing the AxP bijection is an interesting idea, but it feels a little too approximate to my taste.

[-]Scott Garrabrant3yΩ330

To be clear, everywhere I say “is wrong,” I mean I wish the model is slightly different, not that anything is actually is mistaken. In most cases, I don’t really have much of an idea how to actually implement my recommendation.

[-]Scott Garrabrant3yΩ220

More of my thoughts here.

[-]Alex Flint3yΩ6170

I have the sense that boundaries are so effective as a coordination mechanism that we have come to believe that they are an end in themselves. To me it seems that the over-use of boundaries leads to loneliness that eventually obviates all the goodness of the successful coordination. It's as if we discovered that cars were a great way to get from place to place, but then we got so used to driving in cars that we just never got out of them, and so kind of lost all the value of being able to get from place to place. It was because the cars were in fact so effective as transportation devices that started to emphasize them so heavily in our lives.

You say "real-world living systems sometimes do funky things like opening up their boundaries" but that's like saying "real-world humans sometimes do funky things like getting out of their cars" -- we shouldn't begin with the view that boundaries are the default thing and then consider some "extreme cases" where people open up their boundaries.

Some specific cases to consider for a theory of boundaries-as-arising-from-cordination:

A baby grows inside a mother and is born, gradually establishing boundaries. You might say the baby has zero boundaries just prior to conception and full boundaries at age 10? age 15? age 20? How do you make appropriate sense of the coming into existence of boundaries over time?
A human dies, gradually losing agency over years. What is the appropriate way to view the attenuation-to-zero of this person's boundaries?
During an adult human life, a person finds themselves in situations where it is extremely difficult, for practical reasons, to establish certain boundaries. For example, two people locked in a tiny closet together are unable to establish, perhaps, any boundary around personal space. Perhaps it was a mistake to get locked in there in the first place, but now that they are in there, they need a way to coordinate without being able to establish certain boundaries.

Overall, I would ask "what is an effective set of boundaries given our situation and our goal?" rather than "how can we coordinate on our goals given our situation and our apriori fixed boundaries?"

[-]Chris Lakin2y10

Oooo I like this comment, especially the first two examples

also,

For example, two people locked in a tiny closet together are unable to establish, perhaps, any boundary around personal space.

Personally I wouldn't call this a «boundary». I don't consider boundaries to be things that are "set" or "established"

[-]Alexander Gietelink Oldenziel3y*72

Woah, Andrew, this is fantastic work! I am seriously excited about this direction.! I liked your previous posts on boundaries very much too, but I had no idea your thoughts on boundaries were this technically refined - and that they tie in so beautifully with Markov blankets!

re: Friston.

Friston particular style that could justifiably be called obscurantist. His writing is extremely verbose, often fails to define key terms, and very nontrivial equations are often posited without derivation or citation. After spending considerable effort trying to understand the rather lofty prose I would often realize the key ideas were indeed quite interesting - but could be explained far better. There are some big claims in his work but insufficiently many details are worked out to ascertain to what degree this claims can be justified. A group of people have taken up the thankless task of Friston exegesis but unfortunately have not really developed a sufficiently clear & crisp account. The book you cite is an example of this literature. Long story short I am excited that your recent work might finally resolve this cloud of confusion!

In the context of this post, the following paper is particularly relevant:

https://royalsocietypublishing.org/doi/10.1098/rsif.2013.0475

Apart from being very unclear, almost deliberately so, apparently it also has actual serious errors.

Nevertheless, inside there is a definition of a decomposition of a Markov blanket & boundary into action, observation, internal and external states that seems extra similar to your story [but I think you've gone much farther already!]. I talked a little about Friston's decomposition of Markov Boundaries recently for the MetaUni Abstraction seminar, see here for the relevant slide.

[-]winstonne2y10

If anyone is interested in joining a learning community around the ideas of active inference, the mission of https://www.activeinference.org/ is to educate the community around these topics. There's a study group around the 2022 active inference textbook by Parr, Friston, and Pezzulo. I'm in the 5th cohort and it's been very useful for me.

[-]Alex Flint3yΩ241

The post begins with "this is part 3b of..." but I think you meant to say 3a.

[-]Andrew_Critch3yΩ121

Thanks, fixed!

[-]Alex_Altair3y30

Here I'm going to log some things I notice while reading, mostly as a way to help check my understanding, but also to help find possible errata.

In Definition part (a), you've got a whole lot of W-type symbols, and I'm not 100% sure I follow each of their uses. You use a couple times which is legit, but it looks a lot like $w$ , so maybe it could be replaced with $N$ ?

See this comment for two errata with the different w's.

$F u t (w)$ denotes, for any world state $w \in W$ , the future of the Dirac (100% concentrated) distribution on the world state $w \in W$ .

Maybe you could just say, $F u t (w)$ is shorthand for $F u t (T_{W} (w))$ , since $T_{W}$ will map $w$ to the right thing of type $Δ W$ . Then you can avoid bringing in the somewhat exotic Dirac delta function. Of course, that now means that $w$ itself is not the first item in the resulting sequence. I'm not sure if you need that to be the case for later. But also, everything above is ambiguous about whether the argument to $F u t$ was in the sequence anyway.

The character ⫫ doesn't render for me. (I could figure out what it was by pasting the unicode into google, but maybe it could be done with LaTeX instead?)

To formalize this, I want a collection of state spaces and maps, like so:

Is the following bulleted list missing an entry for $E$ ?

Each of these factorizations are assumed to be bijective, in the sense of accounting for everything that matters and not double-counting anything

I was wondering if you were going to say something like $W = V \times B \times E$ and $B = A \times P$ . It sounds like that's almost right, except that you allow the factors to pass through arbitrary functions first, as long as they're bijective. Is that right?

We say $r_{θ}$ is a good fit

You bring back $r_{θ}$ here, but I don't see the $θ$ doing anything yet. Might be better not to introduce it until later, to free up a bit of the reader's working memory.

See this comment for a broken link.

[-]rajashree3y30

Some notational bugs in the bulleted list in "Definition part (a): "part of the world" that come up in trying to communicate about this with someone:

In the first bullet, should probably be $w \in W$ both for consistency with later bullets, and so that it does not seem like $T_{W}$ is subscript-indexed over some $W \in W$ . (I'm inferring that subscript W is just part of the name of $T_{W}$ .)

In bullet 6 about $Fut (w)$ , the sentence should end with $w \in W$ , not $w \in W$ .

The last three bullet points should maybe use $w_{t}$ instead of $W_{t}$ , or else maybe all the lowercase $w$ s should be made capital.

[-]Alex_Altair3y20

Chapter 3 of Parr (2022)

My browser thinks this is an invalid link and won't let me open it.

[-]scottviteri2y10

If I try to use this framework to express two agents communicating, I get an image with a V1, A1, P1, V2, A2, and P2, with cross arrows from A1 to P2 and A2 to P1. This admits many ways to get a roundtrip message. We could have A1 -> P2 -> A2 -> P2 directly, or A1 -> P2 -> V2 -> A2 -> P1, or many cycles among P2, V2, and A2 before P1 receives a message. But in none of these could I hope to get a response in one time step the way I would if both agents simultaneously took an action, and then simultaneously read from their inputs and their current state to get their next state. So I have this feeling that pi : S -> Action and update : Observation x S -> S already bake in this active/passive distinction by virtue of the type signature, and this framing is maybe just taking away the computational teeth/specificity. And I can write the same infiltration and exfiltration formulas by substituting S_t for V_t, Obs_t for P_t, Action_t for A_t, and S_env_t for E_t.

[-]scottviteri2y10

I take back the part about pi and update determining the causal structure, because many causal diagrams are constant with the same poly diagram

[-]Chris Lakin2y*10

I've written a conceptual distillation of the Markov blanket aspect of this post: Formalizing «Boundaries» with Markov blankets.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

90

«Boundaries», Part 3a: Defining boundaries as directed Markov blankets

90

Ω 44

90

Ω 44

Motivation

Boundaries, defined

Definition part (a): "part of the world"

Definition parts (b) & (c): the active boundary, passive boundary, and viscera

Definition part (d): "making decisions"

Discussion

Non-violent boundary-crossings

Respect for boundaries as non-arbitrary coordination norms

Comparison to related work

Cartesian frames

Active inference

Functional decision theory (FDT)

Markov blankets

Recap

Reminder to vote