How path-dependent are human values?

Apr 15, 2022

I think it's important to remind yourself what the latent vs observable variables represent.

The observable variables are, well, the observables: sights, sounds, smells, touches, etc.. Meanwhile, the latents are all concepts we have that aren't directly observable, which yes includes intangibles like friendship, but also includes high-level objects like chairs and apples, or low-level objects like atoms.

One reason to mention this is because it has implications for your graphs. The causal graph would more look like this:

(As the graphs for HMMs tend to look. Arguably the sideways arrows for the xs are not needed, but I put them in anyway.)

Of course, that's not to say that you can't factor the probability distribution as you did, it just seems more accurate to call it something other than causal graph. Maybe inferential graph? (I suppose you could call your original graph as causal graph of people's psychology. But then the hs would only represent people's estimates of their latent variables, which they would claim could differ from their "actual" latent variables, if e.g. they were mistaken.)

Anyway, much more importantly, I think this distinction also answers your question about path-dependence. There'd be lots of path-dependence, and it would not be undesirable to have the path-dependence. For example:

If you observe that you get put into a simulation, but the simulation otherwise appears realistic, then you have path-dependence because you still know that there is an "outside the simulation".
If you observe that an apple in your house, then you update your estimate of h to contain that apple. If you then leave your house, then x no longer shows the apple, but you keep believing in it, even though you wouldn't believe it if your original observation of your house had not found an apple.
If you get told that someone is celebrating their birthday tomorrow, then tomorrow you will believe that they are celebrating their birthday, even if you aren't present there.

[-]Ege Erdil4y10

I think you misunderstood my graph - the way I drew it was intentional, not a mistake. Probably I wasn't explicit enough about how I was splitting the variables and what I do is somewhat different from what johnswentworth does, so let me explain.

Some latent variables could have causal explanatory power, but I'm focusing on ones that don't seem to have any such power because they are the ones human values depend on most strongly. For example, anything to do with qualia is not going to have any causal arrows going from it to what we can observe, but neverthe... (read more)

2tailcalled4y

I'm still not entirely sure how you classify variables as latent vs observed. Could you classify these as "latent", "observed" or "ambiguous" to classify? * The light patterns that hit the photoreceptors in your eyes * The inferrences made output by your visual cortex * A person, in the broad sense including e.g. ems * A human, in the narrow sense of a biological being * An apple * A chair * An atom * A lie * A friendship

3tailcalled4y

Wait I guess I'm dumb, this is explained in the OP.

1Ege Erdil4y

I've edited the post after the fact to clarify what I meant, so I don't think you're dumb (in the sense that I don't think you missed something that was there). I was just not clear enough the first time around.

3tailcalled4y

Ah ok, didn't realize it was edited.

1Ege Erdil4y

I posted this question within less than 20 minutes of the thought occurring to me, so I didn't understand what was going on well enough & couldn't express my thoughts properly as a consequence. Your answer helped clarify my thoughts, so thanks for that!

1Ege Erdil4y

Whether particular variables are latent or not is a property relative to what the "correct model" ends up being. Given our current understanding physics, I'd classify your examples like this: * The light patterns that hit the photoreceptors in your eyes: Observed * The inferrences made output by your visual cortex: Ambiguous * A person, in the broad sense including e.g. ems: Latent * A human, in the narrow sense of a biological being: Latent * An apple: Latent * A chair: Latent * An atom: Ambiguous * A lie: Latent * A friendship: Latent With visual cortex inferences and atoms, I think the distinction is fuzzy enough that you have to specify exactly what you mean. It's important to notice that atoms are "latent" in both chemistry and quantum field theory in the usual sense, but they are causally relevant in chemistry while they probably aren't in quantum field theory, so in the context of my question I'd say atoms are observed in chemistry and latent in QFT. The realization I had while responding to your answer was that I really care about the model that an AGI would learn and not the models that humans use right now, and whether a particular variable is downstream or upstream of observed variables (so, whether they are latent or not in the sense I've been using the word here) depends on what the world model you're using actually is.

tailcalled

Apr 15, 2022

Here's a partial answer to the question:

It seems like one common and useful type of abstraction is aggregating together distinct things with similar effects. Some examples:

"Heat" can be seen as the aggregation of molecular motion in all sorts of directions; because of chaos, the different directions and different molecules don't really matter, and therefore we can usefully just add up all their kinetic energies into a variable called "heat".
A species like "humans" can be seen as the aggregation (though disjunction rather than sum) of many distinct genetic patterns. However, ultimately the genetic patterns are simple enough that they all code for basically the same thing.
A person like me can be seen as the aggregation of my state through my entire life trajectory. (Again unlike heat, this would be disjunction rather than sum.) A major part of why the abstraction of "tailcalled" makes sense is that I am causally somewhat consistent across my life trajectory.

An abstraction that aggregates distinct things with similar effects seems like it has a reasonably good chance to be un-path-dependent. However, it's not quite guaranteed, which you can see by e.g. the third example. While I will have broadly similar effects through my life trajectory, the effects I will have will change over time, and the way they change may depend on what happens to me. For instance if my brain got destructively scanned and uploaded while my body was left behind, then my effects would be "split", with my psychology continuing into the upload while my appearance stayed with my dead body (until it decayed).

[-]Ege Erdil4y10

"Heat" can be seen as the aggregation of molecular motion in all sorts of directions; because of chaos, the different directions and different molecules don't really matter, and therefore we can usefully just add up all their kinetic energies into a variable called "heat".

Nitpick: this is not strictly correct. This would be the internal energy of a thermodynamic system, but "heat" in thermodynamics refers to energy that's exchanged between systems, not energy that's in a system.

Aside from the nitpick, however, point taken.

An abstraction that aggregate

... (read more)

3tailcalled4y

Ah, I think this is a fundamentally different kind of abstraction than the "aggregating together distinct things with similar effects" type of abstraction I mentioned. To distinguish, I suggest we use the name "causal abstraction" for the kind I mentioned, and the name "protocol abstraction" (or something else) for this concept. So: * Causal abstraction: aggregating together distinct phenomena that have similar causal relations into a lumpy concept that can be modelled as having the same causal relations as its constituents * Protocol abstraction: extending your ontology with new "epiphenomenal" variables that follow certain made-up rules (primarily for the use in social coordination, so that there is a ground truth even with deception? - but can also be used on an individual level, in values) I feel like personal identity has both elements of causal abstraction and of protocol abstraction. E.g. social relationships like debts seem to be strongly tied to protocol abstraction, but there's also lots of social behavior that only relies on causal abstraction. I agree. Coming up with a normative theory of agency in the case of protocol abstraction actually sounds like a fairly important task. I have some ideas about how to address causal abstraction, but I haven't really thought much about protocol abstraction before.

1Ege Erdil4y

I think your distinction between causal and protocol abstractions makes sense and it's related to my distinction between causally relevant vs causally irrelevant latent variables. It's not quite the same, because abstractions which are rendered causally irrelevant in some world model can still be causal in the sense of aggregating together a bunch of things with similar causal properties. I agree. Can you clarify what you mean by a "normative theory of agency"? I don't think I've ever seen this phrase before.

2tailcalled4y

What I mean is stuff like decision theory/selection theorems/rationality; studies of what kinds of ways agents normatively should act. Usually such theories do not take abstractions into account. I have some ideas for how to take causal abstractions into account, but I don't think I've seen protocol abstractions investigated much. In a sense, they could technically be handled by just having utility functions over universe trajectories rather than universe states, but there are some things about this that seem unnatural (e.g. for the purpose of Alex Turner's power-seeking theorems, utility functions over trajectories may be extraordinarily power-seeking, and so if we could find a narrower class of utility functions, that would be useful).

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

14

[ Question ]

How path-dependent are human values?

14

14

2 Answers sorted by
top scoring

Apr 15, 2022

Apr 15, 2022

14

[ Question ]

How path-dependent are human values?

14

14

2 Answers sorted by top scoring

Apr 15, 2022

Apr 15, 2022

2 Answers sorted by
top scoring