Troubles With CEV Part1 - CEV Sequence

[-]Zetetic14y10

There is little in common between Eliezer, me and Al Qaeda terrorists, and most of it is in the so-called reptilian brain. We may end up with a set of goals and desires that are nothing more than “Eat Survive Reproduce,” which would qualify as a major loss in the scheme of things.

I think you may be committing the fundamental attribution error. It's my understanding that Al Qaeda terrorists are often people who were in a set of circumstances that made them highly susceptible to propaganda - often illiterate, living in poverty and with few, if any, prospects for advancement. It is easy to manipulate the ignorant and disenfranchised. If they knew more, saw the possibilities and understood more about the world, I would be surprised if they would choose a path that diverges so greatly from your own that CEV would have to resort to looking at the reptilian brain.

In this specific case, what ends up dominating CEV is what evolution wants, not what we want. Instead of creating a dynamic with a chance of creating the landscape of a Nice Place to Live, we end up with some exotic extrapolation of simple evolutionary drives.

"What evolution wants" doesn't seem like a clear concept - at least I'm having trouble making concrete sense of it. I think that you're conflating "evolution" with "more ancient drives" - the described extrapolation is an extrapolation with respect to evolutionarily ancient drives.

In particular, you seem to be suggesting that a CEV including only humans will coincide with a CEV including all vertebrates possessing a reptilian brain, on the basis that our current goals seem wildly incompatible. However, as I understand it, CEV asks what we would want if we "knew more, grew up further together" etc.

[-]Rhwawn14y60

It's my understanding that Al Qaeda terrorists are often people who were in a set of circumstances that made them highly susceptible to propaganda - often illiterate, living in poverty and with few, if any, prospects for advancement. It is easy to manipulate the ignorant and disenfranchised.

No, that is completely wrong: the correlations run the opposite way. Terrorists tend to be better educated and wealthier. Bin Laden is the most extreme possible example of that - he was a multimillionaire son of a billionaire!

[-]diegocaleiro14y-20

If they knew more, saw the possibilities and understood more about the world I would be surprised if they would choose a path that diverges so greatly with your own

This is not so simple to assert. You have to think of the intensity of their belief in the words of Allah. Their fundamental worldview is so different from ours that there may be nothing humane left when we try to combine them.

I think that you're conflating "evolution" with "more ancient drives"

In this specific case I was using this figure of speech, yes. I meant that we would be extrapolating drives that matter for evolution (our ancient drives) but don't really matter to us, not in the sense of Want to Want described in 4c.

[-]Zetetic14y00

This is not so simple to assert. You have to think of the intensity of their belief in the words of Allah. Their fundamental worldview is so different from ours that there may be nothing humane left when we try to combine them.

CAVEAT: I'm using CEV as I understand it, not necessarily as it was intended, as I'm not sure the notion is sufficiently precise for me to accurately parse all of the intended meaning. Bearing that in mind:

If CEV produces a plan or AI to be implemented, I would expect it to be sufficiently powerful that it would entail changing the worldviews of many people during the course of implementation. My very basic template would be that of Asimov's The Evitable Conflict - the manipulations would be subtle, and we would be unlikely to read their exact outcomes at a given time X without implementing them (this would be dangerous, as it means you can't "peek ahead" at the future you cause), though we could still prove that at the end we will be left with a net gain in utility. The Asimov example is somewhat less complex, and does not seek to create the best possible future, only a fairly good, stable one, but this basic notion I am borrowing is relevant to CEV.

The drives behind the conviction of the suicide bomber are still composed of human drives, evolutionary artifacts that have met with a certain set of circumstances. The Al Qaeda example is salient today because the ideology is among the most uncontroversially damaging ideologies we can cite. However, I doubt that any ideology or belief system held by any human today is ideal. The CEV should search for ways of redirecting human thought and action - this is necessary for anything that is meant to have global causal control. The CEV does not reconcile current beliefs and ideologies; it seeks to redirect the course of human events to bring about new, more rational, more efficient and healthier ideologies that will be compatible, if this can be done.

If there exists some method for augmenting our current beliefs and ideologies to become more rational, more coherent and more conducive to positive change, then the CEV should find it. Such a method would allow for much more utility than the failure mode you describe, and said failure mode should only occur when such a method is intractable.

In this specific case I was using this figure of speech, yes. I meant that we would be extrapolating drives that matter for evolution (our ancient drives) but don't really matter to us, not in the sense of Want to Want described in 4c.

My point is that, in general, our drives are a product of evolutionary drives, and are augmented only by context. If the context changes, those drives change as well, but both the old set and the new set are composed of evolutionary drives. CEV changes those higher-level drives by controlling the context in sufficiently clever ways.

CEV should probably be able to look at how an individual will develop in different contexts and compute the net utility in each one, and then maximize. The danger here is that we might be directed into a course of events that leads to wireheading.

It occurs to me that the evolutionary failure mode here is probably something like wireheading, though it could be more general. As I see it, CEV is aiming to maximize total utility while minimizing the net negative utility for as many individuals as possible. If some percentage of individuals prove to be impossible to direct towards a good future without causing massive disutility in general, we have to devise a way to look at each such case and ask what sorts of individuals are not getting a payoff. If it turns out to be a small number of sociopaths, this will probably not be a problem. I expect that we will have the technology to treat sociopaths and bring them into the fold. CEV should consider this possibility as well. If it turns out to be a small number of very likable people, it could be somewhat more complicated, and we should ask why this is happening. I can't think of any reasonable scenarios for this at the moment, but I think this is worth thinking about more.

The kernel of this problem is very central to CEV as I understand it, so I think it is good to discuss it in as much detail as we can in order to glean insight.

[-]torekp14y10

2a - If volition depends on emotional state, what we want is a me+ who is able to have any of these emotional states, but is not stuck in any one of them. Me+ will grok the states of chocolate-in-hand, chocolate-in-mouth, and fat-on-hips, taking on each emotional set in turn, and then consider the duration as well as the character of each experience. I don't see this as especially problematic, beyond the way that every psychological simulation/prediction is challenging.

3a - Not all psychological changes are problematic for what matters. Parfit has been criticized (unfairly?) on this very point, especially when it comes to changes that are increases in knowledge and rationality. (It may be a misreading of him to infer that all changes count as decreased connectedness over time.) Whenever we try to reason out what it is that we really want, we show a commitment to rationality. We can hardly complain if our criterion of "what we really want" includes increased rationality on the search path.

4c - If "want to want" can't be leveraged into just plain want, in the agent's most rational moments, I suspect it's just hot air. Sometimes "akrasia" isn't, and stated goals are sometimes abandoned on reflection.

[-]hairyfigment14y10

In this specific case, what ends up dominating CEV is what evolution wants, not what we want.

Possibly. It also sounds like the best part of Robert Heinlein's Good Outcome for the future. I think we can do better -- but you seem to be arguing for the claim that we can't. Still beats paperclips, or even true orgasmium.

[-]diegocaleiro14y00

We can do better if we take this kind of problem into consideration. If there is too much of what Eliezer calls spread and muddle, we may end up just evolving faster. I don't think blind, faster evolution would be at the top of anyone's list of desires.

[-]Caspian14y00

One of my issues with the interpersonal coherence part: you are splitting off part of someone's wish - the incoherent part - and the remainder may be something that was only desired in the context of the full wish. For example, people might coherently wish for people to have superpowers but have incoherent preferences about dealing with the more powerful criminals that result.

[-]Giles14y00

Is this a good summary of the ideas presented here or did I miss something important?

1) We need a correct definition for all the (apparently very fuzzy) concepts that CEV relies upon.

2a) People appear to have multiple "selves"; the preferences of each "self" are more consistent than the aggregation of all of them.

2b) If you strip away all the incoherent preferences, you might strip away most of the stuff you really care about.

3) A much smarter version of me does not resemble me any more. That person's preferences are not my preferences.

4a) We are behavior-executors, not utility-maximizers. The notion of a "preference" or "goal" exists in the map, not the territory. Asking "what is someone's true preference" is like asking whether it's a blegg or a rube, etc.

4b) Our reports of our own preferences are unreliable.

4c) CEV doesn't appear to address "Want to want".

[-]diegocaleiro14y00

You have not considered the failure mode called "Defeated by Evolution".

Other than that, it is a great, really short summary. Why don't you do the same for Part2? :)
