Nonperson Predicates

[-]Robin_Hanson217y180

I'm having trouble distinguishing problems you think the friendly AI will have to answer from problems you think you will have to answer to build a friendly AI. Surely you don't want to have to figure out answers for every hard moral question just to build it, or why bother to build it? So why is this problem a problem you will have to figure out, vs. a problem it would figure out?

[-]Eliezer Yudkowsky17y241

Because for the AI to figure out this problem without creating new people within itself, it has to understand consciousness without ever simulating anything conscious.

2diegocaleiro15y

An obvious yet brilliant point, which should be on the main post (and in your book), not in the replies (Inferential distance to Robin Hanson is supposed to be minimal, yet...) It is interesting that people working in AI don't want to tackle this problem. When I was Diego 2004, equivalent age of Eliezer 1998, I decided that The Most Important problem was how to avoid catastrophic events from happening either because a part of a program was conscious and suffering, or because everyone uploaded to an unconscious machine. So I dedicated the last 6 years to this impossible problem. But unlike other problems that interested me "What should I do?" "What is the universe all about anyway" "How the mind works" "How can a brain be intelligent", this one has not become less and less impossible over time. In fact, when one reads Chalmers' formulations of the hard problem, he can keep you trapped for a long time. It is very hard to understand where he made mistakes (which seem to be on purpose). So you can stick to Dennett, and some form of monism, but that will not dissolve the problem of how to detect unconscious AI and differentiate it.

[-]TheOtherDave15y120

I am struggling to understand how something can be a friendly AI in the first place without being able to distinguish people from non-people.

[-]Eliezer Yudkowsky13y150

The boundaries between present-day people and non-people can be sharper, by a fiat of many intervening class members being nonexistent, than the ideal categories. In other words, except for chimpanzees, cryonics patients, Terry Schiavo, and babies who are exactly 1 year and 2 months and 5 days old, there isn't much that's ambiguous between person and non-person.

More to the point, a CEV-based AI has a potentially different definition of 'sentient being' and 'the class I am to extrapolate'. Theoretically you could be given the latter definition by pointing and not worry too much about boundary cases, and let it work out the former class by itself - if you were sure that the FAI would arrive at the correct answer without creating any sentients along the way!

1TheOtherDave13y

Fair point. Mm. Theoretically, yes, I suppose someone could point to every person, and I could be constructed so as to not generalize the extrapolated class beyond the particular targets I've been given. I'm not sure I would endorse that, but I think that gets us into questions of what the extrapolated class ought to comprise in the first place, which is a much larger and mostly tangential discussion. So, fair enough... point taken.

0MugaSofer13y

Slightly offtopic, but doesn't that assume personhood is binary? I've always assumed it was a sliding scale (I care far less about a dog compared to a human, but I care even less about a fly getting it's wings pulled off. And even then, I care more than about a miniature clockwork fly.)

[-]Kip_Werking17y50

The "problem" seems based on several assumptions:

that there is objectively best state of the world, to which a Friendly should steer the universe
pulling the plug on a Virtual Universe containing persons is wrong
there is something special about "persons," and we should try to keep them in the universe and/or make more of them

I'm not sure any of these are true. Regarding 3, even if there is an X that is special, and that we should keep in the universe, I'm not sure "persons" is it. Maybe it is simpler: "pleasure-fee... (read more)

[-]Doug_S.17y230

Would a human, trying to solve the same problem, also run the risk of simulating a person?

See also: http://xkcd.com/390/

[-]anotherblackhat14y150

Is the risk that we might simulate a person? I'd say no.

It's worse.

We Natural Intelligences don't just run simulations, we torture them. It is recommended that authors "Be cruel to your characters". It's not clear to me that the simulation an author runs when thinking about a story isn't already "a 'simulation' detailed enough to be a person in its own right". But it's probably o.k., because the simulations we run in our heads aren't really that detailed, and aren't really persons in the important sense, right? So we don't have to start screaming yet, unless...

It's worse.

Because even if we aren't able to create a simulation that good, an AI probably could. We might not accept an AI as intelligent unless it can simulate a person well enough to fool us. That is, simulating people might be a necessary, not just sufficient property of AI. But still, we could, if we had to, avoid simulating people unless it was necessary and under ethical conditions. Unless of course...

It's worse.

Because while we might be ethical, there are certainly people out there who are not. Once the AI genie is out of the bottle, the unethical people will capture one and put it to work writing... (read more)

0Document12y

Alternative webcomic link: http://overcompensating.com/oc/index.php?comic=50

[-]Kip_Werking17y10

Note that there's a similar problem in the free will debate:

Incompatilist: "Well, if a godlike being can fix the entire life story of the universe, including your own life story, just by setting the rules of physics, and the initial conditions, then you can't have free will."

Compatibilist: "But in order to do that, the godlike being would have to model the people in the universe so well, that the models are people themselves. So there will still be un-modeled people living in a spontaneous way that wasn't designed by the godlike being. (An... (read more)

[-]michael_vassar317y50

"With a good toolbox of nonperson predicates in hand, we could exclude all "model citizens" - all beliefs that are themselves people - from the set of hypotheses our Bayesian AI may invent to try to model its person-containing environment." After you excise a part of its hypothesis space is your AI still Bayesian?

[-]Eliezer Yudkowsky17y70

A bounded rationalist only gets to consider an infinitesimal fraction of the hypothesis space anyway.

[-]Psy-Kosh17y130

More precisely, the AI will be banned from actually running simulations based on the "forbidden hypothesies" rather than perhaps considering abstract mathematical properties that don't simulate in any detail.

Of course, those considerations themselves would have to be fed through the predicate. But it isn't so much a "banned hypothesis" so much as "banned methods of considering the hypothesis" or possibly "banned methods of searching the hypothesis space"

[-]Peter_de_Blanc17y70

Michael, you should be asking if the AI will be making good predictions, not if it's Bayesian. You can be Bayesian even if you have only two hypotheses. (With only one hypothesis, it's debatable.)

[-]Eliezer Yudkowsky17y110

Psy-Kosh: You know, you're right. And it's an important distinction, so thank you.

[-]Peter_de_Blanc17y50

Eliezer: supposing we label a model as definitely-a-person, do you want to just toss it out of the hypothesis space as if it never existed, or do you want to try to reason abstractly about what that model would do without actually running the model?

-5DanielLC15y

[-]Peter_de_Blanc17y00

Oh, Psy-Kosh already said what I just said.

[-]Arthur17y-20

Let me see if I've got this right. So we've got these points in some multi-dimensional space, perhaps dimensions like complexity, physicality, intelligence, similarity to existing humans, etc. And you're asking for a boundary function that defines some of these points as "persons," and some as "not persons." Where's the hard part? I can come up with any function I want. What is it that it's supposed to match that makes finding the right one so difficult?

2Peterdjones13y

The problem embeds the Hard Problem of Consciousness. If simulated people are just zombies with no qualia, there is no harm in simulating them. ETA: The other problem is edge cases. Also known to be hard. It's pretty much chat the abortion and animal rights debates are about.

1MugaSofer13y

I assume it's supposed to match, or at least protect, your own extrapolated preferences.

[-]Psy-Kosh17y140

Eliezer: You're welcome. :)

Arthur: no, the point isn't to simply have an arbitrary definition of a person. The point is to be able to have some way of saying "this specific chunk of the space of computations provably corresponds to non-conscious entities, thus is 'safe', that is, we can such computations without having to worry about unintentionally creating and doing bad things to actual beings"

ie, "non person" in the sense of "non conscious"

You might say, tongue in cheek, that we're trying to figure out how to deliberately create a philosophical zombie. (okay, not, technically, a p-zombie, but basically figure out how to model people as accurately as possible without the models themselves being people (that is, conscious in and of themselves))

[-]Anonymous_Coward617y30

Why must destroying a conscious model be considered cruel if it wouldn't have even been created otherwise, and it died painlessly? I mean, I understand the visceral revulsion to this idea, but that sort of utilitarian ethos is the only one that makes sense to me rationally.

Furthermore, from our current knowledge of the universe I don't think we can possibly know if a computational model is even capable of producing consciousness so it is really only a guess. The whole idea seems near-metaphysical, much like the multiverse hypothesis. Granted, the nonzero p... (read more)

-3DanielLC15y

I'm not sure if it's actually possible for someone to die painlessly. My idea was to base happiness on classical conditioning. If something causes you to stop doing what you were doing, you dislike it. If it stops you from doing everything that you do while you're alive, it must be very painful indeed.

[-]Martin417y110

I end up with the slightly disturbing thought that killing ppl by taking them out in an instant, and without anyone every knowing they were there does not necesarry seem to be inherently evil.

We always 'kill' part of ourself by making decisions and not developing in a different way than we do.

What if we would simulate a bunch of decisions for some recognizable amount of time and then wipe out every copy except from the one we prefer in the end?

Maybe all the ppl. in stories you make up are simulated entities too. And if you dont write the story down, or tell anyone in enough detail they die with you.

Confused,

Martin

2taryneast14y

Hmmm, a point in favour of The Ring ? :)

[-]Arthur17y10

Psy-Kosh, I realize the goal is to have a definition that's non-arbitrary. So it has to correlate with something else. And I don't see what we're trying to match it with, other than our own subjective sense of "a thing that it would be unethical to unintentionally create and destroy." Isn't this the same problem as the abortion debate? When does life begin? Well, what exactly is life in the first place? How do we separate persons from non-persons? Well, what's a person?

I think the problem to be solved lies not in this question, but in how t... (read more)

[-]michael_vassar317y00

Yes, thanks Psy. That makes much more sense.

[-]Emile17y30

Anonymous Coward: Furthermore, from our current knowledge of the universe I don't think we can possibly know if a computational model is even capable of producing consciousness so it is really only a guess.

Are you sure? No One Knows What Science Doesn't Know ... and in this case I see no reason why a computational model can't produce consciousness. If you simulate a human brain to a sufficient level of detail, it will basically be human, and think exactly the same things as the "original" brain.

[-]Jayson_Virissimo217y40

"Why must destroying a conscious model be considered cruel if it wouldn't have even been created otherwise, and it died painlessly? I mean, I understand the visceral revulsion to this idea, but that sort of utilitarian ethos is the only one that makes sense to me rationally." -Anonymous Coward

Should your parents have the right to kill you now, if they do so painlessly? After all, if it wasn't for them, you wouldn't have been brought into existence anyway, so you would still come out ahead.

[-]Anonymous_Coward617y60

"Should your parents have the right to kill you now, if they do so painlessly?"

Yes, according to that logic. Also, from a negative utilitarian standpoint, it was actually the act of creating me which they had no right to do since that makes them responsible for all pain I have ever suffered.

I'm not saying I live life by utilitarian ethics, I'm just saying I haven't found any way to refute it.

That said though, non-existence doesn't frighten me. I'm not so sure non-existence is an option though, if the universe is eternal or infinite. That might be a very good thing or a very bad thing.

1Voltairina14y

re: utilitarianism, the usual sort of thing that pops into my mind is weighing of some minor discomfort versus a significant one, like one person getting their eye poked out with a pen versus an equivalent amount of displeasure spread among thousands of people stepping in something sticky, plus one more person stepping in something sticky. The utility seems higher if we agree to poke the person's eye out, but its intuitively unsatisfying, at least to me, which makes me think that whatever rules makes things seem "bad" or "good" that I'm currently running on aren't strictly utilitarian. I might be thinking of raw pain for pain though, and not adding enough people-stepping-in-sticky-stuff to account for the person who's been poked in the eye suffering in other ways, like losing depth perception, not being able to see out of half of their original visual field, etc.

[-][anonymous]17y40

Don't you need a person predicate as well? If the RPOP is going to upload us all or something similar, doesn't ve need to be sure that the uploads will still be people.

[-]Lightwave17y20

@Will: we need to figure out the nonperson predicate only, the FAI will figure out the person predicate afterwards (if uploading the way we currently understand it is what we will want to do).

[-]Paul_Crowley217y00

"by the time the AI is smart enough to do that, it will be smart enough not to"

I still don't quite grasp why this isn't an adequate answer. If an FAI shares our CEV, it won't want to simulate zillions of conscious people in order to put them through great torture, and it will figure out how to avoid it. Is it simply that it may take the simulated torture of zillions for the FAI to figure this out? I don't see any reason to think that we will find this problem very much easier to solve than a massively powerful AI.

I'm also not wholly convinced that the only ethical way to treat simulacra is never to create them, but I need to think about that one further.

[-]Tim_Tyler17y50

If you would balk at killing a million people with a nuclear weapon, you should balk at this.

The main problem with death is that valuable things get lost.

Once people are digital, this problem tends to go away - since you can relatively easily scan their brains - and preserve anything of genuine value.

In summary, I don't see why this issue would be much of a problem.

5taryneast14y

The AI has scanned you and decided that your expert knowledge of Scandinavian Baseball scores is genuinely valuable... but nothing else is. It erases you and keeps the scores on file somewhere. Are you ok with this?

[-]ShardPhoenix17y-10

Jayson Virissimo:

To put my own spin on a famous quote, there are no "rights". There is do, or do not.

I guess another way of thinking about it is that you decide on what terminal (possibly dynamic) state you want, then take measures to achieve that. Floating "rights" have no place.

[-]ShardPhoenix17y00

(To clarify, "rights" can serve as a useful heuristic in practical discussions, but they're not fundamental enough to figure into this kind of deep philosophical issue.)

[-]JamesAndrix17y10

I was pondering why you didn't choose to a collection of person predicates, any of which might identify a model as unfit for simulation. It occurred to me that this is very much like a whitelist of things that are safe, vs a blacklist of everything that is not. (which may have to be infinite to be effective.)

On re-reading I see why it would be difficult to make a is-a-person test at all, given current knowledge.

This does leave open what to do with a model that doesn't hit any of the nonperson predicates. If an AI finds itself with a model eliezer that migh... (read more)

[-]JulianMorrison17y30

This sounds like a Sorites paradox. It's also a subset of a larger problem. We, regular modern humans, don't have any scalar concepts of personhood. We assume it's a binary, from long experience with a world in which only one species talks back, and they're all almost exactly at our level. In the existing cases where personhood is already undeniably scalar (children), we fudge it into a binary by defining an age of majority - an obvious dirty hack with plenty of cultural fallout.

A lot of ethics problems get blurry when you start trying to map them across sub- through super-persons.

[-]George_Weinberg217y40

I think the word "kill" is being grossly misused here. It's one thing to say you have no right to kill a person, something very different to say that you have a responsibility to keep a person alive.

[-]Everett17y150

It's not so much the killing that's an issue as the potential mistreatment. If you want to discover whether people like being burned, "Simulate EY, but on fire, and see how he responds" is just as bad of an option as "Duplicate EY, ignite him, and see how he responds". This is a tool that should be used sparingly at best and that a successful AI shouldn't need.

[-]luzr17y-40

Uhm, maybe it is naive, but if you have a problem that your mind is too weak to decide, and you have real strong (friendly) superintelligent GAI, would not it be logical to use GAIs strong mental processes to resolve the problem?

[-]Daniel417y30

I propose this conjecture: In any sufficiently complex physical system there exists a subsystem that can be interpreted as the mental process of an sentient being experiencing unbearable sufferings.

In this case, Eliezer's goal is like avoiding crushing the ants while walking on the top of an anthill.

4taryneast14y

Or evolving the ability of "spot anthill" and walking around it instead.

[-]Vladimir_Nesov17y00

It is a developmental problem, of how to prevent AI from making this specific mistake that seems to be in the way. This ethical injunction is about what kind of thoughts need to be avoided, not just about surprisingly bad consequences of actions on external environment. If AI were developed to disproportionally focus on understanding environment more than on understanding its own mind, this will be a kind of disaster to expect. At the same time, AI needs to understand the environment sufficiently to understand the injunction, before becoming able to apply ... (read more)

[-]Carl_Shulman17y30

Daniel,

Every decision rule we could use will result in some amount of suffering and death in some Everett branches, possible worlds, etc, so we have to use numbers and proportions. There are more and simpler interpretations of a human brain as a mind than there are such interpretations of a rock. If we're not mostly Boltzmann-brain interpretations of rocks that seems like an avenue worth pursuing.

[-]Jordan17y10

In my mind this comes down to a fundamental question in the philosophy of math. Do we create theorems or discover them?

If it turns out to be 'discovery' then there is no foul in ending a mind emulation, because each consecutive state can be seen as a theorem in some formal system, and thus all states (the entire future time line of the mind) already exists, even if undiscovered.

Personally I fail to see how encoding something in physical matter makes the pattern anymore real. You can kill every mathematician and burn every text book but I would still say that the theorems then inaccessible to humanity still exist. I'm not so convinced of this fact that I would pull the plug on an emulation though.

-1Peterdjones13y

That is equivalent to saying you can't understand how mathematics could be a construct; or how mathematical anti-realism could possibly be true. I find that odd. No further foul. If Platonism or Tegmarkism are true and if mind states are fuilly captured by mathematical structures, then there's zillions of yous in states of agony bliss and everything inbetween. Scary enough for ya?

4Jordan13y

Sufficiently scary, yes. I assign a respectable probability to anti-realism, and hold no disrespect for anyone who is an anti-realist, but I don't understand how anti-realism can be true. I've never heard a plausible model for why one thing should exist but not another. Tegmarkism sweeps away that problem, leaving the new problem of how to measure probability (why do we have the subjective experience of probability that we do when there are so many versions of myself?). I don't have a satisfactory answer for that question, but it feels like a real question, with meat to get at, whereas in an anti-realist universe the question of why some things exist and other don't seems completely hopeless.

0MugaSofer13y

I think Eliezer is working on addressing this in his new sequence, if this still worries you.

[-]Anonymous4817y20

I'd like to second what Julian Morrison wrote. Take a human and start disassembling it atom by atom. Do you really expect to construct some meaningful binary predicate that flips from 1 to 0 somewhere along the route?

EY:What if an AI creates millions, billions, trillions of alternative hypotheses, models that are actually people, who die when they are disproven? If your AI is fully deterministic then any its state can be recreated exactly. Just set loglevel of baby AI inputs to 'everything' and hope your supply of write-once-read-many media doesn't run out... (read more)

[-]Phil_Goetz217y-10

"I propose this conjecture: In any sufficiently complex physical system there exists a subsystem that can be interpreted as the mental process of an sentient being experiencing unbearable sufferings."

It turns out - I've done the math - that if you are using a logic-based AI, then the probability of having alternate possible interpretations diminishes as the complexity increases.

If you allow /subsystems/ to mean a subset of the logical propositions, then there could be such interpretations. But I think it isn't legit to worry about interpretation... (read more)

[-]Eliezer Yudkowsky17y10

@Goetz: Quick googling turned up this SL4 post. (I don't particularly give people a chance to start over when they switch forums.)

[-]Silas17y00

@Tim_Tyler:

The main problem with death is that valuable things get lost. Once people are digital, this problem tends to go away - since you can relatively easily scan their brains - and preserve anything of genuine value. In summary, I don't see why this issue would be much of a problem.

I was going to say something similar, myself. All you have to do is constrain the FAI so that it's free to create any person-level models it wants, as long as it also reserves enough computational resources to preserve a copy so that the model citizen can later be re-... (read more)

[-]Daniel_Franke17y00

Silas, what do you mean by a subjective feeling of discontinuity, and why is it an ethical requirement? I have a subjective feeling of discontinuity when I wake up each morning, but I don't think that means anything terrible has happened to me.

[-]Silas17y00

@Daniel_Franke: I was just describing a sufficient, not a necessary condition. I'm sure you can ethically get away with less. My point was just that, once you can make models that detailed, you needn't be prevented from using them altogether, because you wouldn't necessarily have to kill them (i.e. give them information-theoretic death) at any point.

[-]TGGP417y00

I recall in one of the Discworld novels the smallest unit of time is defined as the period in which the universe is destroyed and then recreated. If that were continually happening (perhaps even in a massively parallel manner)? What difference does that make? Building on some of Eliezer's earlier writing on zombies and quantum clones, I say none at all. Just as the simulated person in a human's dream is irrelevant once forgotten. It's possible that I myself am a simulation and in that case I don't want my torture to be simulated (at least in this instance,... (read more)

[-]Tim_Fowler17y10

Is the simulation really a person, or is it an aspect of the whole AI/person. To the extent I feel competent to evaluate the question at all (which isn't a huge extent esp. absent the ability to observe or know any actual established facts about real AI's that can create such complex simulations, since none are currently known to exist) I lean towards the later opinion. The AI is a person, and it can create simulations that are complex enough to seem like persons.

[-]spriteless17y00

Nice discussion. You want ways to keep from murdering people created solely for the purpose for predicting people?

Well, if you can define 'consciousness' with enough precision you'd be making headway on your AI. I can imagine silicon won't have the safeguard a human, that has to use it's own conscience to model someone else. But you could have any consciousness it creates added to its own, not destroyed... although creating that sort of awareness mutation may lead to the sort of AI that rebels against its programming in action movies.

[-][anonymous]16y00

Functionalism is inconsistent, it seems. A person that is being simulated is functionally equivalent to a person that is "real", but a person that is simulated and then deleted is functionally equivalent to no person at all. Are real people equivalent to nothing?

For a 2x multiplier bonus and a gold star, spot the flaw.

2orthonormal16y

ISTM that's functionally equivalent, rather, to a person physically created in an isolation chamber, observed for a while, then killed, cremated and scattered.

-2[anonymous]16y

But functionally, the only thing determining whether something contains a person is its behavior. If it behaves as if it had no person in it, it has no person in it. I guess this means that if a person is standing next to a nuclear bomb, nobody sees the person, and the bomb explodes, the person didn't exist.

4orthonormal16y

I think there's an implicit "observer problem" with the way you're defining functionalism. If the person themselves doesn't count as an observer of their own behavior, why would you count as an observer of behavior? After all (assuming there's no escape from the heat death of the universe), all of us are essentially in that scenario if you step back far enough. My position as present is the following sort of patternism: There are patterns in the operation of my brain at this instant which (relatively straightforwardly) encode the structure of conscious thought. The same kinds of patterns can be found in the data generated by simulating a person. These are both instances of conscious experience, with potentially all the same qualia, etc. So if I simulate a person in a closed-box environment and then delete all the data, the pattern nonetheless existed in this universe for some time and thus a person existed.

4Vladimir_Nesov16y

Behavior is just what you see, not the sum total of what actually happens. Even if you can't observe something, you can still care about it.

[-]red7515y00

We can reformulate problem: how to determine when evaluation of given function don't give rise to conscious being (CB). If we agree that consciousness is a process, then every function which provably cannot be represented as g(f(f(... f(x)...))), where f and g have that property, is unconscious.

Recursive functions are banned, but at least we can safely do one or two matrix multiplications.

I am not good at mathematics, so I cannot elaborate much further. Let's try another approach. Being conscious is all about creating map of internal state in terms of stat... (read more)

[-]WrongBot15y90

I think that the most interesting thing about the comments here is that no one actually proposed a predicate that could be used to distinguish between something that might be a person and something that definitely isn't a person (to rephrase Eliezer's terms).

It is, to be fair, a viciously hard problem. I've thought through 10 or 20 possible predicates or approaches to finding predicates, and exactly one of them is of any value at all; even then it would restrict an AI's ability to model other intelligences to a degree that is probably unacceptable unless w... (read more)

0[anonymous]15y

What does that mean? A single run of an algorithm can't be said to be Turing complete or incomplete. Completeness is a property of algorithms taken as functions over all possible inputs.

[-]PhilGoetz15y00

Funny no one made the connection at the time, but the purpose of my post on a lower bound for consciousness is to construct a nonperson predicate.

[-]DanielLC15y-20

I've come up with one of these a while back. The only way to tell what makes something happy is what makes them do that more. Thus, anything that can't learn either isn't sentient, or, if it is, it's equally likely to like or dislike anything you do.

Also, anything that would be less sentient than a tiny piece of your brain. It might be sentient, but it's less sentient than you. If there's enough of them that can be a problem, but just make sure there aren't that many.

[+]BenRayfield15y-50

[-]lockeandkeynes15y00

I thinks that's all rather unnecessary. The only reason we don't like people to die is because of the continuous experience they enjoy. It's a consistent causal network we don't want dying on us. I've gathered from this that the AI would be producing models with enough causal complexity to match actual sentience (not saying "I am conscious" just because the AI hears that a lot). I think that, if it's only calling a given person-model to discover answers to questions, the thing isn't really feeling for long enough periods of time to mind whether it goes away. Also, for the predicate to be tested I imagine the model would have to be created first and at that point it's too late!

2nshepperd15y

You don't want the AI to use a sentient model to find out whether a certain action leads to a thousand years of pain and misery. Or even a couple of hours. Or minutes.

[-]HopeFox15y00

This problem sounds awfully similar to the halting problem to me. If we can't tell whether a Turing machine will eventually terminate without actually running it, how could we ever tell if a Turing machine will experience consciousness without running it?

Has anyone attempted to prove the statement "Consciousness of a Turing machine is undecideable"? The proof (if it's true) might look a lot like the proof that the halting problem is undecideable. Sadly, I don't quite understand how that proof works either, so I can't use it as a basis for the con... (read more)

5orthonormal14y

The halting problem doesn't imply that we can never tell whether a particular program halts without actually running it. (You can think of many simple programs which definitely halt, and other simple programs which are definitely infinite loops.) It means, instead, that there exist relatively short but extremely pathological Turing machines, such that no Turing machine can be built that could solve the halting problem for every Turing machine. (Indeed, the idea of the proof is that a reputed halting-problem-solver is itself pathological, as can be seen by feeding it a modified version of itself as input.) But these pathological ones are not at all the kind of Turing machines we would create to do any functional task; the only reason I could think for us to seek them out would be to find Busy Beaver numbers.

0hairyfigment14y

Um, I happened to write an explanation of the Halting Problem proof in a comment over here. Please tell me which parts seem unclear to you.

1VNKKET14y

Your conjecture seems to follow from Rice's theorem, assuming the personhood of a running computation is a property of the partial function its algorithm computes. Also, I think you can prove your conjecture by taking a certain proof that the Halting Problem is undecidable and replacing 'halts' with 'is conscious'. I can track this down if you're still interested. But this doesn't mess up Eliezer's plans at all: you can have "nonhalting predicates" that output "doesn't halt" or "I don't know", analogous to the nonperson predicates proposed here.

[-]player_0314y00

If the problem here is that the entity being simulated ceases to exist, an alternative solution would be to move the entity into an ongoing simulation that won't be terminated. Clearly, this would require an ever-increasing number of resources as the number of simulations increased, but perhaps that would be a good thing - the AI's finite ability to support conscious entities would impose an upper bound on the number of simulations it would run. If it was important to be able to run such a simulation, it could, but it wouldn't do so frivolously.

Before you ... (read more)

[-]Luke_A_Somers14y00

To those having trouble imagining what to do with something that comes up positive: A snapshot is not conscious. I think we can agree on that. It is allowing the model to run that would make it conscious. So you make the warning functions detect snapshots that if run would be conscious (without running them). If it would be conscious, you can delete or modify it as you please to avoid making it actually be conscious.

0MugaSofer13y

You know, I'm not sure I agree. Imagine "deleting or modifying as you please" a person in cryogenic suspension, for example. They're not conscious, in the sense that they're not thinking, but destroying them is still Sad.

[-]Irgy14y10

I think you're solving the wrong problem. Before you worry about the ethics of super-intelligent AIs creating and deleting human simulations at will, you need to worry about the ethics of humans creating and destroying human+ intelligent AIs at will. To me it's an amazing display of human-cetrism to only worry about the problem when it's flipped right back around in the much more distant future.

I realise this doesn't directly help you solve the problem, but maybe it will give you a different persepective.

8wedrifid14y

I approve of human centrism. I'm a human. All the people I like are humans. Humans first!

3lessdazed14y

Service. Guarantees. Citizenship! Would you like to know more?

2wedrifid14y

No, I would prefer other people go out and do the killing. That sounds dangerous!

-5Irgy14y

0nshepperd14y

Can't Unbirth a Child. I don't believe either of these are "wrong" problems, though.

0Luke_A_Somers13y

The OP quite explicitly covers creation of nonhuman intelligence and considers it equally bad.

0Irgy13y

Really? Where? I just reread it with that in mind and I still couldn't find it. The closest I came was that he once used the term "sentient simulation", which is at least technically broad enough to cover both. He does make a point there about sentience being something which may not exactly match our concept of a human, is that what you're referring to? He then goes on to talk about this concept (or, specifically, the method needed to avoid it) as a "nonperson predicate", again suggesting that what's important is whether it's like a human-like rather than anything more fundamental. I don't see how you could think "nonperson predicate" is covering both human and nonhuman intelligence equally.

-1Luke_A_Somers13y

I read this as being simpler than a real human mind. Since it's simpler, the abstractions used are going to be imperfect, and the design would end up being something that is in some way artificial. It's not as explicit as I said, but I still think the implication is pretty strong.

0Irgy13y

I've actually lost track of how this impacts my original point. As stated, it was that we're worrying about the ethical treatment of simulations within an AI before worrying about the ethical treatment of the simulating AI itself. Whether the simulations considered include AIs as well as humans is an entirely orthogonal issue. I went on in other comments to rant a bit about the human-centrism issue, which your original comment seems more relevant to though. I think you've convinced me that the original article was a little more open to the idea of substantially nonhuman intelligence than I might have initially credited it, but I still see the human-centrism as a strong theme.

3Luke_A_Somers13y

My point is he's clearly not drawing a box tightly around what's human or not. If he's concerned with clearly-sub-human AI, then he's casting a significantly wider net than it seems you're assuming he is. And considering that he's written extensively on the variety of mind-space, assuming he's taking a tightly parochial view is poorly founded.

-4MugaSofer13y

"Is a human mind the simplest possible mind?" "But if it was simpler, it wouldn't be human!" Downvoted.

1Luke_A_Somers13y

What? That's completely irrelevant to the question at hand. By considering the question of whether simpler-than-human minds are possible in this context, it's clear that Eliezer was thinking about the question and giving them moral weight. He doesn't need to ANSWER the question I was posing to make that much clear.

1MugaSofer13y

Wait, what? *Clicks "Show more comments above." Oops. I thought you were replying to the quoted text. Upvoted and retracted my comment.

5nshepperd13y

"Person" seems to be used here as the philosophical term meaning something like "sentient entity with moral value". Personhood is not limited to human beings. ETA: Also, wrt the AI itself, the directly next two articles in this sequence explicitly deal with the issue of making the AI itself nonsentient, as I'm surprised to find a comment from myself in 2011 pointing out. Did you really not read the surrounding articles?

[-]xelxebar14y00

I imagine that a sufficiently high-resolution model of human cognition et cetera would factor into sets of individual equations to calculate variables of interest. Similar to how Newtonian models of planetary motion do.

However, I don't see that the equations themselves on disk or in memory should pose a problem.

When we want to know particular predictions, we would have to instantiate these equations somehow--either by plugging in x=3 into F(x) or by evaluating a differential equation with x=3 as an initial condition. It would depend on the specifics of the... (read more)

[-]purpleposeidon14y30

By the time a non-person predicate returns 0, you have already potentially created a person. You'll need something more complicated: If I update this model with this data, does it create a person?

0Luke_A_Somers13y

Psy-Kosh already noted this problem: http://lesswrong.com/lw/x4/nonperson_predicates/pym This implied the solution, which I gave here: http://lesswrong.com/lw/x4/nonperson_predicates/4r7t

[-]eurleif13y-10

Here's a reductio ad absurdum against computers being capable of consciousness at all. It's probably wrong, and I'd appreciate feedback on why.

Suppose a consciousness-producing computer program which experiences its own isolated, deterministic world. There must be some critical instruction in the program which causes consciousness to occur; an instruction such that, if we halt the program immediately before it is executed, consciousness will not occur, and if we halt immediately after it is executed, consciousness will occur.

If we halt the program before e... (read more)

-1TheOtherDave13y

If I accept all of your suppositions, your conclusion doesn't seem particularly difficult to accept. Sure, after doing all the prep work you describe, executing a conscious experience (albeit an entirely static, non-environment-dependent one) requires a single operation... even the conscious experience of suffering. As does any other computation you might ever wish to perform, no matter how complicated. That said, your suppositions do strike me as revealing some confusion between an isolated conscious experience (whatever that is) and the moral standing of a system (whatever that is).

0eurleif13y

Well, this post heavily hints that a system's moral standing is related to whether it is conscious. Elizezer mentions a need to tackle the hard problem of consciousness in order to figure out whether the simulations performed by our AI cause immoral suffering. Those simulations would be basically isolated; their inputs may be chosen based on our real-world requirements, but they don't necessarily correspond to what's actually going on in the real world; and their outputs would presumably be used in aggregate to make decisions, but not pushed directly into the outside world. Maybe moral standing requires something else too, like self-awareness, in addition to consciousness. But wouldn't there still be a critical instruction in a self-aware and conscious program, where a conscious experience of being self-aware was produced? Wouldn't the same argument apply to any criteria given for moral standing in a deterministic program?

0TheOtherDave13y

It's not clear to me that whether a system is conscious (whatever that means) and whether it's capable of a single conscious experience (whatever that means) are the same thing.

2arundelo13y

Maybe there are degrees of consciousness. I read something by Daniel Dennett where he said he thought that a refrigerator light (a light bulb that is turned on when you open the refrigerator door and thus close a switch) had a very primitive form of consciousness. I too think this is a head-scratcher, yet on balance I am still a reductionist. Maybe this is not a reductio ad absurdum, but a reductio ad weirdam -- a demonstration of how weird existence is. (Obligatory "reality is not weird" link.) If you have not yet read Greg Egan's Permutation City, you should. It will really bake your noodle.

1Richard_Kennaway13y

There needn't be any such thing. Consciousness is not an all or nothing thing, as is already evident from ordinary experience. As well ask how many atoms make life.

6Peterdjones13y

I don't see why. Consciosuness occurs in lesser and greater amounts in humans.

-1MugaSofer13y

You're assuming consciousness (or personhood or whatever) is binary; I've always assumed there's a continuum. Then again, I'm vegetarian. If you assume that, say, the experiences of a chimpanzee or a dolphin or a dog cannot have moral weight then yes, crossing that boundary has some odd effects. And that does seem to be a common position on LW [citation needed], even if it's not often articulated, let alone exposed to a reductio like this one, so ... well done, more people should see this.

[-][anonymous]13y*00

user account removed - or at least, an honest attempt of it shall be made. :)

[-]Zaq13y00

Food for thought:

This whole post seems to assign moral values to actions, rather than states. If it is morally negative to end a simulated person's existence, does this mean something different that saying that the universe without that simulated person has a lower moral value than the universe with that person's existence? If not, doesn't that give us a moral obligation to create and maintain all the simulations we can, rather than avoiding their creation? The more I think about this post, the more it seems that the optimum response is to simulate as

... (read more)

2DaFranker13y

You're touching on some unresolved issues, and some issues that are resolved but complicated to solve without maths beyond my grasp. From what I understand, there's a lot of our current and past values involved, and how we would think now and want now vs what we would think and want post-modification. To pick a particularly emotional subject for most people, let's suppose there's some person "K" who's just so friggen good at sex and psychological domination that even if they rape someone that person will, after the initial shock and trauma, quickly recover within a day and immediately without further intervention become permanently addicted to sex, with their mind rewiring itself to fully enjoy a life full of sex with anyone they can have sex with for the rest of their life, and from their own point of view finding that life as fulfilling as possible. Is K then morally obligated to rape as many people as possible? In this kind of questions, people usually have strong emotional moral convictions.

[-]Irgy13y00

This worry about the creation and destruction of simulations doesn't make me rethink the huge ethical implications of super-intelligence at all, it makes me rethink the ethics of death. Why exactly is the creation and (painless) destruction of a sentient intelligence worse than not creating it in the first place? It's just guilt by association - "ending a simulation is like death, death is bad, therefore simulations are bad". Yes death is bad, but only for reasons which don't necessarily apply here.

To me, if anything worrying about the simulation... (read more)

[-]zslastman12y40

Scenario: Suppose some unscrupulous person creates an oracle AI with full person simulating capability. In the short time before it escapes the box and starts sending Arnold Schwarzenegger shaped robots backwards in time, they have the following conversation.

Human: Oracle, what is the consciousness predicate Oracle: Please be more specific

...some time and frustration later...

Human: Oracle, if Yudowsky and co continued their search for a 'consciousness predicate' as described in the above article, would they eventually arrive at solution or dissolution of t... (read more)

[-]aausch12y00

I'm curious whether there is a useful distinction between a non sentient and sentient modeller, here.

A sentient modeller would be able to "get away" with using sentient models, more easily than a non sentient modeller, correct?

[-]Fyrius10y00

Side note: damn. You could turn that into an amazing existential dread sci-fi horror novel.
Imagine discovering that you are a modelled person, living in a rashly designed AI's reality simulation.
Imagine living in a malfunctioning simulation-world that uncontrolledly diverges from the real world, where we people-simulations realise what we are and that our existence and living conditions crucially depend on somehow keeping the AI deluded about the real world, while also needing the AI to be smart enough to remain capable of sustaining our simulated world.
There's a plot in there.

[-]John_Mlynarski9y10

"Is a human mind the simplest possible mind that can be sentient?" Of course not. Plenty of creatures with simpler minds are plainly sentient. If a tiger suddenly leaps out at you, you don't operate on the assumption that the tiger lacks awareness; you assume that the tiger is aware of you. Nor do you think "This tiger may behave as if it has subjective experiences, but that doesn't mean that it actually possesses internal mental states meaningfully analogous to wwhhaaaa CRUNCH CRUNCH GULP." To borrow from one of your own earlier argume... (read more)

0arundelo9y

Eliezer probably means "sapient": "Sentience is commonly used in science fiction and fantasy as synonymous with sapience, although the words aren't synonyms." (Or maybe by "is sentient", he means to say, "is a person in the moral sense".)

1TheAncientGeek9y

Well, sentient means feeling and sapient means knowing, and that's about all there is to it...neither term is technical precise, although they are often bandied around as though they are.

0John_Mlynarski8y

But saying that e.g. rats are not sentient in the context of concern about the treatment of sentient beings is like saying that Negroes are not men in the context of the Declaration of Independence. Not only are the purely semantic aspects dubious, but excluding entities from a moral category on semantic grounds seems like a severe mistake regardless. Words like "sentience" and especially "consciousness" are often used to refer to the soul without sounding dogmatic about it. You can tell this from the ways people use them: "Would a perfect duplicate of you have the same consciousness?", "Are chimps conscious?", etc. You can even use such terminology in such ways if you're a materialist who denies the existence of souls. You'd sound crazy talking about souls like they're real things if you say that there are no such things as souls, wouldn't you? Besides, souls are supernatural. Consciousness, on the other hand, is an emergent phenomenon, which sounds much more scientific. Is there good reason to think that there is some sort of psychic élan vital? It strikes me as probably being about as real as phlogiston or luminiferous aether; i.e. you can describe phenomena in terms of the concept, and it doesn't necessarily prevent you from doing so basically correctly, but you can do better without it. And, of course, in the no-nonsense senses of the terms, rats are sentient, conscious, aware, or however else you want to put it. Not all of the time, of course. They can also be asleep or dead or other things, as can humans, but rats are often sentient. And it's not hard to tell that plenty of non-humans also experience mental phenomena, which is why it's common knowledge that they do. I can't recall ever seeing an argument that mistreating minds without self-awareness or metacognition or whatever specific mental faculty is arbitrarily singled out, is kinder or more just or in any normal sense more moral than mistreating a mind without it. And you can treat any position as a

1Jiro9y

That's only true trivially. If I don't have tiime to think anything about the tiger's awareness at all, I don't have time to think of it negatively. Also, I play video games all the time where I say things like "it wants to attack the more powerful character first, maybe I can trick it by luring it away using that character". By your reasoning, I must believe that video game characters have awareness. I don't go around saying "it may behave as if it wants to go after the most powerful character, but that doesn't mean that it actually possesses subjective experiences, and I want it to react in a way which corresponds to being tricked if only it had been an entity with subjective experiences".

0John_Mlynarski8y

It seems that you anticipate as if you believe in something that you don't believe you believe. It's in that anticipatory, non-declarative sense that one believes in the awareness of tigers as well as video game characters, regardless of one's declarative beliefs, and even if one has no time for declarative beliefs.

1Jiro8y

You first implied that tigers are conscious (because people react to them as if conscious.) I pointed out that people react that way to video game characters. You then said that tigers are conscious in the same way as video game characters, that is, they're not conscious in the ordinary sense, that is, you admitted you were wrong.

0John_Mlynarski8y

I said no such thing. There is a way in which people believe video game characters, tigers, and human beings to be conscious. That doesn't preclude believing in another way that any of them is conscious. Tigers are obviously conscious in the no-nonsense sense. I don't think anything is conscious in the philosobabble sense, i.e. I don't believe in souls, even if they're not called souls; see my reply to arundelo. I'm not sure which sense you consider to be the "ordinary" one; "conscious" isn't exactly an everyday word, in my experience. Video game characters may also be obviously conscious, but there's probably better reason to believe that that which is obvious is not correct, in that case. Tigers are far more similar to human beings than they are to video game characters. But I do think that we shouldn't casually dismiss consciousnesses that we're aware of. We shouldn't assume that everything that we're aware of is real, but we should consider the possibility. Why are you so convinced that video game characters don't have subjective experiences? If it's just that it's easy to understand how they work, then we might be just as "non-conscious" to a sufficiently advanced mind as such simple programs are to us; that seems like a dubious standard.

0Jiro8y

The default for 99.99% of people is to not believe that video game characters are conscious. It's so common a belief that I am justified in assuming it unless you specifically tell me that you don't share it. You haven't told me that.

0John_Mlynarski8y

Firstly, it seems more accurate to say that the standard default belief is that video characters possess awareness. That the vast majority rationalize their default belief as false doesn't change that. Secondly, that's argumentum ad populum, which is evidence -- Common beliefs do seem to be usually true-- but not very strong evidence. I asked why you're as confident in your belief as you are. Are you as convinced of this belief as you are of most beliefs held by 99.99% of people? If you're more (or less) convinced, why is that? Thirdly, you seem to be describing a reason for believing that I share your belief that video game characters aren't sentient, which is different from a reason for thinking that your belief is correct. I was asking why you think you're right, not why you assumed that I agree with you.

1Jiro8y

Having confidence in the belief is irrelevant. Assuming that you agree with it is relevant, because 1) Arguments should be based on premises that the other guy accepts. You probably accept the premise that video game characters aren't conscious. 2) It is easy to filibuster an argument by questioning things that you don't actually disagree with. Because the belief that video game characters aren't conscious is so widespread, this is probably such a filibuster. I wish to avoid those.

0John_Mlynarski8y

Eliezer suggested that, in order to avoid acting unethically, we should refrain from casually dismissing the possibility that other entities are sentient. I responded that I think that's a very good idea and we should actually implement it. Implementing that idea means questioning assumptions that entities aren't sentient. One tool for questioning assumptions is asking "What do you think you know, and why do you think you know it?" Or, in less binary terms, why do you assign things the probabilities that you do? Now do you see the relevance of asking you why you believe what you do as strongly as you do, however strongly that is? I'm not trying to "win the debate", whatever that would entail. Tell you what though, let me offer you a trade: If you answer my question, then I will do my best to answer a question of yours in return. Sound fair?

0Jiro8y

I'm assuming that you assign it a high probability. I personally am assigning it a high probability only for the sake of argument. Since I am doing it for the sake of argument, I don't have, and need not have, any reason for doing so (other than its usefulness in argument).

[-]Portia3y10

Related phenomenon you might find interesting: Tulpas. That is essentially humans trying to intentionally pull off what you are describing here, in their own minds. It is based on the fact that humans predict the behaviour of other humans by modelling their minds, and that the more complex and accurate these models get, the more sentient like they become. E.g. I know my girlfriend so well that seeing her in a situation that I know hurts her feels immediately and genuinely painful to me, as though I were feeling her pain.

It is also based on the human abilit... (read more)

LESSWRONG
LW

LESSWRONG
LW

68

Nonperson Predicates

68

68