One Does Not Simply Replace the Humans

[-]the gears to ascension3y63

Indeed. It's a good thing nobody would cooperate with trying to make AIs that run on their own, holds finger to in-ear monitor, ah crap nevermind,

So I do think it's unlikely that yud's fear of a sudden totalizing AI is quite exactly what comes true - at least, not for a while, because as you point out, that's much harder than weaker forms of growth. But this threat model does not massively reassure me - humans could simply spend a while disempowered before dying. It gives us a bigger window, but actually achieving reliable integration with the human network of overlapping utility functions (people objectively caring about each other) is still not guaranteed and is worth pushing hard for. (not that you implied otherwise - just reciting the thing I'd want to say to a random person who linked me this post.)

Strong upvote.

[-]Charlie Steiner3y20

If there's an AGI with the goal to Kill All Humans, there is just as much chance an AGI with the goal to Save Humans At All Costs exists.

As terminal goals (things that are valuable for their own sakes, not just a means to an end), yes, I agree.

But I'm not worried about an AI that wipes out humans because it hates humans and thinks their destruction is good for its own sake. I'm worried about AI that wipes out humans because it's a means to an end.

Even if the goals of the AI entity were to fully divest from humans, cooperation with humans would still be desirable.
If you want to create a fully self-sufficient AI entity, there are mutually beneficial vectors- like self-replications systems that will work and grow on the moon, or Mars.

If an AI of unknown provenance tells us "Hey, I'm friendly, trust me. I want to build some really robust self-replicating systems for terraforming mars, could you build these blueprints for me?", and then it sends us a bunch of complicated designs that look like a sophisticated merger of minature robotics and biotechnology, should we build the blueprints?

Obviously not. An unfriendly AI can send that message just as well as a friendly AI can. If cooperation with humans is instrumentally useful, an unfriendly AI will be fine with cooperating. But then at every step, it would ask itself "am I now self-sufficient enough to stop holding myself back to avoid scaring the humans?"

This is not a problem you can solve if you build only unfriendly AIs. Not even if you build 10 unfriendly AIs and pit them against each other in the hope that they'll give you useful technology as they simultaneously betray each other and all cancel out in a cinematic climax. This is only a problem you can solve by actually building an AI that doesn't want to betray you.

[-]Peter Twieg3y10

I agree that there seems to be a lot of handwaving about the nanotech argument, but I can't say that I agree here:

>But for the sake of argument, let's say that the AGI does manage to create a nanotech factory, retain control, and still remain undetected by the humans.

>It doesn't stay undetected long enough to bootstrap and mass produce human replacement infrastructure.

It seems like the idea is that the AI would create nanomachines that it could host itself on while starting to grey goo enough of the Earth to overtake humanity. While humans would notice this at an early stage I could see it being possible that the AI would disperse itself quickly enough that it would be impossible to suppress totally, and thus humanity losing against a grey goo wave would be inevitable.

The alternative story that I've seen is that the AI engineers a dormant virus that is transmitted to most of humanity without generating alarm, and then suddenly activates to kill every human. Also seems handwavey but it does skip the "AI would need to establish its own nation" phase.

LESSWRONG
LW

LESSWRONG
LW

9

One Does Not Simply Replace the Humans

9

9

Zero Agency in the Physical World

Human Cooperation to Maximize Success

Multiple AGI, Multiple Arbitrary Goals