Suppose there is a useful formulation of the alignment problem that is mathematically unsolvable. Suppose that, as a corollary, modifying your own mind while guaranteeing any non-trivial property of the resulting mind is also impossible.
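
For intuition, a theorem of roughly this flavor does exist for ordinary programs: Rice's theorem says that no non-trivial semantic property of programs is algorithmically decidable. Below is a minimal sketch of that statement in LaTeX, offered only as an analogy; identifying "minds" with programs is an assumption of the sketch, not something the question asserts.

```latex
% Rice's theorem, stated as an illustration of the kind of impossibility
% result hypothesized above. Identifying "minds" with programs is an
% assumption made only for this sketch.
\documentclass{article}
\usepackage{amsmath,amssymb,amsthm}
\newtheorem{theorem}{Theorem}
\begin{document}
\begin{theorem}[Rice]
Let $P$ be a non-trivial property of partial computable functions:
membership of $\varphi_e$ in $P$ depends only on the function computed,
and there exist indices $i$ and $j$ with $\varphi_i \in P$ and
$\varphi_j \notin P$. Then the index set
$\{\, e \in \mathbb{N} \mid \varphi_e \in P \,\}$ is undecidable.
\end{theorem}
\end{document}
```

Whether an analogous result could be proved about a mind trying to guarantee properties of its own successor is exactly what this question is asking about.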

Would that prevent a new AI from trying to modify itself?

Has this direction been explored before?

JBlack, May 08, 2023

It has been explored (multiple times even on this site), and doesn't avoid doom. It does close off some specific paths that might otherwise lead to doom, but not all or even most of them.

Some remaining problems:

  • An AI may be perfectly capable of killing everyone without self-improvement;
  • An AI may be capable of some large self-improvement step, but not aware of this theorem;
  • Self-improving AIs might not care whether the result is aligned with their former selves, and indeed may not even have any goals at all before self-improvement;
  • AIs may create smarter AIs without improving their own capabilities, knowing that the result won't be fully aligned but expecting that they can nevertheless keep the result under control (and be wrong about that);
  • In a population with many AIs, those that don't self-improve may be out-competed by those that do - leading to selection for AIs that self-improve regardless of consequences;
  • It is extremely unlikely that a mere change of computing substrate would meet the conditions of such a theorem, so an AI can almost certainly upgrade its hardware (possibly by many orders of magnitude) to run faster without modifying its mind in any fundamental way.

At this point my 5-minute timer on "think up ways things can still go wrong" ran out, and I just threw out the dumbest ideas and listed the rest. I'm sure with more thought other objections could be found.

Thanks!

You mention it has been explored multiple times even on this site. Do you have any specific posts in mind?

To be clear, I'm not suggesting that because of this possibility we can just hope that this is how it plays out and we will get lucky.

However, if we could find a hard limit like this, it seems like it would make the problem more tractable. It doesn't have to exist simply because we want it to exist. Searching for it still seems worthwhile.

Kinrany
The problem of creating a strong AI and surviving, that is. We'd still get Hanson's billions of self-directed ems (whole-brain emulations).