Eugene D — LessWrong

Superintelligent AI is necessary for an amazing future, but far from sufficient

Is super-intelligent AI necessarily AGI (for this amazing future), or can it be ANI ?

i.e. why insist on all of the work-arounds we force with pursuing AGI, when, with ANI, don't we already have Safety, Alignment, Corrigibility, Reliability, and super-human ability, today?

Eugene

Where I agree and disagree with Eliezer

Eugene D4y21

OK thanks, I guess I missed him differentiating between 'solve alignment first, then trust', versus 'trusting first, given enough intelligence'. Although I think one issue w/having a proof is that we (or a million monkeys, to paraphrase him) still won't understand the decisions of the AGI...? ie we'll be asked to trust the prior proof instead of understanding the logic behind each future decision/step which the AGI takes? That also bothers me, because, what are the tokens which comprise a "step"? Does it stop 1,000 times to check with us ... (read more)

Where I agree and disagree with Eliezer

Eugene D4y10

Thank you--btw before I try responding to other points, here's the Ben G vid to which I'm referring. Starting around 52m, for a few minutes, for that particular part anyway:

Where I agree and disagree with Eliezer

Eugene D4y10

I've heard a few times that AI experts both 1) admit we don't know much about what goes on inside, even as it stands today, and 2) we expect to extend more trust to the AI even as capabilities increase (most recently Ben Goertzel).

I'm curious to know if you expect explainability to increase in correlation with capability? i.e. or can we use Ben's analogy that 'I expect my dog to trust me, both bc I'm that much smarter, and I have a track-record of providing food/water for him' ?

thanks!

Eugene

Where I agree and disagree with Eliezer

Eugene D4y20

I wonder when Alignment and Capability will finally be considered synonymous, so that the efforts merge into one -- bc that's where any potential AI-safety lives, I would surmise.

Where I agree and disagree with Eliezer

Eugene D4y60

I for one really appreciate the 'dumb-question' area :)

AGI Safety FAQ / all-dumb-questions-allowed thread

Eugene D4y60

When AI experts call upon others to ponder, as EY just did, "[an AGI] meant to carry out some single task" (emphasis mine), how do they categorize all the other important considerations besides this single task?

Or, asked another way, where do priorities come into play, relative to the "single" goal? e.g. a human goes to get milk from the fridge in the other room, and there are plentiful considerations to weigh in parallel to accomplishing this one goal -- some of which should immediately derail the task due to priority (I notice the power is o... (read more)

AGI Safety FAQ / all-dumb-questions-allowed thread

Eugene D4y10

Does this remind you of what I'm trying to get at? bc it sure does, to me:

https://twitter.com/ESYudkowsky/status/1537842203543801856?s=20&t=5THtjV5sUU1a7Ge1-venUw

but I'm prob going to stay in the "dumb questions" area and not comment :)

ie. "the feeling I have when someone tries to teach me that human-safety is orthogonal to AI-Capability -- in a real implementation, they'd be correlated in some way"

AGI Safety FAQ / all-dumb-questions-allowed thread

Eugene D4y32

That makes sense. My intention was not to argue from the position of it becoming a psychopath though (my apologies if it came out that way)...but instead from a perspective of an entity which starts-out as supposedly Aligned (centered-on human safety, let's say), but then, bc it's orders of magnitude smarter than we are (by definition), it quickly develops a different perspective. But you're saying it will remain 'aligned' in some vitally-important way, even when it discovers ways the code could've been written differently?

AGI Safety FAQ / all-dumb-questions-allowed thread

Eugene D4y10

thank you. Make some sense...but does "rewriting its own code" (the very code we thought would perhaps permanently influence it before it got-going) nullify our efforts at hardcoding our intentions?

AGI Safety FAQ / all-dumb-questions-allowed thread

Eugene D4y60

Why do we suppose it is even logical that control / alignment of a superior entity would be possible?

(I'm told that "we're not trying to outsmart AGI, bc, yes, by definition that would be impossible", and I understand that we are the ones who "create it" (so I'm told, therefore, we have the upper-hand bc of this--somehow in building it that provides the key benefit we need for corrigibility...

What am I missing, in viewing a superior entity as something you can't simply "use" ? Does it depend on the fact that the AGI is not meant to have ... (read more)

AGI Safety FAQ / all-dumb-questions-allowed thread

Eugene D4y10

YES.

You are a gentleman and a scholar for taking the time on this. I wish I could've explained it more clearly from the outset.

AGI Safety FAQ / all-dumb-questions-allowed thread

Eugene D4y10

https://twitter.com/KerryLVaughan/status/1536365808594608129?s=20&t=yTDds2nbg4F4J3wqXbsbCA

AGI Safety FAQ / all-dumb-questions-allowed thread

Eugene D4y20

Yes you hit the nail on the head understanding my point, thank you. I also think this is what Yann is saying, to go out on a limb: He's doing AI-safety simultaneously, he considers alignment AS safety.

I guess, maybe, I can see how the 2nd take could be true..but I also can't think of a practical example, which is my sticking point. Of course, a bomb which can blow-up the moon is partly "capable", and there is partial-progress to report --but only if we judge it based on limited factors, and exclude certain essential ones (e.g. navi... (read more)

AGI Safety FAQ / all-dumb-questions-allowed thread

Eugene D4y10

I'm sorry, I think i misspoke--I agree with all that you said about it being different. But when I've attempted to question the Orthogonality of safety with AI-safety experts, it seems as if I was told that safety is independent of capability. First, I think this is a reason why AI-Safety has been relegated to 2nd-class status...and second, I can't see why it is not, like I think Yann puts it, central to any objective (i.e. an attribute of competency/intelligence) we give to AGI (presuming we are talking about real-world goals and not jus... (read more)

AGI Safety FAQ / all-dumb-questions-allowed thread

Eugene D4y10

when i asked about singling-out safety, i agree about it being considered different, however, what i meant: why wouldn't safety be considered as 'just another attribute' by which we can judge the success/intelligence of the AI ? that's what Yann seems to be implying? how could it be considered orthogonal to the real issue--we judge the AI by its actions in the real world, the primary concern is its effect on humanity, and we consider those actions on a scale of intelligence, and every goal (I would presume) has some semblance of embedded safety consideration...

AGI Safety FAQ / all-dumb-questions-allowed thread

Eugene D4y30

Strictly speaking about superhuman AGI: I believe you summarize the relative difficulty / impossibility of this task :) I can't say I agree that the goal is void of human-values though (I'm talking about safety in particular--not sure if that's make a difference?) --seems impractical right from the start?

I also think these considerations seem manageable though, when considering the narrow AI that we are producing as of today. But where's the appetite to continue on the ANI road? I can't really believe we wouldn't want more of the same, in different fields of endeavor...

AGI Safety FAQ / all-dumb-questions-allowed thread

Eugene D4y10

great stuff.

I'm saying that no one is asking that safety be smuggled-in, or obtained "for free", or by default --I'm curious why it would be singled-out for the Thesis, when it's always a part of any goal, like any other attribute of the goal in question? if it fails to be safe then it fails competency to perform properly...whether it swerved into another lane on the highway, or it didn't brake fast enough and hit someone, both not smart things.

"the smarter the AI is, the safer it becomes" -- eureka, but this seems un-orthogonal, dang-near corr... (read more)

AGI Safety FAQ / all-dumb-questions-allowed thread

Eugene D4y30

OK again I'm a beginner here so pls correct me, I'd be grateful:

I would offer that any set of goals given to this AGI would include the safety-concerns of humans. (Is this controversial?) Not theoretical intelligence for a thesis, but AGI acting in the world with the ability to affect us. Because of the nature of our goals, it doesn't even seem logical to say that the AGI has gained more intelligence without also gaining an equal amount of safety-consciousness.

e.g. it's either getting better at safely navigating the highway, or it'... (read more)

AGI Safety FAQ / all-dumb-questions-allowed thread

Eugene D4y60

Why does EY bring up "orthogonality" so early, and strongly ("in denial", "and why they're true") ? Why does it seem so important that it be accepted? thanks!