Comments

Twitter.

https://threadreaderapp.com/thread/1767710372306530562.html <= This link will take you to the thread, but NOT hosted on Twitter.

True and important. I don't mean to imply otherwise. Evolution failed at its "alignment goal". 

If (as I'm positing here) evolution successfully constructed humans to be aligned with some other concept, one that isn't the alignment goal, and that alignment generalized well, that doesn't mean that evolution failed any less hard.

But it does seem notable if that's what happened! Because it's some evidence about alignment generalization.

Well, there's convergent structure in the observed behavior. There's a target that seems pretty robust to a bunch of different kinds of perturbations and initial conditions. 

It's possible that that's implemented by a kludge of a bunch of different narrow adaptations. That's even the null hypothesis. 

But the fact that (many) people will steer systematically towards opportunities for high prestige, even when what that looks like is extremely varied, seems to me like evidence for an implicit concept that's hooked up to some planning machinery, rather than (only) a collection of adaptations that tend to produce this kind of behavior?

Eye-balling it? I'm hoping commenters will help me distinguish between these cases, hence my second footnote.

For example, if Bob wants to be a movie star, then from the outside you and I can say that Bob is status-seeking, but it probably doesn’t feel like that to Bob; in fact Bob might not know what the word “status” means, and Bob might be totally oblivious to the existence of any connection between his desire to be a movie star and Alice’s desire to be a classical musician and Carol’s desire to eat at the cool kids table in middle school.

That seems true to me? I don't mean that humans become aligned with their explicit verbal concept of status. I mean that (many) humans are aligned with the intuitive concept that they somehow learn over the course of development.
 

I think it’s possible for the genome to build “it’s intrinsically motivating to believe that other people like me” into the brain whereas it would not be analogously possible for the genome to build “it’s intrinsically motivating to have a high inclusive genetic fitness” into the brain. There are many reasons that the latter is not realistic, not least of which is that inclusive genetic fitness is only observable in hindsight, after you’re dead.

Makes sense!

 

A super relevant point. If we try to align our AIs with something, and they end up robustly aligned with some other proxy thing, we definitely didn't succeed. 

But, it's still impressive to me that evolution hooked up general planning capabilities to a (learned) abstract concept, at all. 

Like, there's this abstract concept, which varies a lot in its particulars from environment to environment, and which the brain has to learn to detect apart from those particulars. Somehow the genome is able to construct the brain such that the motivation circuitry can pick out that abstract concept, after it is learned (or as it is being learned), and use it as a major criterion of the planning and decision machinery. And the end result is that the organism as a whole ends up not that far from an [abstract concept]-maximizer.

This is a lot more than I might expect evolution to be able to pull off, if I thought that our motivations were a hodge-podge of adaptations that cohere (as much as they do) into godshatter.

My point is NOT that evolution killed it and alignment is easy. My point is that evolution got a lot further than I would have guessed was possible.
 

Those are motivations, but they (mostly) don't have the type signature of "goals"; they have the type signature of "drives".

I pursue interesting stuff because I'm curious. That doesn't require me to even have a concept of curiosity; it could in principle be steering me without my awareness. My planning process might use curiosity, but it isn't aligned with curiosity, in the sense that we don't (usually) make plans that maximize our curiosity. We just do what's interesting.

In contrast, social status is a concept that humans learn, and it does look like the planning process is aligned with the status concept, in that (some) humans habitually make plans that are relatively well described as status maximizing. 

Or, another way of saying it: our status motivations are not straightforward adaptation-execution. They recruit general intelligence in service of this concept, in much the way that we would want an AGI to be aligned with a concept like the Good or corrigibility.

Romantic love: again, people act on it (including using their general intelligence), but their planning process is not in general aligned with maximizing romantic love. (I'm editorializing about human nature here, but it looks to me like romantic love is mostly a strategy for getting other goals.)

Altruism: it's debatable whether most instances of maximizing altruistic impact are better described as status maximization. Regardless, this is an overriding strategic goal, recruiting general intelligence, for a very small fraction of humans.

A great counterpoint! 

Yeah, I wrote some years ago about how status isn't a special feature that humans attribute to each other for contingent social psychology reasons, but rather falls out very naturally as an instrumentally convergent resource.

Yeah, when I consider that, it does undercut the claim that evolution shaped us to optimize for status. It shaped us to want things, and also to find strategies to get them.

Can you give some examples of the prompts you're using here? In what ways are you imagining it helping with alignment?

Given that there is no legislation to enforce a slowdown, it is preferable that Anthropic-style AIs be state of the art rather than OpenAI-style, as long as the AI safety community uses Claude heavily during that time.

Can someone elaborate on why this would be?
