Matt Goldenberg — LessWrong

It's true our preferences get more stable as we get older but I still think over the course of decades they change. We're typically bad at predicting what we'll want in 10 years even at much older ages.

Alignment as uploading with more steps

Matt Goldenberg1d30

For instance I bet the you of 4 or 5 would want you to spend your money on much more candy and toys than the you of today.

Alignment as uploading with more steps

Matt Goldenberg2d30

I doubt this, it's very hard to achieve giving developmental issues with stuff like shifting hormones

Alignment as uploading with more steps

Matt Goldenberg2d50

but we usually endorse the way that our values change over time, so this isn’t necessarily a bad thing.

I'm pretty skeptical of this, of course it seems that way because we are the ones with the new values, but I think this is like 70% just a tautology of valuing valuing the things we currently value, and 20% a psychological thing that justifies our decisions in retrospect and make them seem more consistent than they are, and only 10% any sort of actual consistency effect where if I asked myself at time x if it endorses the value changes I've made at future time y, past me would say "yes, y is better than x".

Also, I find it hard to imagine hating my past self so much that I would want to kill him or allow him to be killed.

I could easily imagine a future version of myself after e.g. hundreds of years of value drift that I would see as horrifying and no longer consider them me.

Alignment as uploading with more steps

Matt Goldenberg3d52

I'm pretty sure that the me from 10 years ago is aligned to different values than the me of today, so I suspect a copy running much faster than me would quickly diverge.

And that's just a normal speed running version of me one that experienced the world much faster would have such a different experience of the world, as a small example conversations would be more boring but also I'd be more skilled at them, so things would diverge much faster.

High-level actions don’t screen off intent

Matt Goldenberg3d20

Only if your ethics are purely utilitarian.

[Anthropic] A hacker used Claude Code to automate ransomware

Matt Goldenberg20d914

it seems like the operations were ongoing, and they disrupted them.to me it appears a normal and legitimate use of the word.

[Anthropic] A hacker used Claude Code to automate ransomware

Matt Goldenberg20d82

i doubt this very much, one of the most consistent trends we see is that once a capability is available in any model, the cost of inference and training an open source model goes down over time.

A speculation on enlightenment

Matt Goldenberg21d22

In other words, classically, enlightenment would be much more in the direction of removing the causes and conditions of consciousness - see E. G. dependent origination

A speculation on enlightenment

Matt Goldenberg21d42

Fwiw I think this is close to reversing the udnerstanding of enlightenment

LESSWRONG
LW

LESSWRONG
LW

Sequences

Posts

Wikitag Contributions

Comments