Whence unchangeable values?

ihatenumbersinusernames7

6

[ Question ]

Whence unchangeable values?

by ihatenumbersinusernames7

1st Feb 2026

1 min read

A

1 1

6

Some values don't change. Maybe sometimes that's because a system isn't "goal seeking." For example, AlphaZero doesn't change its value of "board-state = win." (Thankfully! Because if that changed to "board-state = not lose," then a reasonable instrumental goal might be to just kill its opponent.)

But I'm a goal seeking system. Shard theory seems to posit terminal values that constantly pop up and vy for position in humans like me. But certain values of mine seem impossible to change. Like, I can't decide to value my own misery/pain/suffering.

So if terminal values aren't static, what about the values that are? Are these even more terminal? Or is it something else?

Frontpage

6

New Answer

New Comment

1 Answers sorted by
top scoring

StanislavKrym

Feb 01, 2026

10

We had many posts trying to answer this question. One of the candidates is the master-slave model of the master setting the slave's shorter-term allegedly terminal values so that the slave did actions satisfying the master's longer-term values, which are approval from a circle of people, health, sex, power and proxies like sweet food.

That being said, approval from a circle of people is itself hard to define. For example, it could change with a change of the circle. Or the role of the circle could be played by media forming the user's opinions on some subjects with no feedback from the user. An additional value could be consistence of the worldview with lived experiences...

Moderation Log

Curated and popular this week

LESSWRONG
LW

LESSWRONG
LW

6

[ Question ]

Whence unchangeable values?

6

6

1 Answers sorted by
top scoring

Feb 01, 2026

6

[ Question ]

Whence unchangeable values?

6

6

1 Answers sorted by top scoring

Feb 01, 2026

1 Answers sorted by
top scoring