LESSWRONG
LW

3224
Wikitags
Main
LW Wiki

Moral uncertainty

Edited by Eliezer Yudkowsky, et al. last updated 19th Feb 2025

"Moral uncertainty" in the context of AI refers to an agent with an "uncertain utility function". That is, we can view the agent as pursuing a utility function that takes on different values in different subsets of possible worlds.

For example, an agent might have a meta-utility function saying that eating cake has a utility of €8 in worlds where Lee Harvey Oswald shot John F. Kennedy and that eating cake has a utility of €10 in worlds where it was the other way around. This agent will be motivated to inquire into political history to find out which utility function is probably the 'correct' one (relative to this meta-utility function), though it will never be absolutely sure.

Moral uncertainty must be resolvable by some conceivable observation in order to function as uncertainty. Suppose for example that an agent's probability distribution ΔU over the 'true' utility function U asserts a dependency on a fair quantum coin that was flipped inside a sealed box then destroyed by explosives: the utility function is U1 over outcomes in the worlds where the coin came up heads, and if the coin came up tails the utility function is U2. If the agent thinks it has no way of ever figuring out what happened inside the box, it will thereafter behave as if it had a single, constant, certain utility function equal to 0.5⋅U1+0.5⋅U2.

Parents:
Preference framework
Children:
Ideal target
Subscribe
Discussion
Subscribe
Discussion
Posts tagged Moral uncertainty
47Normativity
Ω
abramdemski
5y
Ω
11
36Polymath-style attack on the Parliamentary Model for moral uncertainty
danieldewey
11y
74
1902018 AI Alignment Literature Review and Charity Comparison
Ω
Larks
7y
Ω
26
1302019 AI Alignment Literature Review and Charity Comparison
Ω
Larks
6y
Ω
18
93Six Plausible Meta-Ethical Alternatives
Wei Dai
11y
41
93Preliminary thoughts on moral weight
lukeprog
7y
49
92Ontological Crisis in Humans
Wei Dai
13y
69
60Ideas for benchmarking LLM creativity
gwern
9mo
11
58AI Alignment Podcast: An Overview of Technical AI Alignment in 2018 and 2019 with Buck Shlegeris and Rohin Shah
Ω
Palus Astra
5y
Ω
27
57Three kinds of moral uncertainty
Kaj_Sotala
13y
15
51Arguments for moral indefinability
Richard_Ngo
7y
10
37Altruism Under Extreme Uncertainty
lsusr
4y
9
27AXRP Episode 3 - Negotiable Reinforcement Learning with Andrew Critch
Ω
DanielFilan
5y
Ω
0
27Fundamental Uncertainty: Chapter 3 - Why don't we agree on what's right?
Gordon Seidoh Worley
3y
22
26Moral uncertainty vs related concepts
MichaelA
6y
13
Load More (15/70)
Add Posts