The Pointers Problem

Contributors
11point7point4
1Noosphere89

The pointers problem is the observation that most humans would rather have an AI that acts on real-world human values than one that acts only on humans' estimates of their own values. The two diverge in many situations, because humans are neither all-seeing nor all-knowing. The term was introduced in a post of the same name.

Posts tagged The Pointers Problem
Sorted by: Most Relevant
Each entry lists relevance votes, karma, title, author(s), age, and comment count. Ω marks posts that also appear on the Alignment Forum.

8 | 111 | The Pointers Problem: Human Values Are A Function Of Humans' Latent Variables Ω | johnswentworth | 2y | 44 comments
3 | 62 | Don't design agents which exploit adversarial inputs Ω | TurnTrout, Garrett Baker | 4mo | 62 comments
2 | 111 | Robust Delegation Ω | abramdemski, Scott Garrabrant | 4y | 10 comments
2 | 57 | Alignment allows "nonrobust" decision-influences and doesn't require robust grading Ω | TurnTrout | 4mo | 41 comments
2 | 43 | Don't align agents to evaluations of plans Ω | TurnTrout | 4mo | 48 comments
2 | 38 | [Intro to brain-like-AGI safety] 9. Takeaways from neuro 2/2: On AGI motivation Ω | Steven Byrnes | 1y | 6 comments
2 | 32 | People care about each other even though they have imperfect motivational pointers? Ω | TurnTrout | 4mo | 25 comments
2 | 19 | Stable Pointers to Value III: Recursive Quantilization Ω | abramdemski | 5y | 4 comments
2 | 18 | Stable Pointers to Value II: Environmental Goals Ω | abramdemski | 5y | 2 comments
2 | 15 | Stable Pointers to Value: An Agent Embedded in Its Own Utility Function Ω | abramdemski | 6y | 9 comments
1 | 51 | The Pointers Problem: Clarifications/Variations Ω | abramdemski | 2y | 14 comments
1 | 37 | Human sexuality as an interesting case study of alignment | beren | 3mo | 26 comments
1 | 36 | Updating Utility Functions Ω | JustinShovelain, Joar Skalse | 10mo | 6 comments
1 | 9 | The Pointers Problem - Distilled | NinaR | 10mo | 0 comments