Value Learning

Oct 29, 2018

by rohinmshah

This is a sequence investigating the feasibility of one approach to AI alignment: ambitious value learning.

(The sequence will update with a second half on related topics in a few weeks.)

Preface to the Sequence on Value Learning

Ambitious Value Learning

What is ambitious value learning?

The easy goal inference problem is still hard

Humans can be assigned any values whatsoever…

Latent Variables and Model Mis-Specification

Model Mis-specification and Inverse Reinforcement Learning

Future directions for ambitious value learning

Goals vs Utility Functions

Ambitious value learning aims to avoid catastrophe by giving the AI the correct utility function. Given the difficulty of this approach, we revisit the arguments for framing AI goals as utility functions in the first place.

Intuitions about goal-directed behavior

Coherence arguments do not imply goal-directed behavior
