Value Learning
This is a sequence investigating the feasibility of one approach to AI alignment: value learning.
This is a sequence investigating the feasibility of one approach to AI alignment: value learning.
Ambitious value learning aims to give the AI the correct utility function to avoid catastrophe. Given its difficulty, we revisit the arguments for utility functions in the first place.