x
Reinforcement Learning, Agency and Taste — LessWrong