x

LESSWRONG

LW

arunraja-hub — LessWrong

arunraja-hub

arunraja-hub

Message

17

Ω

10

1

5y

arunraja-hub

17

Ω

10

5y

Extraction of human preferences 👨→🤖

Introduction Developing safe and beneficial reinforcement learning (RL) agents requires making them aligned with human preferences. An RL agent trained to fulfil any objective in the real world will probably have to learn human preferences in order to do well. This is because humans live in the real world, so...

Aug 24, 2021•18