"Designing agent incentives to avoid reward tampering", DeepMind
14th Aug 2019
Crossposted from the
AI Alignment Forum
. May contain more technical jargon than usual.
This is a linkpost for