"Designing agent incentives to avoid reward tampering", DeepMind

bygwern 6d14th Aug 201915 comments

23

Ω 6


Crossposted from the AI Alignment Forum. May contain more technical jargon than usual.