"Designing agent incentives to avoid reward tampering", DeepMind

by gwern, 14th Aug 2019

Crossposted from the AI Alignment Forum. May contain more technical jargon than usual.