x
Improved formalism for corruption in DIRL — LessWrong