On the lethality of biased human reward ratings — LessWrong