Soft optimization makes the value target bigger — LessWrong