Do we want alignment faking? — LessWrong