Takes on "Alignment Faking in Large Language Models" — LessWrong