Simple experiments with deceptive alignment — LessWrong