x
Simple experiments with deceptive alignment — LessWrong