Collin Burns on Alignment Research And Discovering Latent Knowledge Without Supervision — LessWrong