x
Are we aligning the model or just its mask? — LessWrong