x
Alignment pretraining could backfire — LessWrong