Alignment does not require adversarially robust grading — LessWrong