x
Towards understanding-based safety evaluations — LessWrong