x
2B scoring model flags out-of-domain misalignment, suggesting specialist judges have potential for audits — LessWrong