x
1.75 ASR HARMBENCH & 0% HARMFUL RESPONSES FOR MISALIGNMENT. — LessWrong