When AI critique works even with misaligned models — LessWrong