x
Reasoning Long Jump: Why we shouldn’t rely on CoT monitoring for interpretability — LessWrong