A gentle introduction to mechanistic anomaly detection — LessWrong