Weight-diff SVD for LLM Monitoring — LessWrong