x
SVD on Weight Differences for Model Auditing — LessWrong