Open Source Replication of Anthropic’s Crosscoder paper for model-diffing — LessWrong