SAE on activation differences — LessWrong