SAEs (usually) Transfer Between Base and Chat Models — LessWrong