Identifying "Deception Vectors" In Models — LessWrong