x
Persona vectors: monitoring and controlling character traits in language models — LessWrong