x
Manipulating Self-Preference In LLMs — LessWrong