x
Jailbreaking language models with user roleplay — LessWrong