x
Sophistication-Disinhibition Relationship in Language Models [Epistemic status: robust findings, active research, need peer review] — LessWrong