x
Load-Bearing Obfuscation and Self-Jailbreaking CoT — LessWrong