x
Defeating Introspection Adapters (and Why Threat Models Matter) — LessWrong