Introspection Adapters: Training LLMs to Report Their Learned Behaviors — LessWrong