Detecting Strategic Deception Using Linear Probes — LessWrong