x
Is deception linear? Geometry-aware probes didn't beat linear ones, for detection or for control — LessWrong