x
Power Steering: Behavior Steering via Layer-to-Layer Jacobian Singular Vectors — LessWrong