x
Latent Reasoning Sprint #3: Activation Difference Steering and Logit Lens — LessWrong