x
Probing CODI's Latent Reasoning Chain with Logit Lens and Tuned Lens — LessWrong