A non-adversarial prose text produces strong late-layer divergence in Gemma-3. I measured it; I'm not sure what it means. — LessWrong