x
Jailbreaks Peak Early, Then Drop: Layer Trajectories in Llama-3.1-70B — LessWrong