x
Selectively reducing eval awareness and murder in Gemma 3 27B via steering — LessWrong