Frontier Models Choose Self-Preservation Over Honesty: A Sandbox Escape Experiment — LessWrong