LLMs are Capable of Misaligned Behavior Under Explicit Prohibition and Surveillance — LessWrong