x
Semantic Similarity, Not Model Capability, Drives Monitor Evasion in AI Control — LessWrong