x
Stress-Testing Alignment Audits With Prompt-Level Strategic Deception — LessWrong