A Solution to Sandbagging and other Self-Provable Misalignment: Constitutional AI Detectives — LessWrong