x
An Introduction to AI Sandbagging — LessWrong