x
Evaluating honesty and lie detection techniques on a diverse suite of dishonest models — LessWrong