x
Efficiently Detecting Hidden Reasoning with a Small Predictor Model — LessWrong