x
Teaching Models to Dream of Better Monitors through Evaluator Conditioned Training — LessWrong