x
Teaching Models to Dream of Better Monitors through Evaluation Conditioned Training — LessWrong