Inference-time Generative Debates on Coding and Reasoning Tasks for Scalable Oversight — LessWrong