The Problem with Reasoners by Aidan McLaughin
Some critique on reasoning models like o1 (by OpenAI) and r1 (by Deepseek). > OpenAI admits that they trained o1 on domains with easy verification but hope reasoners generalize to all domains. Whether or not they generalize beyond their RL training is a trillion-dollar question. Right off the bat, I’ll...
Nov 25, 202412