Vestigial reasoning in RL — LessWrong