How's it going? Reinforcement learning in language models recruits a functional welfare axis — LessWrong