AI Safety Thursday: The Limitations of Reinforcement Learning for LLMs in Achieving AI for Science — LessWrong