x
Can LLMs learn Steganographic Reasoning via RL? — LessWrong