x
Towards Shutdownable Agents: Generalizing Stochastic Choice in RL Agents and LLMs — LessWrong