Extending the Off-Switch Game: Toward a Robust Framework for AI Corrigibility — LessWrong