Only a hack can solve the shutdown problem
(My first post on LessWrong. It seems the most recent Welcome Thread is from 2020, so I'm making a top-level post. This an edited version of my submission to the AI Alignment Awards.) Abstract: First, we offer a formalisation of the shutdown problem from [1], and we show that solutions...
Jul 15, 20235