LESSWRONG
LW

mpopv
4610
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
AGI Safety FAQ / all-dumb-questions-allowed thread
mpopv3y50

Assuming we have control over the utility function, why can't we put some sort of time-bounding directive on it?

i.e. "First and foremost, once [a certain time] has elapsed, you want to run your shut_down() function. Second, if [a certain time] has not yet elapsed, you want to maximize paperclips."

Is that problem that the AGI would want to find ways to hack around the first directive to fulfill the second directive? If so, that would seem to at least narrow the problem space to "find ways of measuring time that cannot be hacked before the time has elapsed".

Reply
No posts to display.