Wireheading and misalignment by composition on NetHack — LessWrong