Killing Recurrent Memory Over Self Attention? — LessWrong