Pacing Outside the Box: RNNs Learn to Plan in Sokoban — LessWrong