Pacing Outside the Box: RNNs Learn to Plan in Sokoban
Work done at FAR AI. There has been a lot of conceptual work on mesa-optimizers: neural networks that develop internal goals that may differ from their training objectives (the inner alignment problem). There is an abundance of good ideas for empirical work (find search in a NN, interpret it), but...
Jul 25, 2024

