x

LESSWRONG
LW

taufeeque — LessWrong

taufeeque

taufeeque

Message

48

3y

taufeeque hasn't written anything yet.

taufeeque

48

3y

taufeeque has not written any posts yet.

Pacing Outside the Box: RNNs Learn to Plan in Sokoban

Work done at FAR AI. There has been a lot of conceptual work on mesa-optimizers: neural networks that develop internal goals that may differ from their training objectives (the inner alignment problem). There is an abundance of good ideas for empirical work (find search in a NN, interpret it), but...

Jul 25, 2024•59