Distillation of 'Do language models plan for future tokens' — LessWrong