Understanding the Information Flow inside Large Language Models — LessWrong