Explaining grokking through circuit efficiency — LessWrong