Grokking revisited: reverse engineering grokking modulo addition in LSTM — LessWrong