How Do Induction Heads Actually Work in Transformers With Finite Capacity? — LessWrong