1 Layer Induction Heads and Some Research — LessWrong