Gated Attention Blocks: Preliminary Progress toward Removing Attention Head Superposition — LessWrong