x
Tree Transformers: A step towards generalizing the transformer architecture — LessWrong