x
Toy transformers may represent belief-state geometry optimally but not minimally — LessWrong