Toy Models of Superposition: what about BitNets? — LessWrong