Jack Morris has posted this thread https://x.com/jxmnop/status/1925224612872233081 about his paper "Harnessing the Universal Geometry of Embeddings"
Have others thought through what this means for the notion of fundamentally alien internal ontologies? Would love any ideas! Sorry if I missed a post on it.
Thanks for this reference. It arguably means aliens don't have alien ontologies. Previous related discussion.
Let me know if anyone has thoughts on this question I just posted as well: Does the Universal Geometry of Embeddings paper have big implications for interpretability?
My default assumption on all empirical ML papers is that the authors Are Not Measuring What They Think They Are Measuring.
It is evidence for the natural abstraction hypothesis in the technical sense that P[NAH|paper] is greater than P[NAH], but in practice that's just not a very good way to think about "X is evidence for Y", at least when updating on published results. The right way to think about this is "it's probably irrelevant".
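To make the "technically evidence, practically irrelevant" distinction concrete, here is a minimal sketch of Bayes' rule in odds form. The prior and both likelihood ratios are made-up illustrative numbers, not estimates anyone in this thread has endorsed.

```python
def posterior(prior: float, likelihood_ratio: float) -> float:
    """Bayes' rule in odds form: posterior odds = prior odds * likelihood ratio."""
    prior_odds = prior / (1.0 - prior)
    post_odds = prior_odds * likelihood_ratio
    return post_odds / (1.0 + post_odds)


# Hypothetical prior on the natural abstraction hypothesis (NAH).
prior_nah = 0.5

# Assumed likelihood ratio barely above 1: the paper is only slightly more
# likely to appear in a world where NAH is true than in one where it is false.
weak = posterior(prior_nah, 1.05)

# Assumed likelihood ratio for a hypothetical, much more decisive result.
strong = posterior(prior_nah, 20.0)

print(f"prior={prior_nah:.3f} weak={weak:.3f} strong={strong:.3f}")
```

With a likelihood ratio close to 1, the posterior is strictly above the prior, so the paper counts as evidence in the technical sense, yet the shift is too small to matter for any decision.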
Thank you, John! Is there any high-bit or confounder-controlling evidence that would move your prior? Say, something like English + some other language? (Also, I might be missing something deeper about the heuristic in general; if so, I apologize!)
I found this paper by Amir Zur and others really interesting: "It's Owl in the Numbers: Token Entanglement in Subliminal Learning", where they try to explain subliminal learning (the notion that a "language model fine-tuned on seemingly meaningless data from a teacher model acquires the teacher's hidden behaviors").
The researchers found that certain tokens like "owl" and "087" can become entangled during training: increasing the probability of one also increases the probability of the other.
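As a rough intuition for how two tokens' probabilities could move together, here is a toy softmax sketch. It is not the paper's method: the four-token vocabulary, the 2-D output vectors, and the choice to give "owl" and "087" overlapping directions are all assumptions made purely for illustration.

```python
import math

# Hypothetical 2-D output (unembedding) vectors for a tiny vocabulary.
# "owl" and "087" are assumed to point in similar directions.
UNEMBED = {
    "owl": (1.0, 0.0),
    "087": (0.8, 0.6),
    "cat": (-1.0, 0.0),
    "123": (0.0, -1.0),
}


def token_probs(hidden):
    """Softmax over the dot products of the hidden state with each output vector."""
    logits = {t: hidden[0] * v[0] + hidden[1] * v[1] for t, v in UNEMBED.items()}
    z = sum(math.exp(x) for x in logits.values())
    return {t: math.exp(x) / z for t, x in logits.items()}


# Start from a hidden state with no preference, then push it toward "owl".
p_before = token_probs((0.0, 0.0))
p_after = token_probs((1.0, 0.0))

for tok in UNEMBED:
    print(f"{tok}: {p_before[tok]:.3f} -> {p_after[tok]:.3f}")
```

In this toy setup, moving the hidden state toward "owl" also raises the probability of "087", because their output vectors overlap, while the unrelated tokens lose probability.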
Fascinating, and I would be curious to hear what others think!
You may be interested in this discussion then, and also the article you mention is posted on LW too.
What is the total number of dinosaur species that existed?
What statistical approach would you use to make your estimate?
Current estimates for animals on earth seem to be 1M+ known species?