What Can Wittgenstein Teach Us About LLM Safety Research?
The biggest event of 2025 for me was entering the field of AI safety research. In particular, I work with collaborators at Geodesic Research on designing a suite of health metrics to evaluate "pathologies" in the chain-of-thought reasoning of large language models, such as post-hoc, internalized, and encoded reasoning....
Dec 23, 2025