sbaumohl
I'm a senior at the University of Wisconsin-Madison, a longtime member of the Wisconsin AI Safety Initiative, a software engineer, and an aspiring AI safety researcher.
Go Badgers!
I am often frustrated by those who promote vibes and deliver aimless soliloquies. We would often be better served by speaking more specifically, concisely, and boldly. From the average meeting room to the American political landscape, we are harming ourselves by speaking vaguely, and current roadblocks in policymaking across many...
TLDR: Anthropic's recent paper "When Models Manipulate Manifolds" proves that there are meaningful insights in the geometries of LLM activation space. This matches my intuition on the direction of interpretability research, and I think AI safety researchers should build tools to uncover more geometric insights. This paper is a big...