LESSWRONG

AI Psychology, Anthropic (org), Category theory, Interpretability (ML & AI), Language Models (LLMs), Logic & Mathematics, AI
Frontpage

Category-Theoretic Wanderings into Interpretability

by unruly abstractions
2nd Sep 2025
1 min read
2 comments
Trevor Hill-Hand, 3h

I enjoyed reading the paper but did not find the screenshots here in the post a helpful addition; I think I would have just quoted the introduction, if converting it into a full article was infeasible.

It's also fun seeing other Eugenia Cheng fans!

unruly abstractions, 1h

Good feedback!

I am still trying to figure out my workflow. I like writing in Typst, but I realized it's not very easy to go from Typst -> LessWrong. Also, a lot of my writing is sorta experimental, and I'm trying to determine which parts of it should be directed to which platforms/audiences.

And yes, Eugenia Cheng is amazing :)

Click here to read the full paper at unrulyabstractions.com