LESSWRONG
LW

455
Matthew Shinkle
34130
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
How To Become A Mechanistic Interpretability Researcher
Matthew Shinkle1mo30

PIBBSS definitely does some mech interp, and I believe AI safety camp has some mech interp projects.

Reply
How To Become A Mechanistic Interpretability Researcher
Matthew Shinkle1mo40

For people looking for MATS-like programs in other locations, with different timelines, etc. this page is a great resource for finding other training programs, a number of which (PIBBSS, Pivotal, LASR Labs, others) include mech interp research: https://www.aisafety.com/map

Reply
Open Thread - Summer 2025
Matthew Shinkle3mo40

Hello! Long-time lurker, planning to post research results on here in the near future. I'm a currently a PIBBSS research fellow, working on LLM interpretability relating to activation plateaus and deception probes. I'll be joining Anna Leshinskaya's Relational Cognition lab in the fall as a postdoc, working on moral reasoning in LLMs. Feel free to reach out if you have any ideas, questions, etc. on any of these topics!

Reply
36Automating AI Safety: What we can do today
2mo
0