LESSWRONG
LW

1414
Matthew Shinkle
53240
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
Hospitalization: A Review
Matthew Shinkle6d30

Very glad to hear you're okay!

Point Out Anything Suspicious

I've found this highly valuable. Doctors/nurses almost indubitably know more than I do, but also indubitably have way less time to obsessively stare at at scans and charts for hours on end.

And congrats on the wedding!

Reply
How To Become A Mechanistic Interpretability Researcher
Matthew Shinkle2mo30

PIBBSS definitely does some mech interp, and I believe AI safety camp has some mech interp projects.

Reply
How To Become A Mechanistic Interpretability Researcher
Matthew Shinkle2mo40

For people looking for MATS-like programs in other locations, with different timelines, etc. this page is a great resource for finding other training programs, a number of which (PIBBSS, Pivotal, LASR Labs, others) include mech interp research: https://www.aisafety.com/map

Reply
Open Thread - Summer 2025
Matthew Shinkle3mo40

Hello! Long-time lurker, planning to post research results on here in the near future. I'm a currently a PIBBSS research fellow, working on LLM interpretability relating to activation plateaus and deception probes. I'll be joining Anna Leshinskaya's Relational Cognition lab in the fall as a postdoc, working on moral reasoning in LLMs. Feel free to reach out if you have any ideas, questions, etc. on any of these topics!

Reply
25Activation Plateaus: Where and How They Emerge
3d
0
36Automating AI Safety: What we can do today
3mo
0