Matthew Shinkle — LessWrong

67

2

5

1y

Activation Plateaus: Where and How They Emerge

By design, LLMs perform nonlinear mappings from their inputs (text sequences) to their outputs (next-token predictions). Nonlinearities are built into the model architecture, but how models use these nonlinearities to represent information is still an open question. By studying different aspects of this nonlinear mapping, we can...

Oct 17, 2025 • 37

Automating AI Safety: What we can do today

There have been multiple recent calls for the automation of AI safety and alignment research. There are likely many people who would like to contribute to this space, but would benefit from clear directions for how to do so. Stemming from a recent SPAR project and in light of limitations...

Jul 25, 2025 • 37