Matthew Shinkle — LessWrong

Activation Plateaus: Where and How They Emerge

By design, LLMs perform nonlinear mappings from their inputs (text sequences) to their outputs (next-token generations). Some of these nonlinearities are built into the model architecture, but others are learned by the model and may be important parts of how the model represents and transforms information. By studying different aspects...

Oct 17, 2025 • 37

Automating AI Safety: What we can do today

There have been multiple recent calls for the automation of AI safety and alignment research. There are likely many people who would like to contribute to this space, but would benefit from clear directions for how to do so. Stemming from a recent SPAR project and in light of limitations...

Jul 25, 2025 • 37