vgel

other presences:

https://vgel.me (theia@vgel.me)
https://x.com/voooooogel (thebes)
https://bsky.app/profile/vgel.me
discord: @vgel
signal: @vgel.01

149

2mo

Small Models Can Introspect, Too

Recent work by Anthropic showed that Claude models, primarily Opus 4 and Opus 4.1, are able to introspect, detecting when external concepts have been injected into their activations. But not all of us have Opus at home! By looking at the logits, we show that a 32B open-source model that at...

Dec 21, 2025

122

LESSWRONG