Mechanistic Interpretability Puzzles
Mech Interp Puzzle 1: Suspiciously Similar Embeddings in GPT-Neo
Neel Nanda
2y
Mech Interp Puzzle 2: Word2Vec Style Embeddings
Neel Nanda
2y