JackS — LessWrong

107

2y

OthelloGPT learned a bag of heuristics

by jylin04, JackS, Adam Karvonen, and Can

Work performed as part of Neel Nanda's MATS 6.0 (Summer 2024) training program. TL;DR: This is an interim report on reverse-engineering Othello-GPT, an 8-layer transformer trained to take sequences of Othello moves and predict legal moves. We find evidence that Othello-GPT learns to compute the board state using many...

Jul 2, 2024 • 111