Exploring how OthelloGPT computes its world model
I completed this project for my bachelor's thesis and am now writing it up 2-3 months later. I think I found some interesting results that are worth sharing here. This post might be especially interesting for people who try to reverse-engineer OthelloGPT in the future. Summary * I suggest the...
Feb 2, 20258