Inside the mind of a superhuman Go model: How does Leela Zero read ladders?
Some activations inside Leela Zero for randomly selected boards.
tl;dr: We did some interpretability on Leela Zero, a superhuman Go model. With a technique similar to the logit lens, we found that the residual structure of Leela Zero induces a preferred basis throughout the network, giving rise to persistent, interpretable channels. By...
Mar 1, 2023