Useful starting code for interpretability — LessWrong