Decompiling Tracr Transformers - An interpretability experiment
Note: This blog post is cross-posted from my personal website, where I expect a broader audience than here. If you are familiar with the difficulty and significance of neural network interpretability, skip to the third subsection, titled "In defence of fighting fire with fire".

Summary: This is a post about...
