Applying Network Motif Analysis to Transformer Attribution Graphs
TL;DR: Analyzing attribution graphs manually is tedious and doesn't scale. To help automate some of this process, I ported network motif analysis (the technique Uri Alon used to decode gene regulatory networks) into a tool for automatically fingerprinting the structural properties of transformer circuits. I ran it on 99 attribution...
Feb 71