x
How to use and interpret activation patching — LessWrong