An Interpretability Illusion for Activation Patching of Arbitrary Subspaces — LessWrong