Probe accuracy and causal sensitivity diverged 3.6x at the same layer. Here's what I think is happening.
This is still exploratory. I am documenting a specific failure case in which two tools that should agree with each other instead give contradictory readings at the same layer. I have a candidate explanation but haven't confirmed it yet. These two models differ on...
May 25