x

Kriz Tahimic

Subscribe

Message

CS Undergraduate at De La Salle University | Interested in AI and Mechanistic Interpretability

3

2

4y

Kriz Tahimic

Subscribe

Message

CS Undergraduate at De La Salle University | Interested in AI and Mechanistic Interpretability

3

2

4y

Attribution-based parameter decomposition

Kriz Tahimic1y41

I think renaming "circuit" as "mechanism" is the right call. Prior to reading this post, when they talked about circuits, I thought they only meant cross-layer features.

Reply

Sparsify: A mechanistic interpretability research agenda

Kriz Tahimic1y10

This is a paradigm shift for me. When I entered the field, I thought the uniqueness of MechInterp was that it treated AI like natural science (e.g., biology); hence, I assumed the field primarily does hypothesis testing. However, I agree that augmenting it with a big data-driven approach is the way to move forward.

Reply