LESSWRONG
LW

699
yuwenlu
0010
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No posts to display.
No wikitag contributions to display.
Case Study: Interpreting, Manipulating, and Controlling CLIP With Sparse Autoencoders
yuwenlu7mo10

Hey! Late to the party but this is *really* cool. 

A quick question: any reason to use CLIP embeddings as the SAE input, instead of directly using the images themselves? I understand that the goal is to understand CLIP inner workings, but curious if you have intuitions on whether directly feeding in images would work as well.

Reply