eliotpbrenner — LessWrong

Towards Multimodal Interpretability: Learning Sparse Interpretable Features in Vision Transformers

eliotpbrenner, 1y

Great work! I'm trying to reproduce the results. Using the parameters mentioned in the blog post and the associated repo, I have started a training run (but with an even larger batch size: 256 instead of 128). Based on progress so far, it looks like it will take about a week to train over all the batches. Is this what you found, @hugofry? If that time sounds off to you, I'll post my full parameters and ask you to check.