Faithful vs Interpretable Sparse Autoencoder Evals — LessWrong