Scaling and evaluating sparse autoencoders — LessWrong