Classifying representations of sparse autoencoders (SAEs) — LessWrong