x
Progress Report 1: interpretability experiments & learning, testing compression hypotheses — LessWrong