Quick Thoughts on Scaling Monosemanticity — LessWrong