Improving SAE's by Sqrt()-ing L1 & Removing Lowest Activating Features — LessWrong