Toy Models of Feature Absorption in SAEs — LessWrong