Activation Plateaus: Where and How They Emerge — LessWrong