Mechanistic Interpretability for the MLP Layers (rough early thoughts) — LessWrong