Takeaways from a Mechanistic Interpretability project on “Forbidden Facts” — LessWrong