New OpenAI Paper - Language models can explain neurons in language models — LessWrong