Antonym Heads Predict Semantic Opposites in Language Models — LessWrong