x
Antonym Heads Predict Semantic Opposites in Language Models — LessWrong