x
Helping Friends, Harming Foes: Testing Tribalism in Language Models — LessWrong