x
Models have linear representations of what tasks they like — LessWrong