x
A Behavioural and Representational Evaluation of Goal-directedness in Language Model Agents — LessWrong