Mario Giulianelli

Associate Professor of Computational Linguistics at University College London and Member of the European Laboratory for Learning and Intelligent Systems

5mo

Mario Giulianelli — LessWrong

A Behavioural and Representational Evaluation of Goal-directedness in Language Model Agents

This work was conducted as part of Project Telos and supported by the SPAR mentorship program. For the full technical details, see our paper on arXiv. TL;DR: We present a framework for evaluating goal-directedness in LLM agents that combines behavioural evaluation with analysis of internal representations. Studying GPT-OSS-20B navigating 2D...

Mar 5•20

Modelling, Measuring, and Intervening on Goal-directed Behaviour in AI Systems

TL;DR This is the first post in an upcoming series of blog posts outlining Project Telos. This project is being carried out as part of the Supervised Program for Alignment Research (SPAR). Our aim is to develop a methodological framework to detect and measure goals in AI systems. In this...

Oct 31, 2025•15
