x

LESSWRONG

LW

Arnau Padrés Masdemont — LessWrong

Arnau Padrés Masdemont

Arnau Padrés Masdemont

Message

5

9mo

Arnau Padrés Masdemont

5

9mo

No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes

by antonghawthorne, ivanvmoreno, Arnau Padrés Masdemont, David Africa, and LorenzoPacchiardi

TLDR: This is the abstract, introduction and conclusion to the paper. See here for a summary thread. Abstract Do large language models (LLMs) anticipate when they will answer correctly? To study this, we extract activations after a question is read but before any tokens are generated, and train linear probes...

Sep 16, 2025•10