Do LLMs Change Their Minds About Their Users… and Know It?
Executive Summary: Large language models (LLMs) often form surprisingly detailed and accurate depictions of their users and tailor their responses to match these inferred traits. Previous research has shown that these inferred traits can be surfaced and manipulated using linear probes. In this project, I explore the dynamics of this adaptation: if...