LESSWRONG
LW

452
Wikitags

Situational Awareness

Edited by Jacob Pfau, Ben Millwood, et al. last updated 6th Jun 2025

In the context of AI model capabilities, Ajeya Cotra uses the term "situational awareness" to refer to:

a cluster of skills including “being able to refer to and make predictions about yourself as distinct from the rest of the world,” “understanding the forces out in the world that shaped you and how the things that happen to you continue to be influenced by outside forces,” “understanding your position in the world relative to other actors who may have power over you,” “understanding how your actions can affect the outside world including other actors,” etc.

Alternatively, from an ML-perspective, situational awareness can be characterized as a strong form of out-of-context meta-learning applied to situationally-relevant statements.

"Situational awareness" of course has a broader meaning outside of the AI context. Even within the AI context, it's used to refer to both "the awareness that AIs have about their situation" and "the awareness that relevant human decision-making bodies have about the AI situation". Leopold Aschenbrenner's Situational Awareness is an example of the latter.

Subscribe
Discussion
1
Subscribe
Discussion
1
Posts tagged Situational Awareness
370Without specific countermeasures, the easiest path to transformative AI likely leads to AI takeover
Ω
Ajeya Cotra
3y
Ω
95
81Situational Awareness: A One-Year Retrospective
Nathan Delisle
3mo
4
109Me, Myself, and AI: the Situational Awareness Dataset (SAD) for LLMs
L Rudolf L, bilalchughtai, Jan Betley, kaivu, Jérémy Scheurer, Mikita Balesni, AlexMeinke, Owain_Evans, Marius Hobbhahn
1y
37
109Paper: On measuring situational awareness in LLMs
Ω
Owain_Evans, Daniel Kokotajlo, Mikita Balesni, Tomek Korbak, Asa Cooper Stickland, Meg, Maximilian Kaufmann
2y
Ω
17
95On the functional self of LLMs
Ω
eggsyntax
2mo
Ω
35
55Owain Evans on Situational Awareness and Out-of-Context Reasoning in LLMs
Ω
Michaël Trazzi
1y
Ω
0
43Interim Research Report: Mechanisms of Awareness
Ω
Josh Engels, Neel Nanda, Senthooran Rajamanoharan
5mo
Ω
6
32Investigating the Ability of LLMs to Recognize Their Own Writing
Ω
Christopher Ackerman, Nina Panickssery
1y
Ω
0
31Some Quick Follow-Up Experiments to “Taken out of context: On measuring situational awareness in LLMs”
Ω
Miles Turpin
2y
Ω
0
19Is there any rigorous work on using anthropic uncertainty to prevent situational awareness / deception?
QΩ
David Scott Krueger (formerly: capybaralet)
1y
QΩ
7
16Revising Stages-Oversight Reveals Greater Situational Awareness in LLMs
Ω
Sanyu Rajakumar
6mo
Ω
0
149It's hard to make scheming evals look realistic for LLMs
Igor Ivanov, Danil Kadochnikov
4mo
29
119Revealing Intentionality In Language Models Through AdaVAE Guided Sampling
Ω
jdp
2y
Ω
15
74The Zeroth Skillset
katydee
13y
109
59Do models know when they are being evaluated?
Govind Pimpale, Giles, Joe Needham, Marius Hobbhahn
7mo
8
Load More (15/28)
Add Posts