LESSWRONG

Embedded Agency

Written by Ben Pace, Noosphere89, et al. Last updated 4th Jan 2023.

Embedded Agency is the problem that a theory of rational agents must account for the fact that the agents we create (and we ourselves) are inside the world we are trying to affect, not separate from it. This contrasts with much of the current basic theory of AI and rationality (such as Solomonoff induction or Bayesianism), which implicitly supposes a separation between the agent and the things the agent has beliefs about. In other words, agents in this universe do not have the Cartesian or dualistic boundaries that much of philosophy assumes. They must instead be understood reductively: agents are made up of non-agent parts like bits and atoms.

Embedded Agency is not a fully formalized research agenda, but Scott Garrabrant and Abram Demski have written the canonical explanation of the idea in their sequence Embedded Agency. The sequence points to many of the core confusions we have about rational agency and attempts to tie them into a single picture.

Posts tagged Embedded Agency
Embedded Agency (full-text version), by Scott Garrabrant, abramdemski (7y; 210 karma, 17 comments)
Embedded Agents, by abramdemski, Scott Garrabrant (7y; 234 karma, 42 comments)
Humans Are Embedded Agents Too, by johnswentworth (6y; 82 karma, 21 comments)
Introduction to Cartesian Frames, by Scott Garrabrant (5y; 155 karma, 32 comments)
Draft papers for REALab and Decoupled Approval on tampering, by Jonathan Uesato, Ramana Kumar (5y; 47 karma, 2 comments)
Decision Theory, by abramdemski, Scott Garrabrant (7y; 121 karma, 45 comments)
Robust Delegation, by abramdemski, Scott Garrabrant (7y; 116 karma, 10 comments)
Subsystem Alignment, by abramdemski, Scott Garrabrant (7y; 102 karma, 12 comments)
Embedded World-Models, by abramdemski, Scott Garrabrant (7y; 96 karma, 16 comments)
Embedded Agency via Abstraction, by johnswentworth (6y; 42 karma, 20 comments)
Embedded Curiosities, by Scott Garrabrant, abramdemski (7y; 91 karma, 1 comment)
Updates and additions to "Embedded Agency", by Rob Bensinger, abramdemski (5y; 82 karma, 1 comment)
You Only Get One Shot: an Intuition Pump for Embedded Agency, by Oliver Sourbut (3y; 24 karma, 4 comments)
Reducing LLM deception at scale with self-other overlap fine-tuning, by Marc Carauleanu, Diogo de Lucena, Gunnar_Zarncke, Judd Rosenblatt, Cameron Berg, Mike Vaiana, AE Studio (4mo; 155 karma, 41 comments)
Infra-Bayesian physicalism: a formal theory of naturalized induction, by Vanessa Kosoy (4y; 114 karma, 23 comments)