LESSWRONG
LW

1766
Wikitags

Goal-Directedness

Edited by Jon Garcia, adamShimi, et al. last updated 30th Dec 2024

Goal-Directedness is the property of some system to be aiming at some goal. It is in need of formalization, but might prove important in deciding which kind of AI to try to align.

A goal may be defined as a world-state that an agent tries to achieve. Goal-directed agents may generate internal representations of desired end states, compare them against their internal representation of the current state of the world, and formulate plans for navigating from the latter to the former.

The goal-generating function may be derived from a pre-programmed lookup table (for simple worlds), from directly inverting the agent's utility function (for simple utility functions), or it may be learned through experience mapping states to rewards and predicting which states will produce the largest rewards. The plan-generating algorithm could range from shortest-path algorithms like A* or Dijkstra's algorithm (for fully-representable world graphs), to policy functions that learn through RL which actions bring the current state closer to the goal state (for simple AI), to some combination or extrapolation (for more advanced AI).

Implicit goal-directedness may come about in agents that do not have explicit internal representations of goals but that nevertheless learn or enact policies that cause the environment to converge on a certain state or set of states. Such implicit goal-directedness may arise, for instance, in simple reinforcement learning agents, which learn a policy function π:S→A that maps states directly to actions.

Subscribe
Discussion
1
Subscribe
Discussion
1
Posts tagged Goal-Directedness
80Literature Review on Goal-Directedness
Ω
adamShimi, Michele Campolo, Joe Collman
5y
Ω
21
134Coherence arguments do not entail goal-directed behavior
Ω
Rohin Shah
7y
Ω
69
66FAQ: What the heck is goal agnosticism?
porby
2y
38
21Behavioral Sufficient Statistics for Goal-Directedness
Ω
adamShimi
5y
Ω
12
68AI safety without goal-directed behavior
Ω
Rohin Shah
7y
Ω
15
61Will humans build goal-directed agents?
Ω
Rohin Shah
7y
Ω
43
59Measuring Coherence of Policies in Toy Environments
Ω
dx26, Richard_Ngo
1y
Ω
9
55Intuitions about goal-directed behavior
Ω
Rohin Shah
7y
Ω
15
27Deliberation Everywhere: Simple Examples
Ω
Oliver Sourbut
3y
Ω
3
21Goal-directed = Model-based RL?
Ω
adamShimi
6y
Ω
10
19Focus: you are allowed to be bad at accomplishing your goals
Ω
adamShimi
5y
Ω
17
16Locality of goals
Ω
adamShimi
5y
Ω
8
14Goals and short descriptions
Ω
Michele Campolo
5y
Ω
8
9Goal-Directedness: What Success Looks Like
Ω
adamShimi
5y
Ω
0
6Goal-directedness is behavioral, not structural
Ω
adamShimi
5y
Ω
12
Load More (15/80)
Add Posts