An agent is "reflectively stable" if, given the chance to construct a successor agent or modify its own code, it would construct a successor that thinks much the same way it does. For instance, we say that causal decision theory is not "reflectively stable", because a causal decision theorist, able to choose its successor's decision theory, would not choose causal decision theory: by its own current standards, a successor that behaves differently on Newcomblike problems does better. By contrast, having a utility function that only weighs paperclips is "reflectively stable", because paperclip maximizers try to build other paperclip maximizers. If, thinking the way you currently do (in some regard), you conclude that thinking that way (in that regard) is optimal or acceptable, then you are reflectively stable (in that regard).
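
To make the definition concrete, here is a toy sketch (illustrative only; the names `Agent`, `predicted_outcome`, and `choose_successor` are assumptions for this example, not anything from the literature) of an agent that rates candidate successors by its own current utility function. The paperclip maximizer's top-rated successor is another paperclip maximizer, so it is reflectively stable in this narrow sense.

```python
# A toy sketch, not a definitive model: agents rate candidate successor
# designs using their *current* utility function. All names here
# (Agent, predicted_outcome, choose_successor) are illustrative assumptions.
from dataclasses import dataclass
from typing import Callable, Dict, List

World = Dict[str, int]  # a crude outcome, e.g. {"paperclips": 100}

@dataclass(frozen=True)
class Agent:
    name: str
    utility: Callable[[World], float]  # how this agent scores outcomes

def predicted_outcome(agent: Agent) -> World:
    # Stand-in for "the world that results if this agent runs things".
    return {
        "clippy": {"paperclips": 100, "staples": 0},
        "stapler": {"paperclips": 0, "staples": 100},
    }[agent.name]

def choose_successor(current: Agent, candidates: List[Agent]) -> Agent:
    # The current agent scores each candidate by its OWN utility function,
    # applied to the outcome that candidate is predicted to bring about.
    return max(candidates, key=lambda c: current.utility(predicted_outcome(c)))

clippy = Agent("clippy", lambda w: w["paperclips"])
stapler = Agent("stapler", lambda w: w["staples"])

# The paperclip maximizer's preferred successor is another paperclip
# maximizer, so it is stable. (A staple maximizer would likewise pick itself.)
print(choose_successor(clippy, [clippy, stapler]).name)  # -> "clippy"
```

The point of the sketch is that the current agent's utility function, not the candidate's, does the scoring: reflective stability is a fixed-point property of that self-evaluation, in which the agent's current way of thinking endorses building more of itself.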