Paperclip maximization would be a quantitative internal alignment error. Ironically the error of drawing boundary between paperclip maximization & squiggle maximization was itself arbitrary decision. Feel free to message me to discuss this.

•

Applied to Seeking feedback on a critique of the paperclip maximizer thought experiment by RobertM 4mo ago

•

Applied to A simple case for extreme inner misalignment by Raemon 4mo ago

•

Applied to What's wrong with the paperclips scenario? by No77e 6mo ago

135806mo10

I addressed this in my top level comment also but do we think Yud here has the notion that there is such a thing as "our full moral architecture" or is he reasoning from the impossibility of such completeness that alignment cannot be achieved by modifying the 'goal'?

135806mo10

This entry should address the fact the "the full complement of human values" is an impossible and dynamic set. There is no full set, as the set is interactive with a dynamic environment that presents infinite conformations (from an obviously finite set of materials), and also because the set is riven with indissoluble conflicts (hence politics); whatever set was given to the maximizer AGI would have to be rendered free of these conflicts which would then no longer be the full set etc.

•

Applied to Towards an Ethics Calculator for Use by an AGI by sweenesm 1y ago

•

Applied to Reaction to "Empowerment is (almost) All We Need" : an open-ended alternative by Ryo 1y ago

•

Applied to Out of the Box by jesseduffield 1y ago

•

Applied to Non-superintelligent paperclip maximizers are normal by Adam Zerner 1y ago

•

Applied to Nature < Nurture for AIs by scottviteri 1y ago

•

Applied to Will Artificial Superintelligence Kill Us? by James_Miller 1y ago

•

Applied to But What If We Actually Want To Maximize Paperclips? by snerx 1y ago

•

Applied to Prediction: any uncontrollable AI will turn earth into a giant computer by Karl von Wendt 2y ago

WilliamKiely2y10

Question: Are innerly-misaligned (superintelligent) AI systems supposed to necessarily be squiggle maximizers, or are squiggle maximizers supposed to only be one class of innerly-misaligned systems?

LESSWRONGTags
LW

Squiggle Maximizer (formerly "Paperclip maximizer")