AIXI

Gloss: AIXI is the perfect rolling sphere of advanced agent theory, an ideal intelligent agent that uses infinite computing power to consider all computable hypotheses that relate its actions and sensory data to its rewards, then maximizes expected reward.

Summary: Marcus Hutter's AIXI is the perfect rolling sphere of advanced agent theory - it's not realistic, but you can't understand more complicated scenarios if you can't envision the rolling sphere. At the core of AIXI is Solomonoff induction, a way of using infinite computing power to probabilistically predict binary sequences with (vastly) superintelligent acuity. Solomonoff induction proceeds roughly by considering all possible computable explanations, with prior probabilities weighted by their algorithmic simplicity, and updating their probabilities based on how well they match observation. We then translate the agent problem into a sequence of percepts, actions, and rewards, so we can use sequence prediction. AIXI is roughly the agent that considers all computable hypotheses to explain the so-far-observed relation of sensory data and actions to rewards, and then searches for the best strategy to maximize future rewards. To a first approximation, AIXI could figure out every ordinary problem that any human being or intergalactic civilization could solve. If AIXI actually existed, it wouldn't be a god; it'd be something that could tear apart a god like tinfoil.
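The "consider all explanations, weight by simplicity, update on observation" loop can be sketched in a few lines of Python. This is only a toy, not Solomonoff induction itself: as a labeled assumption, the hypothesis class is restricted to "repeat this binary pattern forever" for patterns up to a small length, and the prior `2 ** (-2 * len(pattern))` is an illustrative stand-in for algorithmic simplicity. The names `hypotheses` and `predict_next` are made up for this example.

```python
from itertools import product


def hypotheses(max_len):
    """Yield every binary pattern of length 1..max_len.

    Each pattern stands for the hypothesis "the sequence is this pattern
    repeated forever" (a toy stand-in for "all computable explanations").
    """
    for n in range(1, max_len + 1):
        for bits in product("01", repeat=n):
            yield "".join(bits)


def predict_next(observed, max_len=6):
    """Return P(next bit == '1') under a simplicity-weighted mixture."""
    weight = {"0": 0.0, "1": 0.0}
    for pattern in hypotheses(max_len):
        # Toy Occam prior: shorter patterns get exponentially more weight.
        prior = 2.0 ** (-2 * len(pattern))
        # Unroll the pattern far enough to cover the data plus one more bit.
        repeats = len(observed) // len(pattern) + 2
        generated = pattern * repeats
        # Deterministic hypotheses: likelihood is 1 if they match, else 0,
        # so updating just discards every pattern that contradicts the data.
        if generated.startswith(observed):
            weight[generated[len(observed)]] += prior
    total = weight["0"] + weight["1"]
    if total == 0.0:
        return 0.5  # no surviving hypothesis in this tiny class
    return weight["1"] / total
```

For instance, `predict_next("0101010")` returns `1.0`, because every surviving pattern continues with a `1`, while `predict_next("0")` comes out below one half, because the shortest surviving hypothesis ("all zeros") carries the most prior weight.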

Summary: Marcus Hutter's AIXI is the ideal rolling sphere of advanced agent theory - it's not realistic, but you can't understand more complicated scenarios if you can't envision the rolling sphere. At the core of AIXI is Solomonoff induction, a way of probabilistically predicting binary sequences with (vastly) superintelligent acuity, doing at least as well as any computable way of predicting sequences up to a bounded logarithmic error. Solomonoff induction proceeds roughly by considering all possible computable explanations, weighted by their simplicity, and doing Bayesian updating. AIXI then translates the central agent problem into a sequence of percepts, actions, and rewards. We can consider AIXI as roughly the agent that does Bayesian updating on an Occam-weighted version of all computable hypotheses to explain the so-far-observed relation of sensory data to actions and rewards, and then, given the updated model, searches for the best strategy to obtain future rewards. AIXI is a central example throughout value alignment theory and illustrates particularly the Cartesian boundary problem, the methodology of unbounded analysis, the Orthogonality Thesis, and shorting out on a reward signal.
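The "update on an Occam-weighted set of hypotheses, then search for the best strategy" step can be illustrated with a deliberately tiny agent. Everything here is an assumption made for illustration: there are two actions, only two hand-written environment hypotheses with made-up prior weights, deterministic rewards, and a short finite horizon. The names `ACTIONS`, `ENVS`, `consistent`, and `value` are hypothetical.

```python
ACTIONS = ("L", "R")


def left_pays(history, action):
    """Hypothesis: pulling the left lever always pays 1."""
    return 1 if action == "L" else 0


def right_pays(history, action):
    """Hypothesis: pulling the right lever always pays 1."""
    return 1 if action == "R" else 0


# (prior weight, environment); the weights are illustrative, as if
# left_pays had a description one bit shorter than right_pays.
ENVS = [(2.0 ** -2, left_pays), (2.0 ** -3, right_pays)]


def consistent(env, history):
    """True if env would have produced every reward seen so far."""
    return all(env(history[:i], a) == r for i, (a, r) in enumerate(history))


def value(envs, history, horizon):
    """Return (expected future reward, best action) by expectimax search."""
    if horizon == 0:
        return 0.0, None
    live = [(w, e) for w, e in envs if consistent(e, history)]
    total = sum(w for w, _ in live)
    if total == 0.0:
        return 0.0, None  # every hypothesis has been refuted
    best_value, best_action = float("-inf"), None
    for action in ACTIONS:
        # Posterior probability of each possible reward after this action.
        outcomes = {}
        for w, e in live:
            r = e(history, action)
            outcomes[r] = outcomes.get(r, 0.0) + w / total
        # Expected reward now plus the value of acting optimally afterwards.
        q = sum(
            p * (r + value(envs, history + ((action, r),), horizon - 1)[0])
            for r, p in outcomes.items()
        )
        if q > best_value:
            best_value, best_action = q, action
    return best_value, best_action
```

With an empty history, `value(ENVS, (), 2)` picks `"L"`, the action favored by the higher-weighted hypothesis. After observing `(("L", 0),)` the left-pays hypothesis is refuted, and `value(ENVS, (("L", 0),), 1)` returns `(1.0, "R")`.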

Technical Summary: Marcus Hutter's AIXI combines Solomonoff induction, expected utility maximization, and the Cartesian agent-environment-reward formalism to yield a completely specified superintelligent agent that can be written out as a single equation but would require a high-level halting oracle to run. The formalism requires that percepts, actions, and rewards can all be encoded as integer sequences. AIXI considers all computable hypotheses, with prior probabilities weighted by algorithmic simplicity, that describe the relation of actions and percepts to rewards. AIXI updates on its observations so far, then maximizes its next action's expected reward, under the assumption that its future selves up to some finite time horizon will similarly update and maximize. The AIXI$tl$ variant requires (vast but) bounded computing power, and only considers hypotheses under a bounded length $l$ that can be computed within time $t$. AIXI is a central example throughout value alignment theory; it illustrates the Cartesian boundary problem, the methodology of unbounded analysis, the Orthogonality Thesis, and seizing control of a reward signal.
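For reference, the single equation is usually written along the following lines. The notation is an assumption imported from Hutter's standard presentation rather than from this page: $a_i$, $o_i$, $r_i$ are the action, observation, and reward at step $i$, $k$ is the current step, $m$ is the finite horizon, $U$ is a universal Turing machine, $q$ ranges over programs (the computable hypotheses), and $\ell(q)$ is the length of $q$ in bits.

$$a_k := \arg\max_{a_k} \sum_{o_k r_k} \ldots \max_{a_m} \sum_{o_m r_m} \left[ r_k + \ldots + r_m \right] \sum_{q \,:\, U(q,\, a_1 \ldots a_m) = o_1 r_1 \ldots o_m r_m} 2^{-\ell(q)}$$

Reading from right to left: the innermost sum is the simplicity-weighted mass of all programs consistent with the interaction history, the bracket is the total reward over the remaining horizon, and the alternating max and sum operators are the expectimax search over future actions and percepts.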

Further information:

Clickbait: How would you build an (evil) superintelligent AI using unlimited computing power and one page of Python code? Answer: AIXI.

v1.10.0  Oct 6th 2017 GMT  (+89)
v1.9.0  Apr 1st 2016 GMT  (-267)
v1.8.0  Dec 17th 2015 GMT  (+176/-1333)
v1.7.0  Dec 3rd 2015 GMT  (+6/-24)
v1.6.0  Dec 3rd 2015 GMT  (+1848/-442)
v1.5.0  Dec 3rd 2015 GMT  (+10)
v1.4.0  Aug 12th 2015 GMT  (+22/-66)
v1.3.0  Aug 5th 2015 GMT  (+11)
v1.2.0  Aug 5th 2015 GMT
v1.1.0  Aug 5th 2015 GMT
