The research field AGI Fundamental Controllability Limits of Engineerable Control & Safety Impossibility Theorems (AGILECSIT) has the purpose of verifying (both the empirical soundness of the premises and the validity of the formal reasoning of):
that there is an intrinsic, non-reducible possibility for self-modification;
and therefore, that the meta-algorithm is effectively arbitrary;
and hence, that it is inherently undecidable whether all aspects of its own agency/intention are fully defined by only its builders/developers/creators.
In theory, modelling the control of system A over system B means that A can influence system B to achieve A’s desired subset of state space [Source: https://arxiv.org/pdf/2109.00484.pdf].
In practice, engineering control of AGI requires simulating or detecting any unsafe effects internally, and then preventing or correcting those effects externally.
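
As a rough illustration of the two statements above, the following toy sketch treats system B as a single integer state and system A as a loop that simulates B's next state, detects when it would leave A's desired subset of state space, and corrects it from outside. Everything here is a hypothetical assumption made for illustration: the names `run_controlled` and `is_controlled`, the integer state, and the idea that detection and correction are perfect. None of it comes from the cited source, and it says nothing about whether such a loop is achievable for AGI.

```python
from typing import Callable, List, Set

State = int  # hypothetical: B's state is a single integer


def run_controlled(
    start: State,
    dynamics: Callable[[State], State],  # how B evolves on its own
    correct: Callable[[State], State],   # A's external intervention on B
    desired: Set[State],                 # A's desired subset of state space
    steps: int,
) -> List[State]:
    """Simulate B for `steps` steps while A detects and corrects unsafe states."""
    trajectory = [start]
    state = start
    for _ in range(steps):
        predicted = dynamics(state)         # simulate B's next state
        if predicted not in desired:        # detect an unsafe effect
            predicted = correct(predicted)  # correct it externally
        state = predicted
        trajectory.append(state)
    return trajectory


def is_controlled(trajectory: List[State], desired: Set[State]) -> bool:
    """True if every visited state lies inside the desired subset."""
    return all(s in desired for s in trajectory)


if __name__ == "__main__":
    desired = set(range(5))  # states 0..4 count as safe (toy assumption)
    path = run_controlled(
        start=0,
        dynamics=lambda s: s + 1,  # B drifts upward by one each step
        correct=lambda s: 0,       # A resets B to 0 when it would leave the set
        desired=desired,
        steps=8,
    )
    print(path, is_controlled(path, desired))
```

The toy only works because A can predict B's next state exactly and because the correction always lands back inside the desired subset; the sketch does not check either assumption.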