Crossposted from the AI Alignment Forum. May contain more technical jargon than usual.

[Metadata: crossposted from https://tsvibt.blogspot.com/2022/11/are-there-cognitive-realms.html. First completed November 16, 2022. This essay is more like research notes than exposition, so context may be missing, the use of terms may change across essays, and the text may be revised later; only the versions at tsvibt.blogspot.com are definitely up to date.]

Are there unbounded modes of thinking that are systemically, radically distinct from each other in relevant ways?

Note: since I don't know whether "cognitive realms" exist, this essay isn't based on clear examples and is especially speculative.

Realms

Systemically, radically distinct unbounded modes of thinking

The question is, are there different kinds--writ large--of thinking?

To the extent that there are, interpreting the mental content of another mind, especially one with different origins than one's own, may be more fraught than one would assume based on experience with minds that have similar origins to one's own mind.

Are there unbounded modes of thinking that are systemically, radically distinct from each other?

"Unbounded" means that there aren't bounds on how far the thinking can go, how much it can understand, what domains it can become effective in, what goals it can achieve if they are possible.

"Systemically" ("system" = "together-standing-things") means that the question is about all the elements that participate in the thinking, as they covary / coadapt / combine / interoperate / provide context for each other.

"Radical" (Wiktionary) does not mean "extreme". It comes from the same etymon as "radish" and "radix" and means "of the root" or "to the root"; compare "eradicate" = "out-root" = "pull out all the way to the root", and more distantly through PIE *wréh₂ds the Germanic "wort" and "root". Here it means that the question isn't about some mental content in the foreground against a fixed background; the question asks about the background too, the whole system of thinking to its root, to its ongoing source and to what will shape it as it expands into new domains.

Terms

Such a mode of thinking could be called a "realm". A cognitive realm is an overarching, underlying, systemic, total, architectural thoughtform that's worth discussing separately from other thoughtforms. A realm is supposed to be objective, a single metaphorical place where multiple different minds or agents could find themselves.

Other words:

  • systemic thoughtform
  • system of thought, system of thinking
  • cognitive style
  • state of mind
  • cluster / region in mindspace
  • mode of being
  • species of thinking

Realm vs. domain

A domain is a type of task, or a type of environment. A realm, on the other hand, is a systemic type of thinking; it's about the mind, not the task.

For the idea of a domain see Yudkowsky's definition of intelligence as efficient cross-domain optimization power. Compare also domain-specific programming languages, and the domain of discourse of a logical system.

It might be more suitable for a mind to dwell in different realms depending on what domain it's operating in, and this may be a many-to-many mapping. Compare:

The mapping from computational subsystems to cognitive talents is many-to-many, and the mapping from cognitive talents plus acquired expertise to domain competencies is also many-to-many, [...].

From "Levels of Organization in General Intelligence", Yudkowsky (2007).

Domains are about the things being dealt with; it's a Cartesian concept (though it allows for abstraction and reflection, e.g. Pearlian causality is a domain and reprogramming oneself is a domain). Realms are about the thing doing the dealing-with.

Realm vs. micro-realm

A micro-realm is a realm except that it's not unbounded. It's similar to a cognitive faculty, and similar to a very abstract domain, but includes them both; it's "the whole mental area" of dealing with an abstract domain, which includes the (abstract) subject matter of the domain as well as cognitive faculties and systematic ways of thinking about that domain. For example, doing math could be called a micro-realm: it involves subject matter, and many stereotyped mental operations, and many stereotyped and interrelated ways of a mind reprogramming itself in accordance with what's suitable for doing math.

Like the notion of "realm", I'm not sure whether "micro-realm" carves much of anything at its joints. If it does, thinking then consists of shuttling questions and tasks between micro-realms, operating in the micro-realms, and then shuttling answers and performances between micro-realms, metaphorically a little like the Nelson-Oppen method.
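The shuttling picture above can be made concrete with a toy sketch. This is not the real Nelson-Oppen procedure (which exchanges equalities between decision procedures for first-order theories); it just illustrates the metaphor: two "micro-realms", each with its own inference rules over a shared vocabulary of facts, repeatedly pass derived facts back and forth until neither learns anything new. All the rule names and atoms here are made up for illustration.

```python
# Toy sketch (illustrative, not the real Nelson-Oppen method): two "micro-realms"
# exchange derived facts over a shared fact set until a fixpoint is reached.

def parity_realm(facts: set) -> set:
    """Knows only parity rules."""
    out = set(facts)
    if "x_even" in out:
        out.add("x_plus_2_even")        # evenness is preserved by adding 2
    if "x_plus_2_even" in out:
        out.add("x_even")
    return out

def order_realm(facts: set) -> set:
    """Knows only ordering and equality rules."""
    out = set(facts)
    if {"x_le_y", "y_le_x"} <= out:
        out.add("x_eq_y")               # antisymmetry of <=
    if "x_eq_y" in out and "x_even" in out:
        out.add("y_even")               # substitute equals for equals
    return out

def shuttle(facts: set, realms) -> set:
    """Run each realm on the shared fact set until nothing new is derived."""
    while True:
        new = set(facts)
        for realm in realms:
            new |= realm(new)
        if new == facts:
            return facts
        facts = new

combined = shuttle({"x_le_y", "y_le_x", "x_plus_2_even"},
                   [parity_realm, order_realm])
# Neither realm alone derives "y_even"; the combination does.
```

The point of the sketch is that the combined conclusion requires a fact derived in one realm ("x_even") to be shuttled into the other realm's context before its rules can fire.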

Possible examples

  • Different ontologies. Two minds speaking different languages, using different conceptual systems, could be in different realms.
  • Evolution. Since evolution (of a species) to a large extent fails to share knowledge within itself, insofar as it thinks, it thinks less in a language than we do.
  • Processes with fixed outer loops. (Compare "Known-algorithm non-self-improving agent".) Evolution is arguably an example: evolution plays a single note of optimization--bifurcating through speciation, resounding off of different environments, waxing and waning as selection pressure waxes and wanes--but still a single note, increasing or decreasing allele frequencies based on inclusive genetic fitness (though mate choice shades into a richer sort of optimization). (This is arguable because there's more structure to the genome, which could be viewed as related in some ways to ontologies as humans use them.) MuZero is arguably another example: it's just a search, it doesn't think in another way.
  • Different epistemologies. The rules governing what belief-like things two minds hold could be so different that the minds systematically hold different beliefs, or don't even both hold beliefs in the same sense. For example, a mind could act as though it believes whatever some authority tells it is the case. Those belief-like attitudes can only be construed as Bayesian under weird prior beliefs, such as near-perfect confidence that the authority is correct and has been understood correctly. But the attitudes are still belief-like in that the mind might locally act instrumentally the same way a Bayesian agent would act if it happened to believe the sentence the authority spoke.
  • Non-epistemic thinking. An agent might rearrange itself to be suitable for different tasks in a way that's not easy to understand as following rules that produce accurate beliefs. Again, evolution may be an example: although segments of the genome can sometimes be taken to correspond to something (e.g. a niche or element of the environment), they don't seem to constitute propositions (besides a monotone "this code-fragment is useful in this context"), and it's not obvious to me that you'd want to say that an agent has beliefs constituted by something other than propositions. It might be wrong to call this "thinking", but it's at least rearrangement towards suitability, and in the case of evolution can be very strong, strong enough to matter. Of course, the laws of information theory still apply; the point is that this sort of mind or agent may not be well-interpretable as having beliefs in the sense of propositions, which is a main meaning of the everyday word "belief".
  • Different axiologies. Maybe minds can have basically different ways to ultimately judge which actions to take. E.g. virtue ethics, deontology, consequentialism; CDT, EDT, TDT, UDT, FDT. Also different would be agents that: are corrigible to another agent; have a fixed goal in a fixed ontology that they refer all their creativity back to serving; are an expanding coalition of knowledge-bearers; are a coalition held together by group-enforced anarchy; copy goals from other agents; copy behaviors from other agents or play stereotyped roles; or derive goals by imputing agency to their past behavior.
  • Non-axiological thinking. I don't know if this makes sense or is possible, but maybe one could have a mind that "doesn't take meaningful actions" and "just understands stuff".
  • Radically partial thinking. Sometimes people think in a way that isn't easy to understand except as playing a role in a larger system. E.g., in deep discourse, thinking through something newly together: taking just one of the participants alone would put them in a different state of mind that in some cases wouldn't recover the insights from the discourse. E.g., teammates in a game with high-bandwidth communication.
  • Non-ontic thinking. Is it possible to think without using language, without using concepts the way we use concepts? See Foucault's aphasiac, from the preface of "The Order of Things":

It appears that certain aphasiacs, when shown various differently coloured skeins of wool on a table top, are consistently unable to arrange them into any coherent pattern; as though that simple rectangle were unable to serve in their case as a homogeneous and neutral space in which things could be placed so as to display at the same time the continuous order of their identities or differences as well as the semantic field of their denomination. Within this simple space in which things are normally arranged and given names, the aphasiac will create a multiplicity of tiny, fragmented regions in which nameless resemblances agglutinate things into unconnected islets; in one corner, they will place the lightest-coloured skeins, in another the red ones, somewhere else those that are softest in texture, in yet another place the longest, or those that have a tinge of purple or those that have been wound up into a ball. But no sooner have they been adumbrated than all these groupings dissolve again, for the field of identity that sustains them, however limited it may be, is still too wide not to be unstable; and so the sick mind continues to infinity, creating groups then dispersing them again, heaping up diverse similarities, destroying those that seem clearest, splitting up things that are identical, superimposing different criteria, frenziedly beginning all over again, becoming more and more disturbed, and teetering finally on the brink of anxiety.
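The "different epistemologies" bullet above can be illustrated with a toy calculation: an authority-follower's belief-like states coincide with those of a Bayesian agent only in the limit of a near-dogmatic prior that the authority is correct. The functions and numbers below are hypothetical, chosen purely for illustration.

```python
# Toy sketch: an authority-follower vs. a Bayesian who assigns probability
# `trust` to the authority being correct. All parameters are illustrative.

def bayes_update(prior: float, lik_if_true: float, lik_if_false: float) -> float:
    """Posterior P(H | evidence) from prior P(H) and the two likelihoods."""
    num = prior * lik_if_true
    return num / (num + (1 - prior) * lik_if_false)

def authority_follower(authority_says: bool) -> float:
    """Non-Bayesian rule: simply believe whatever the authority asserts."""
    return 1.0 if authority_says else 0.0

def bayesian_with_trust(prior_h: float, trust: float, authority_says: bool) -> float:
    """Bayesian who thinks the authority is right with probability `trust`;
    an assertion of H is evidence with likelihood ratio trust : (1 - trust)."""
    if authority_says:
        return bayes_update(prior_h, trust, 1 - trust)
    return bayes_update(prior_h, 1 - trust, trust)

p_follower = authority_follower(True)               # 1.0
p_dogmatic = bayesian_with_trust(0.5, 0.999, True)  # = 0.999
p_moderate = bayesian_with_trust(0.5, 0.7, True)    # = 0.7
```

Only under the weird near-dogmatic prior does the Bayesian's posterior approximate the follower's rule; a moderate trust level yields systematically different belief-like states, even though both minds might act the same way on the authority's next instruction.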

Implications

If there are different realms, then minds in different realms might be more or less safe, alignable, or interpretable. Interpretability might depend on which realm the interpreter is in.

Do realms exist?

What does it mean for realms to exist?

If it were useful to think about minds in terms of realms, there'd be the same problems as with thinking in terms of languages or species, e.g. the existence of dialect continua or analogously ring species. So we could ask different questions about the existence of cognitive realms, e.g.:

  • How different can minds be? How differently can they think? How thorough / radical / to-the-core can these differences be?
  • How uninterpretable can one mind be to another?
  • Are there contexts in which minds have to radically (to the root) and/or systemically specialize to be competitive?
  • At higher capability levels, are minds convergent? Any mind can simulate (perhaps a weaker version of) any other mind, and maybe any strategy can be stolen, but these stolen strategies could have a parasystemic relationship to the mind.
  • If a mind is capable of a pivotal act, does that imply something about the realm it's in?
  • Are there well-separated large-scale clusters in mindspace? Are there basins of attraction around the clusters?
  • Are these stereotyped ways of thinking mutually exclusive? In what senses can there be one mind or agent that dwells in multiple realms?
  • Are there alignment strategies, interpretation strategies, or safety properties that would apply in some realms but not others?

Reasons realms might exist

  • Bottlenecks. There are sometimes bottlenecks in minds. E.g. in some senses of understanding the world, you can't do better than having a probability distribution over possible worlds, so there's a bottleneck of probability mass. E.g. you have to decide how to allocate computational resources, so there's a bottleneck of what you think about. E.g. at any given time you have finitely many actuators, so you can only take so many actions. E.g. if you're going to be legible to other agents for purposes of coordination, you have to pick some decision process that's legible and that determines your behavior. E.g. if you change a mental element that's related to many other elements, then you either affect the context of those other elements, or you create a Doppelgänger of the element (like choosing between pleiotropy and paralogy). Bottlenecks might create narrowings: you can't be too spread out in mindspace, which maybe implies you have to be in one realm or another.
  • Autogenous specialization. One reason species speciate is that they specialize for different niches. Minds are for expanding one's external niche, but minds also create internal niches for their own elements, in the same way that a genome pool provides a niche for each genetic locus. Which allele is best at a locus depends on the other genes it will be expressed along with (as well as the "external" environment), and which element is most suited in some mental context depends on the other elements in that context (as well as the "external" task). There are different systems of interoperability, and these systems induce themselves. E.g., programming in a way that follows the discipline that functions don't mutate arguments, will set up a situation where a function that does mutate an argument is confusing and breaks things--such a function is ill-suited to the niche induced by the rest of the code. Autogenous (= self-generated, self-originating) specialization that's also autocausal (self-inducing) can run away with itself, leading to a lot of specialization distance--very distinct species or minds.
  • Activator-inhibitor discretization. Activator-inhibitor systems can create discrete regions out of a roughly uniform and continuous substrate. As a familiar example, waves form this way: the wind pushing on the raised part of the wave causes the wave to become more raised, which causes the wind to catch on the wave more forcefully (self-activation), while the lowered region catches even less wind; at the same time, the raised part flattens out because the trough lets the raised water fall into it (inhibition). See Wikipedia on Turing patterns. [Figure: simulated Turing patterns, modified from https://pmontalb.github.io/TuringPatterns/.] Autocausal autogenous specialization (activator) combined with bottlenecks (inhibitor) might create analogous patterns in mindspace. As an analogy / example: Semitic languages such as Hebrew use a triliteral system, where most roots have three letters and then words--nouns, inflected verbs, adjectives--are formed from roots by patterns. This is an unusual system. Why does it exist? A possible explanation is a self-reinforcing loop between lexicon and morphology: the (often transfix) patterns of inflection are regularized to assume triliteral roots because there are many triliteral roots, and when roots are coined they're coined to be triliteral because the patterns of inflection assume triliteral roots.
  • Local convergence within a cluster. There may be basins of attraction around clusters in mindspace, because the autogenous niche created by the presence of the cluster-defining features might be enough to induce an instrumental demand for specific mental elements, so any mind in that cluster will have a demand for those elements. As an analogy: a species can specialize to be water-dwelling and fast-moving; if it does, then it will (convergently, instrumentally) evolve smooth skin, whatever its phylogenetic origin.
  • Non-convexity of synergy. Minds maybe can't just be combined and superseded by mixing together all their elements. As an analogy, Magic: The Gathering is a game where you make decks of spells and then fight other decks. Many cards have effects that combine with the effects of specific other cards to be more powerful than the sum of the two cards on their own. Whole decks are built to be very powerful by incorporating many such synergies. But if you took two very good decks with very different styles and made a deck that's just half of one and half of the other, it would probably be a lot worse.
  • Non-integrability. For the reasons described here, it might not always be feasible to integrate different kinds of cognition into a mind or agent that's well-described as unified.
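The activator-inhibitor mechanism in the list above can be sketched numerically: a 1D field with short-range activation and long-range inhibition (a "Mexican hat" interaction kernel) turns a near-uniform substrate into discrete saturated regions. All parameters are illustrative choices, not anything from the post.

```python
import numpy as np

# Toy sketch of activator-inhibitor discretization: short-range activation
# plus long-range inhibition separates a near-uniform 1D field into
# discrete high and low regions. Parameters are illustrative.

rng = np.random.default_rng(0)
n = 200
u = 0.5 + 0.01 * rng.standard_normal(n)    # near-uniform field with tiny noise

x = np.arange(-15, 16)
activator = np.exp(-x**2 / 4.0)            # narrow positive kernel
inhibitor = 0.5 * np.exp(-x**2 / 36.0)     # wide negative kernel
kernel = activator - inhibitor             # "Mexican hat" lateral interaction

for _ in range(200):
    # each cell is pushed up by nearby above-average cells
    # and pushed down by more distant ones
    influence = np.convolve(u - u.mean(), kernel, mode="same")
    u = np.clip(u + 0.1 * influence, 0.0, 1.0)

# The initially uniform field ends up as alternating saturated regions:
# discrete clusters emerged from a continuous substrate.
```

The design point is that nothing in the initial conditions picks out where the regions fall; the tiny noise is amplified at a preferred spatial scale set by the two kernel widths, which is the loose analogy to clusters forming in mindspace.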

Reasons realms might not exist

  • Absoluteness of structure. It could be that many or all structures are absolute: in and for any mind, the more they're usefully understood, the more they take the same form. (Or in other words, the cosmos might consist of Things.)
  • Absoluteness of constraints. Some constraints on the structure of mind are absolute. E.g. Bayesian or information-theoretic limits on empirical knowledge, and computational complexity constraints on problem-solving, apply to any mind of any kind. Since all minds have to grapple with these constraints, all minds will have some pressures in common affecting their shape.
  • Undoing bottlenecks with abstraction. Some bottlenecks listed above can be undone by thinking abstractly. For example, there may not be a conservation law of attention or interest, because a mind can e.g. think about all "nice" geometric shapes at once using the abstraction "smooth compact Riemannian 2-manifold".
  • Integrability. Maybe anything that's worth understanding can be understood by a unified mind after all. IDK. In that case, multiple synergies that don't naively mix can be combined.
  • Strategy-stealing. To the extent that it's possible for a mind to perform almost as well as any other mind on any given task, cognitive realms can't be meaningful in terms of performance. See "The strategy-stealing assumption", and compare this article on the tractability of goals not depending too much on their supergoals.
Comments
  • Non-epistemic thinking. An agent might rearrange itself to be suitable for different tasks in a way that's not easy to understand as following rules that produce accurate beliefs. Again, evolution may be an example: although segments of the genome can sometimes be taken to correspond to something (e.g. a niche or element of the environment), they don't seem to constitute propositions (besides a monotone "this code-fragment is useful in this context"), and it's not obvious to me that you'd want to say that an agent has beliefs constituted by something other than propositions. It might be wrong to call this "thinking", but it's at least rearrangement towards suitability, and in the case of evolution can be very strong, strong enough to matter. Of course, the laws of information theory still apply; the point is that this sort of mind or agent may not be well-interpretable as having beliefs in the sense of propositions, which is a main meaning of the everyday word "belief".

I think a good example of this is minds that optimize for competitiveness in decision theory. For example, negotiation and persuasion.

the classical understanding of negotiation often recommends "rationally irrational" tactics in which an agent handicaps its own capabilities in order to extract concessions from a counterparty: for example, in the deadly game of chicken, if I visibly throw away my steering wheel, oncoming cars are forced to swerve for me in order to avoid a crash, but if the oncoming drivers have already blindfolded themselves, they wouldn't be able to see me throw away my steering wheel, and I am forced to swerve for them.

Also, skill at self-preservation could have been continuously optimized/selected for at all stages of the evolution of intelligence, including early stages. This includes the neolithic period, where language existed but not written language, and there was extremely limited awareness of how to succeed at thinking or even of what thinking is.

It seems plausible that the reason [murphyjitsu] works for many people (where simply asking “what could go wrong?” fails) is that, in our evolutionary history, there was a strong selection pressure in favor of individuals with a robust excuse-generating mechanism. When you’re standing in front of the chief, and he’s looming over you with a stone axe and demanding that you explain yourself, you’re much more likely to survive if your brain is good at constructing a believable narrative in which it’s not your fault.

It wouldn't be surprising if non-epistemic thinking was already substantially evolved and accessible/retrievable in humans, in which case research into distant cognitive realms is substantially possible with resources that are currently available.

For example, negotiation and persuasion

Oh yeah, that's (potentially) a great example. At least in the human regime, it does seem like you can get sets of people relating to each other so that they're very deeply into conflict frames. I wonder if that can extend to arbitrarily capable / intelligent agents.