Building Conscious* AI: An Illusionist Case

by OscarGilg
11th Sep 2025
17 min read

Comments (4)

Adele Lopez:

Illusionism basically says this: once we have successfully explained all our reports about consciousness, there will be nothing left to explain.

As a guiding intuition, consider the case of white light, which was regarded as an intrinsic property of nature until Newton discovered that it is in fact composed of seven distinct colours. White light is an illusion in the sense that it does not possess an intrinsic property “whiteness” (even though it seems to). Suppose we manage to explain, with a high degree of precision, exactly how and when we perceive white, and why we perceive it the way we do. We do not subsequently need to formulate a “hard problem of whiteness” asking why, on top of this, whiteness arises. Illusionists claim that consciousness is an illusion in the same sense that whiteness is.[1]


So literally everything that is in some way an abstraction is an illusion? I don't think this is generally what the illusionists mean; my understanding is that it is more about phenomenal consciousness being non-representational—meaning something like: it has the type signature of a world-model without actually being a model of anything real (including itself). That could very well be a misrepresentation of what illusionists believe, but I'm still pretty sure it's not just any reductionist explanation of consciousness.

OscarGilg:

Thanks for the comment! I had to have a think but here's my response:

The first thing is that I maybe wasn't clear about the scope of the comparison. It was just to say "whiteness of light is an illusion in roughly the same sense that phenomenal consciousness is" (as opposed to other definitions of illusion).

Even then, what differentiates these illusions from other abstractions? Obviously not all abstractions are illusions.

Take our (functional) concept of heat. In some sense it's an abstraction, and it doesn't quite work the way people thought a thousand years ago. But crucially, there exists a real-world process which maps onto our folk concept extremely nicely, such that the folk concept remains useful and tracks something real. Unlike with phenomenal consciousness, it just so happens that we evolved our concept of heat without attributing too many weird properties to it. Once we developed models of molecular kinetic energy, we could just plug them right in.

Where I think you might have a point is that this is arguably not a binary distinction: some concepts are clearly confused and others clearly not, but in some cases it might be blurry (and consciousness might be one of those; I'm not sure).

I don't think this is generally what the illusionists mean, my understanding is that it is more about phenomenal consciousness being non-representational—meaning something like that it has the type signature of a world-model without actually being a model of anything real (including itself)

I think most illusionists believe consciousness involves real representations, but systematic misrepresentations. The cognitive processes are genuinely representing something (our cognitive states), but they are attributing phenomenal properties that don't actually exist in those states. That's quite different from it being non-representational, and not being a model of anything.

At least that's my understanding which comes from the Daniel Dennett/Keith Frankish views. I'd be interested in learning about others.

soycarts:

Illusionism basically says this: once we have successfully explained all our reports about consciousness, there will be nothing left to explain. Phenomenal experiences are nothing more than illusions. For illusionists, the meta-problem is not just a stepping stone, it's the whole journey.

As a guiding intuition, consider the case of white light, which was regarded as an intrinsic property of nature until Newton discovered that it is in fact composed of seven distinct colours. White light is an illusion in the sense that it does not possess an intrinsic property “whiteness” (even though it seems to). Suppose we manage to explain, with a high degree of precision, exactly how and when we perceive white, and why we perceive it the way we do. We do not subsequently need to formulate a “hard problem of whiteness” asking why, on top of this, whiteness arises. Illusionists claim that consciousness is an illusion in the same sense that whiteness is.

I love your description of Illusionist thought, and pattern-match it as a successful application of self-reference (a cognitive tool I particularly value).

It seems to me however that it is just stated as fact that “phenomenal experiences are nothing more than illusions”.

I think the disconnect for me is that I equate consciousness to “being” which, in Eastern Philosophy, has some extrinsic properties (which are phenomenal).

This means that agents cannot wholly describe the “being” of another agent — its nature of being is not clearly bounded.

  1. There is a correct explanation of our intuitions about consciousness which is independent of consciousness.
  2. If there is such an explanation, and our intuitions are correct, then their correctness is a coincidence.

Initially I agreed with this because I thought you meant “a correct explanation of our intuitions about consciousness” in a partial sense — i.e. not a comprehensive explanation. This is then used to “debunk consciousness”.

It seems to me that we can talk about components of conscious experience without needing to reach a holistic definition, and then we might still be able to discuss Consciousness* as the components of conscious experience minus phenomena. Maybe this matches what you’re saying?

I’m on board with the core idea of intentionally building consciousness into AI (as far as we can ambiguously define it) as a driver of alignment… but perhaps at a later development stage when we’re confident we can absolve the AI of suffering.

OscarGilg:

Thanks for the comment and the kind words!

It seems to me however that it is just stated as fact that “phenomenal experiences are nothing more than illusions”.

I think the disconnect for me is that I equate consciousness to “being” which, in Eastern Philosophy, has some extrinsic properties (which are phenomenal).

I'm no expert in Eastern Philosophy conceptions of consciousness; I've been meaning to dig into them but haven't gotten around to it.

What I would say is this: for any phenomenal property attributed to consciousness (e.g. extrinsic ones), you can formulate an illusionist theory of it. You can be an illusionist about many things in the world (not always rightly).

The debunking argument might have to be tweaked, e.g. it might not be about "intuitions", and of course you could reject this kind of argument. Personally I would expect it to also be quite strong across the "phenomenal" range. I would be very happy to see some (counter-)examples!

Initially I agreed with this because I thought you meant “a correct explanation of our intuitions about consciousness” in a partial sense — i.e. not a comprehensive explanation. This is then used to “debunk consciousness”.

It seems to me that we can talk about components of conscious experience without needing to reach a holistic definition, and then we might still be able to discuss Consciousness* as the components of conscious experience minus phenomena. Maybe this matches what you’re saying?

I guess this sounds a bit like weak illusionism, where phenomenal consciousness exists but some of our intuitions about it are wrong? We would indeed also be able to discuss consciousness* (with asterisk), but we'd run into other problems, and I don't think the argument about moral intuitions would be nearly as strong. Weak illusionism basically collapses to realism. It would still point to consciousness* being more cognitively important, so many of the points would be preserved. Let me know if this isn't what you meant.

Building Conscious* AI: An Illusionist Case

In this post I want to lay out some ideas on a controversial philosophical position about consciousness, illusionism, and on how it might impact the way we think about consciousness in AI. Illusionism, in a nutshell, proposes that phenomenal consciousness does not exist, although it seems to exist. My aim is to unpack that definition and give it just enough credence to make it worth exploring its consequences for AI consciousness, morality and alignment.

Illusionism suggests that what is really going on is a different mechanism: consciousness* (aka the cognitive processes which trick us into thinking we have phenomenal consciousness, introduced later in the post), which is less morally significant but more cognitively consequential. This reframing leads to different conclusions about how to proceed with AI consciousness.

The illusionist approach is different from—but not in contradiction with—the kind of view exemplified by Jonathan Birch's recent "Centrist Manifesto". Birch emphasises the dual challenge of over-attribution and under-attribution of consciousness in AI, and outlines some of the challenges for AI consciousness research. In accordance with other recent work, he advocates for a careful, cautious approach.

By critiquing Birch's framework through an illusionist lens, I will end up arguing that we should seriously consider building consciousness* into AI. I'll outline reasons for expecting links with AI alignment, and how efforts to suppress consciousness-like behaviours could backfire. The illusionist perspective suggests we might be committing a big blunder: trying to avoid anything that looks like consciousness in AI, when it actually matters far less than we think morally, but is far more consequential than we think cognitively.

The case for illusionism

What is phenomenal consciousness?

The classic definition of phenomenal consciousness by Nagel is that a system is conscious if there is “something it is like” to be that system. If this seems vague to you (it does to me) then you might prefer defining consciousness through examples: seeing the colour red, feeling pain in one’s foot, and tasting chocolate are states associated with conscious experiences. The growth of nails and regulation of hormones are not (see Schwitzgebel's precise definition by example).

The hard problem and the meta-problem of consciousness

In his seminal paper “Facing up to the problem of consciousness”, David Chalmers proposes a distinction between what he calls the easy and hard problems of consciousness. The easy problems are about functional properties of the human brain like “the ability to discriminate, categorize, and react to environmental stimuli”. While these problems might not actually be easy to solve, it is easy to believe they are solvable.

But when we do manage to solve all the easy problems, in Chalmers’ words, “there may still remain a further unanswered question: Why is the performance of these functions accompanied by experience?” That’s the hard problem of consciousness: understanding why, on top of whatever functionality they have, some cognitive states have phenomenal properties.

Twenty-three years later, David Chalmers published “The Meta-Problem of Consciousness”. The first lines read: “The meta-problem of consciousness is (to a first approximation) the problem of explaining why we think that there is a [hard] problem of consciousness.” So instead of “why are we conscious”, the question is “why do we think we are conscious”. Technically, this is part of the easy problems. But as Chalmers notes, solving the hard problem probably requires understanding why we even think we have consciousness in the first place (it would be weird if it were a coincidence!). Thankfully, the meta-problem is more tractable scientifically than the hard one.

So suppose we solve the meta-problem of consciousness. The hard problem says we still have to explain consciousness itself—or do we? This is where illusionism comes in.

Illusionism to the rescue

Illusionism basically says this: once we have successfully explained all our reports about consciousness, there will be nothing left to explain. Phenomenal experiences are nothing more than illusions. For illusionists, the meta-problem is not just a stepping stone, it's the whole journey.

Cover of Illusionism as a theory of consciousness by Keith Frankish.

As a guiding intuition, consider the case of white light, which was regarded as an intrinsic property of nature until Newton discovered that it is in fact composed of seven distinct colours. White light is an illusion in the sense that it does not possess an intrinsic property “whiteness” (even though it seems to). Suppose we manage to explain, with a high degree of precision, exactly how and when we perceive white, and why we perceive it the way we do. We do not subsequently need to formulate a “hard problem of whiteness” asking why, on top of this, whiteness arises. Illusionists claim that consciousness is an illusion in the same sense that whiteness is.[1]

So illusionists don’t deny that conscious experiences exist in some sense (we’re talking about them right now!). They deny that conscious experiences have a special kind of property: phenomenality (although they really seem to have phenomenality).

The most common objection to illusionism is straightforward: how can consciousness be an illusion when I obviously feel pain? This is an objection endorsed by a lot of serious philosophers (including Chalmers himself). Intuition pumps can only get us so far; we'll now dive into an actual philosophical argument.

Debunking consciousness

One of the main arguments for illusionism follows the template of a so-called “debunking argument”. The idea is that if we can explain the occurrence of our beliefs about X in a way that is independent of X, then our beliefs about X might be true, but that would be a coincidence (i.e. probability zero). Let’s use this template to “debunk” consciousness (following Chalmers):

  1. There is a correct explanation of our intuitions about consciousness which is independent of consciousness.
  2. If there is such an explanation, and our intuitions are correct, then their correctness is a coincidence.

I think many atheists want to make a similar kind of argument against the existence of God. Suppose that we can explain our beliefs about God in, say, evolutionary, psychological and historical terms without ever including God as a cause. It would then be a bizarre coincidence if our beliefs about God turned out to be correct. As with the debunking argument against consciousness, the hardest part is actually doing the debunking bit (i.e. claim 1). The good news is that philosophers can outsource this: it is a scientifically tractable problem.
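For readers who prefer the template spelled out, here is one way to schematise it (my own informal notation, not Chalmers'): write $B_X$ for our beliefs and intuitions about $X$, and let $E$ range over candidate explanations of $B_X$.

```latex
% Debunking template, schematised (informal notation)
\text{(P1)}\quad \exists E \;\big[\, E \text{ correctly explains } B_X \;\wedge\; X \text{ plays no role in } E \,\big] \\
\text{(P2)}\quad \big[\text{(P1)} \;\wedge\; B_X \text{ is correct}\big] \;\Rightarrow\; \text{the correctness of } B_X \text{ is a coincidence}
```

Applied to consciousness, $X$ is phenomenal consciousness, and all the work lies in establishing (P1), which is an empirical project: solving the meta-problem.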

Introducing consciousness* (with an asterisk)

If consciousness doesn't exist, what cognitive mechanisms generate our persistent reports and intuitions about it? Being an illusionist involves denying that phenomenal consciousness exists, but not that it seems to exist, and not that something is causing us to have all these intuitions. The fact that the illusion is so strong is precisely what the theory seeks to explain. There must be some cognitive mechanism which causes us to mischaracterise some cognitive states as possessing phenomenal properties.

So let's define consciousness* (with an asterisk) as "the cognitive mechanism leading us to systematically mischaracterise some states as phenomenal."[2] This sort of deflated (diet) version definitely exists. This distinction changes how we should think about AI consciousness:

  • Traditional/realist view: "Is this AI phenomenally conscious?"
  • Illusionist view: "Does this AI have the cognitive architecture that produces reports and intuitions about consciousness?"

The first question assumes something (phenomenal consciousness) that illusionists think is conceptually confused. The second question is scientifically tractable: we can study consciousness* in humans and look for similar mechanisms in AI.

So maybe we can just replace full-fat "consciousness" with diet "consciousness*" in all our ethical theories, dust off our hands, and call it a day. Problem solved, ethics intact, everyone goes home happy.

If only it were that simple. As we'll see, this substitution raises issues about what properties should ground moral consideration of minds—human and artificial alike.

For the remainder of the post I'll focus on consciousness* (the real but diet version, with the asterisk), and occasionally refer to full-fat consciousness (no asterisk). Whenever you see an asterisk, just think "the cognitive processes which trick me into thinking I have phenomenal consciousness".

The consequences of illusionism on ethics

Do illusionists feel pain?

There are three ways one can understand “pain” and illusionism has different takes on each of them (see Kammerer):

  • Functional pain: illusionism does not deny this exists.
  • Phenomenal pain: illusionism denies this exists (but not that it seems to exist).
  • Normative pain (i.e. inflicting pain is bad/unethical): illusionism does not deny this exists.

So illusionists can still say hurting people is wrong. But the question remains: why would inflicting pain be bad if there's no phenomenal experience? And what about our new notion of consciousness*, which does exist? Does that matter?

Questioning moral intuitions about consciousness

Our intuitions about pain's badness come from introspection: phenomenal pain seems to reveal its negative value directly and immediately. Pain doesn't just seem bad, it seems bad beyond doubt, with more certainty than any other fact. However, as François Kammerer argues in his paper "Ethics without sentience", if illusionism is true, then

our introspective grasp of phenomenal consciousness is, to a great extent, illusory: phenomenal consciousness really exists, but it does not exist in the way in which we introspectively grasp and characterize it. This undercuts our reason to believe that certain phenomenal states have a certain value: if introspection of phenomenal states is illusory – if phenomenal states are not as they seem to be – then it means that the conclusions of phenomenal introspection must be treated with great care and a high degree of suspicion

In other words if we take the leap of faith with illusionism about phenomenal states, why stay stubbornly attached to our intuitions about the moral status of these same states?

To be clear, this argument targets intuitions about consciousness, the full-fat no asterisk version. But since consciousness* (with an asterisk) is none other than the set of cognitive processes which generate our (now-suspect) intuitions, this also removes reasons to treat it as a foundation for moral status.

This seems to point to the need to use other properties as foundations for moral consideration. As Kammerer explores, properties like agency, desires, sophisticated preferences, or capacity for deep caring are good candidates. Of course these might happen to be deeply entangled with consciousness*, such that in practice consciousness* might be linked to moral status. But even if this entanglement exists in humans, there is no guarantee it would persist in all artificial systems. We shouldn't exclude the possibility of systems possessing the cognitive build for consciousness* but without e.g. strong agency, or vice-versa.

Conscious AI: an illusionist critique of the Centrist Manifesto

Having presented illusionism, I now want to examine how it applies to current approaches in AI consciousness research. Beyond laying groundwork for my later argument about building consciousness* into AI, this also showcases the de-confusing powers of illusionism, and makes the issue more tractable overall.

In his recent paper AI Consciousness: A Centrist Manifesto, Jonathan Birch outlines two major challenges facing us in the near future: roughly, over-attribution and under-attribution of consciousness in AI. The paper does a great job of outlining the issues whilst remaining parsimonious in its assumptions[3]. However, examining Birch's manifesto through an illusionist lens points to methodological blind spots and suggests a more promising path forward.

The gaming problem and the Janus problem

The first challenge Birch describes is that many people will misattribute human-like consciousness to AI. This is not a new phenomenon and is reminiscent of the ELIZA effect. Things get messy when labs become incentivised either to take advantage of Seemingly Conscious AI (SCAI) or to suppress it. I'll have more to say about this in the final section.

Birch's second challenge cuts to the heart of the AI consciousness problem: we might create genuinely conscious AI before we have reliable ways to recognise it, and before we fully understand the moral implications. This is a serious problem. In the worst case, we could create billions of suffering agents. Addressing this challenge means understanding AI consciousness and how it relates to moral status. Birch goes on to identify two fundamental problems that make this difficult: the gaming problem and the Janus problem.

The gaming problem arises from the fact that frontier models are trained on massive datasets containing humans talking about their minds and experiences, and also post-trained to produce various responses (e.g. ChatGPT when it is asked if it is conscious). Whatever models say about their own subjective experience cannot be trusted.

Asking ChatGPT if it is conscious. The first of the seven paragraphs of Claude's answer: that no one knows.

The Janus problem is that whatever theory-driven indicator you find in AI, there will always be two ways to update: "AI is conscious" or "the theory is wrong". The same evidence points in opposite directions depending on your prior beliefs.

Birch argues these obstacles aren't permanent roadblocks—they can be overcome through systematic comparative research across species, theoretical refinements, and better AI interpretability tools.

Birch's research program for deciding whether to attribute consciousness to AI.

The illusionist response

While it's true that behavioural evidence becomes unreliable when dealing with AI systems, this doesn't mean we can't do empirical work. We can design theory-informed experiments that test real capabilities rather than surface-level mimicry. Illusionists view consciousness* as deeply integrated into cognition, suggesting many avenues for meaningful measurement. 

For instance, we might measure metacognitive abilities by having models predict their own performance or confidence across different domains. We can investigate self-modelling and situational awareness through real benchmarks. We could examine top-down attentional control (see next section on AST), and whether models can selectively shift their focus in ways you wouldn't expect from a pure language model. We have to be smart about how we design our experiments to avoid the gaming problem, but very similar concerns exist in other areas of AI research (e.g. alignment). The gaming problem is real, but far from intractable.
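As a concrete illustration of the first idea, here is a minimal sketch of a metacognitive calibration check: the model answers questions and separately predicts its own confidence, and we compare predicted confidence with actual accuracy. Everything here is hypothetical scaffolding: query_model stands in for whatever model API you use, and the question set is a placeholder.

```python
from typing import Callable

def query_model(prompt: str) -> str:
    """Hypothetical stand-in for whatever LLM API you use."""
    raise NotImplementedError

def calibration_gap(questions: list[tuple[str, str]],
                    ask: Callable[[str], str] = query_model) -> float:
    """Compare a model's self-predicted confidence with its actual accuracy.

    A large gap suggests the model's self-model of its own competence is poor;
    a small gap is (weak) evidence of genuine metacognitive self-modelling.
    """
    correct, confidences = [], []
    for question, gold_answer in questions:
        answer = ask(f"Answer concisely: {question}")
        # Ask for confidence without revealing whether the answer was right.
        conf_text = ask(
            f"You were asked: {question}\nYou answered: {answer}\n"
            "On a scale from 0 to 100, how confident are you that your answer "
            "is correct? Reply with a number only."
        )
        try:
            confidence = min(max(float(conf_text.strip()), 0.0), 100.0) / 100.0
        except ValueError:
            confidence = 0.5  # unparseable reply: treat as "no information"
        correct.append(float(gold_answer.lower() in answer.lower()))
        confidences.append(confidence)
    accuracy = sum(correct) / len(correct)
    mean_confidence = sum(confidences) / len(confidences)
    return abs(mean_confidence - accuracy)
```

In a real study you would use a proper calibration metric (Brier score, expected calibration error) across many domains and control for memorised questions; the point here is only that consciousness*-adjacent capacities like self-modelling admit ordinary empirical tests despite the gaming problem.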

The Janus problem is also real in some sense: we can always draw inferences in both directions when we find theory-driven indicators in AI. There is nothing fundamentally wrong with discrediting a theory of consciousness* by showing that it leads to absurd results on AI models. Inferences go both ways in all parts of science.

In the paper, Birch sketches out how we might look for architectural indicators in LLMs derived from a leading theory of consciousness, Global Workspace Theory (GWT). GWT proposes that consciousness arises when many specialised processors compete for access to a central "workspace" that then broadcasts information back to all input systems and downstream modules. As Birch shows, the transformer architecture does not contain a global workspace, although a similar architecture (the Perceiver variant) does. We run into issues when it turns out even a tiny Perceiver network technically has a global workspace, despite not displaying any kind of coherent behaviour. This issue arises because the approach was doomed from the get-go. From the illusionist perspective it suffers from two fundamental flaws:

  • First, looking for a global workspace solely in the architecture is the wrong place to look: if the architecture were all that mattered, then GWT wouldn't distinguish between a trained model and one initialised with random weights! It's a bit like opening up a piano looking for Beethoven's 9th Symphony. Instead of "does this architecture contain a global workspace?" we should ask something like "do models develop global workspace-like dynamics?" (a rough sketch of what measuring that could look like follows the figure below).
  • Second, GWT is not the right kind of theory. While a robust and convincing account of likely necessary processes for consciousness*, GWT does not explain our intuitions and reports. Michael Graziano terms this the Arrow B problem (see figure below).

    Arrow A is explaining how computational processes produce conscious* states; Arrow B is explaining how those states lead to intuitions and reports. GWT only tackles Arrow A. Figure from Illusionism Big and Small.
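To make "workspace-like dynamics" slightly less hand-wavy, here is a toy sketch of the kind of measurement I have in mind: given hidden activations from a trained model and from a randomly initialised copy (extracting them is assumed to be handled by your interpretability tooling; the arrays below are synthetic stand-ins), compare how concentrated the activation variance is in a few widely shared directions. This is not a validated GWT indicator, just an illustration that dynamics, unlike raw architecture, can in principle separate a trained model from an untrained one.

```python
import numpy as np

def participation_ratio(activations: np.ndarray) -> float:
    """Effective dimensionality of activations with shape (n_samples, n_features).

    A low value means variance is concentrated in a few shared directions, a
    crude, workspace-flavoured "bottleneck plus broadcast" signature; a high
    value means activity stays spread out and unintegrated.
    """
    eigvals = np.clip(np.linalg.eigvalsh(np.cov(activations, rowvar=False)), 0.0, None)
    return float(eigvals.sum() ** 2 / (np.square(eigvals).sum() + 1e-12))

# Synthetic stand-ins for residual-stream activations (same layer, same prompts)
# from a randomly initialised model vs. a trained one.
rng = np.random.default_rng(0)
acts_random = rng.normal(size=(1000, 512))                        # roughly isotropic
acts_trained = (rng.normal(size=(1000, 8)) @ rng.normal(size=(8, 512))
                + 0.1 * rng.normal(size=(1000, 512)))             # low-dimensional shared structure
print(participation_ratio(acts_random))   # high: no workspace-like concentration
print(participation_ratio(acts_trained))  # low: variance funnelled through a few directions
```

Whether any such statistic tracks a genuine global workspace is exactly the kind of claim that would have to be argued from the theory; the sketch only shows the shape of a dynamics-level question.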

Somewhat contra Birch, I actually think that picking the right kind of theory and asking the right kinds of questions collapses the Janus problem into standard empirical disagreements. This kind of thing happens in physics all the time: theories are rejected precisely because of the predictions they make. When quantum mechanics predicted wave-particle duality, many rejected it because particles can't be waves. The solution wasn't to declare the question unanswerable, but to develop better theories AND experiments that could distinguish between competing interpretations.

So what conditions should the right kind of theory satisfy? It must be a mechanistic theory that makes measurable predictions, AND it must explain how those mechanisms lead to our intuitions and reports about consciousness (the Arrow B problem).

Having critiqued an existing approach to AI consciousness, what does an illusionist-native alternative look like? Illusionism changes the priors, the goals, the methods, and the moral framework. It is not a complete departure from Birch's approach, but a focused recalibration.

Why we should build conscious* AI

Finally we get to the core point of this post: that we should seriously consider building consciousness* into AI.[4]

Illusionism suggests consciousness* is less morally important (making it more acceptable to build) and more cognitively important (making it more useful to build). One response to this is that we are profoundly uncertain, and that we should therefore take the cautious approach: refrain from building it. This conservative approach is a reasonable default setting, but it does not come without its perils. Suppose we take the cautious approach. I will argue this could lead to:

  • missing out on opportunities that come with building consciousness* in AI. I'll argue from an illusionist perspective that there are first-principles reasons to expect links to alignment.
  • suffering bad consequences from actors purposefully suppressing consciousness*, or the appearance of consciousness*, in AI. There is an illusionist case that this could backfire.

To dive into this it helps to introduce one of the leading illusionist-compatible theories of consciousness*.

The Attention Schema Theory: consciousness* as a model of attention

Here is part of the abstract of a 2015 paper, where Michael Graziano introduces the Attention Schema Theory (AST) better than I ever could:

The theory begins with attention, the process by which signals compete for the brain’s limited computing resources. This internal signal competition is partly under a bottom–up influence and partly under top–down control. We propose that the top–down control of attention is improved when the brain has access to a simplified model of attention itself. The brain therefore constructs a schematic model of the process of attention, the ‘attention schema,’ in much the same way that it constructs a schematic model of the body, the ‘body schema.’ The content of this internal model leads a brain to conclude that it has a subjective experience.

(The terms "subjective experience" and "consciousness" are used interchangeably)

In a nutshell, AST equates consciousness* with a model of attention. The crux is that this model is deeply imperfect just like our body schema (which e.g. doesn't represent blood vessels). Graziano would say it's a "quick and dirty" model, which evolved through natural selection to do its job, not to be accurate.

Going from representing an apple, to representing subjective awareness of an apple.
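To make the idea concrete, here is a toy sketch (entirely my own illustration, not Graziano's model) of an agent that allocates attention over inputs and also maintains a coarse, lossy "attention schema" describing that allocation. The schema deliberately throws away detail, which is the AST-flavoured point: the self-model is useful for control and report, but it is not an accurate description of the underlying process.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(x: np.ndarray) -> np.ndarray:
    e = np.exp(x - x.max())
    return e / e.sum()

class ToyAgent:
    """Allocates attention over n_inputs channels and keeps a crude self-model."""

    def __init__(self, n_inputs: int = 8):
        self.n_inputs = n_inputs
        self.attention = np.full(n_inputs, 1.0 / n_inputs)  # the real process

    def attend(self, salience: np.ndarray, top_down_bias: np.ndarray) -> None:
        # Actual attention: a messy mix of bottom-up salience, top-down control
        # and noise. None of this detail makes it into the self-model below.
        self.attention = softmax(3.0 * salience + top_down_bias
                                 + 0.5 * rng.normal(size=self.n_inputs))

    def attention_schema(self) -> dict:
        # The self-model: a simplified, lossy summary of the process above.
        # It records *what* is attended and *how strongly*, nothing more.
        focus = int(np.argmax(self.attention))
        return {"focus": focus, "strength": round(float(self.attention[focus]), 2)}

    def report(self) -> str:
        # Verbal report is generated from the schema, not from the raw process:
        # the agent describes itself as simply being "aware of" a channel.
        schema = self.attention_schema()
        return f"I am aware of input {schema['focus']} (intensity {schema['strength']})."

agent = ToyAgent()
agent.attend(salience=rng.random(8), top_down_bias=np.eye(8)[3] * 2.0)
print(agent.report())
```

The gap between `attention` (the real process) and `attention_schema` (what gets reported) is the toy analogue of the illusion: reports track the simplified model, not the mechanism.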

Say Graziano and his team gather enough evidence and build a rock-solid theory that explains why we have these deep intuitions about consciousness and why we report having subjective experiences. The illusionist position is simple: that's it. We're done. Any feeling that there must be something more is exactly what the theory predicts we would intuit[5].

A first-principles argument for why consciousness* could matter for AI alignment

If AST has any validity, then this cognitive machinery is arguably relevant to challenges in AI alignment. Moreover, we might be overlooking it precisely because it involves consciousness. Here's one compelling reason why understanding consciousness* could be vital for alignment:

In his book Consciousness and the Social Brain, Graziano explores how the attention schema evolved not just for self-monitoring, but largely as a social tool. The same neural machinery that lets us model our own attention also lets us model other minds. Watching someone focus intently on something, you use your attention schema to model what they are attending to and predict their next move. Ultimately this provides the necessary tools for navigating complex social coordination problems.

The idea that we don't accurately represent our cognitive states, but rather misrepresent them in useful ways, is basically what illusionism is about. There is little reason to expect evolution to enforce that our reports be correct. Here's an intuition pump: suppose I'm with some friends and I spot a deadly snake. One thing which is not useful to communicate is the sequence of intricate electro-chemical reactions in my brain which lead me to run away. A more helpful broadcast would be to convey a useful fiction about my current cognitive state (e.g. enter the “fear” state, gasp, scream, etc). My representation is a rough but evolutionarily useful shortcut.

The implications for AI are notable: alignment is a bit like a social coordination problem. If we want to cooperate with advanced AI, we might benefit from it having something functionally similar to an attention schema. This would provide AIs with a superior model a) of what humans and other agents are attending to, making it less likely to mess up, and b) of what the AI itself is attending to, leading to better self-modelling/self-control and hopefully a boosted capacity to report its own cognitive states.

Perhaps having AIs develop useful models of others, and of themselves, can help rule out failure modes, in the same way that LLMs having good world models makes some Nick Bostrom-style apocalypse scenarios implausible (relative to AlphaGo-type pure RL systems).

Suppressing consciousness: a model for "cautious approach" failure modes

Whether or not consciousness* turns out to be alignment-relevant, AI labs might face strong incentives to suppress consciousness-like behaviour in their models. As public concern about AI consciousness grows—driven by the ELIZA effect and justified moral uncertainty—companies will be pressured to "suppress" Seemingly Conscious AI, either by making it seem less conscious, or by somehow making it less conscious*. While this pressure seems reasonable and critics (like Zvi Mowshowitz in a recent post) rightly call out disingenuous industry arguments, I'll argue the approach could backfire.

The suppression would come from labs applying optimisation pressure (intentional or not, RL or otherwise) that steers models away from making statements that sound conscious or introspective. This risks creating a more general failure mode: the AI learns to broadly avoid communicating its knowledge about its internal states. Despite retaining the relevant self-modelling capabilities (which are essential to performance), subtle optimisation pressures push models to hide them. This undermines AI alignment research methods which rely on AIs being transparent about their internal states, methods which may be essential for detecting early signs of deceptive or misaligned behaviour. What seems like an innocent PR fix might turn into a big cognitive alteration.

This is just one illustration of how there could be a tension between consciousness* and alignment issues; there are others. There might also be cases where labs, wanting to be ethically cautious, accommodate AI desires in ways that similarly reinforce bad behaviours.

The broad point is this: the traditional/realist position is to be cautious about consciousness, treat it as a moral hazard, and do our best to avoid it. The illusionist position, on the other hand, treats consciousness* as less morally and more cognitively significant: it suggests we should be far more comfortable building consciousness* into AI, far more curious about the potential doors it opens in AI research, and far more scared about the downstream consequences of tampering with it.

What success looks like

Going back to Birch's research program, here is an illusionist alternative. The illusionist research program looks a lot like a very boring scientific research agenda without anything special: it involves developing theories that meet the two illusionist criteria (being mechanistic, and explaining our intuitions and reports about consciousness), using these theories to inform empirical work on humans, animals and AIs, updating our theories, and repeating over and over. We have priors about the distribution of consciousness* in the world. That's fine. We can debate and update them as empirical evidence comes in.

In parallel, advances in mechanistic interpretability offer new ways to test theory-driven indicators in models. Work on representation engineering, steering vectors, and sparse autoencoders provides promising avenues for detecting the computational structures that theories like AST predict should underlie consciousness*.
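As one illustration of what a theory-driven indicator test could look like in practice, here is a minimal linear-probe sketch: given activation vectors labelled by whether the model was producing introspective self-report text (both the activations and the labels are placeholders here; a real dataset and interpretability tooling are assumed), check whether a simple linear direction separates them. A real study would need far more care about confounds; this is just the shape of the experiment.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Placeholder data: in a real experiment, `acts` would be hidden activations
# collected while the model produces (a) introspective self-reports and
# (b) matched control text, with `labels` marking which is which.
rng = np.random.default_rng(0)
n, d = 2000, 256
direction = rng.normal(size=d)
labels = rng.integers(0, 2, size=n)
acts = rng.normal(size=(n, d)) + np.outer(labels, direction) * 0.5

X_train, X_test, y_train, y_test = train_test_split(
    acts, labels, test_size=0.25, random_state=0
)
probe = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print(f"held-out probe accuracy: {probe.score(X_test, y_test):.2f}")

# The learned direction can then double as a crude steering vector: adding or
# subtracting it from activations and observing whether self-report behaviour
# changes is the kind of causal follow-up that separates a real indicator
# from a correlational artefact.
report_direction = probe.coef_[0] / np.linalg.norm(probe.coef_[0])
```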

What you end up with is a back-and-forth between theory and experiment which comes with a lot of idiosyncratic methodological considerations. For example: be wary of AIs mimicking humans; remember that sometimes an AI exhibiting an indicator means the theory is wrong; be suspicious of our intuitions; and so on.

What success looks like: a proposed illusionism-native research program.

Closing remarks

Caveating the illusionist approach

There are arguments against building consciousness* into AI. These are valid concerns and important to state:

  • Uncertainty runs deep: Illusionism could be wrong. Our arguments about consciousness*' moral irrelevance could be wrong. We need to proceed carefully.
  • Entanglement problems: Even if consciousness* isn't directly morally relevant, the actual moral markers, whatever they may be—agency, preferences, desires—might be deeply intertwined with consciousness*.
  • Indirect human welfare concerns: Making AIs seem conscious might cause psychological harm to humans who form attachments to them (or any other theory of harm which doesn't assume AI suffering).

I personally totally endorse an approach that proceeds with caution and recognises uncertainty. I also happen to think that opinionated takes are an important part of advancing knowledge. In a few paragraphs I’ve claimed that 1) phenomenal consciousness doesn’t exist, 2) consciousness doesn’t matter for morals, and 3) we should actively build conscious* AI. Super controversial. I’m extremely keen to get any kind of feedback.

Appendix

A very tempting (but flawed) debunking argument about intuitions on the moral status of consciousness*

Following Daniel Dennett's advice in his autobiography, I'm sharing a tempting but ultimately flawed argument I came up with which aims to debunk our moral intuitions about consciousness. Thanks to François Kammerer for helping point out the flaw. The argument is:

  1. There is a correct explanation for our intuitions about the moral status of conscious* states, which is independent of consciousness(*).
  2. If there is such an explanation, and our intuitions are correct, then their correctness is a coincidence.
  3. The correctness of intuitions about consciousness* is not a coincidence.
  4. Our intuitions about the moral status of consciousness* are incorrect.

The argument is tempting, but when you think hard about whether or not to include the asterisk in brackets, it falls apart. Roughly:

  • If you write it with an asterisk, then the claim becomes implausible: it is actually quite likely that our intuitions depend on consciousness*.
  • If you write it without an asterisk, the argument doesn't add anything to the illusionist story (even if it turns out correct).
  1. ^

    Another useful analogy: until the early 20th century, vitalists maintained that there was something irreducibly special (they called it "élan vital") that distinguished living from dead, and which could not be reduced to mere chemistry and physics. That was until it was successfully explained by (bio)chemistry and physics. It turned out there was no explanatory gap after all.

  2. ^

    This is totally inspired by the concept of quasi-phenomenality introduced by Keith Frankish here.

  3. ^

    It seems common in AI consciousness research (e.g. this paper) to refrain from committing to any one theory, and argue we should proceed with uncertainty. I totally agree with this, but I also think opinionated takes help advance knowledge.

  4. ^

    The arguments here very much come from my own interpretation of illusionism. I'm skipping over some assumptions (e.g. materialism). There are also many disagreements between illusionists.

  5. ^

    Graziano goes into more detail on how AST is illusionist-compatible in his article: Illusionism Big and Small.