[Intuitive self-models] 1. Preliminaries

The sequence has been reviewed by Scott Alexander in Practically-A-Book Review: Byrnes on Trance.

I too find that the dancer just will. not. spin. counterclockwise. no matter how long I look at it.

But after trying a few things, I found an "intervention" to make it so. (No clue whether it'll work for anyone else, but I find it interesting that it works for me.) While looking at the dancer, I hold my right hand in front of the gif on the screen, slightly below so I can see both; then as the leg goes leftward, I perform counter-clockwise rotation with the hand, as if loosening an oversized screw. (And I try to make the act very deliberate, rather than absent-mindedly doing the movement.) After repeating this a few times, I generally perceive the counter-clockwise rotation, which sometimes lasts a few seconds and sometimes longer.

I also tried putting other counter-clockwise-spinning animations next to the dancer, but that didn't do anything.

[-]Linda Linsefors1y80

I tried it and it works for me too.

For me the dancer was spinning contraclockwise and would not change. With your screwing trick I could change rotation, and where now stably stuck in the clockwise direction. Until I screwed in the other direction. I've now done this back and forth a few times.

[-]Steven Byrnes1y60

In case you missed it, there’s a show/hide box at the bottom of the wiki article with three side-by-side synchronized spinning dancers—the original one in the middle, and broken-symmetry ones on either side, with internal edges drawn in to break the ambiguity. I fixed my gaze on the counterclockwise dancer while gradually uncovering the original dancer with my hand, and thus got the original spinning counterclockwise in my peripheral vision. Then I gradually moved my eyes towards the original and had her spinning counterclockwise in the center of my view for a bit! …But then she flipped back when I blinked. Sounds vaguely similar to the kind of thing you were doing. I got bored pretty quick and stopped trying :)

[-]David Joshua Sartor1y10

I can see the dancers spinning in different directions.

[-]Lucie Philippon1y30

I find that by focusing on the legs of the dancer, I managed to see it oscillating: half-turn clockwise then half-turn counterclockwise with the feet towards the front. However, this always break when I start looking at the arms. Interesting

[-]kvas_it1y30

For me your method didn't work, but I found another one. I wave the finger (that's pointing down) in front of the image in a spinning motion synchronized with the leg movement and going in the direction that I want. The finger obscures the dancer quite a bit, which makes it easier for me to imagine it spinning in the "right" direction. Sometimes I'd see it spin in the "right" direction for like 90 degrees and then stubbornly go back again, but eventually it complies and starts spinning in how I want it. Then I can remove the finger and it continues.

[-]Morpheus1y30

On my phone, rotating the screen by 180° quickly reverses the direction and then I rotate it back slowly.

[-]Optimization Process2mo60

I've read about half of this sequence, and it's certainly the most palatable, well-founded-seeming discussion of consciousness I've ever encountered.

But... I've kind of run aground on the question: how would I tell if this is true? (Or, you know, all models are false etc., but how would I tell if this is useful?)

Three examples of how a theory can useful: "Hey, I came up with this new theory of blurtzian phenomena! ...

Make predictions: "...The literature has catalogued 347 kinds of blurtz, but under this model, there should be at least two more, with the following characteristics: [...]"
Distill: "...The literature has catalogued 351 kinds of blurtz with various complicated characteristics, but under this model, all those complicated characteristics are pretty closely retrodicted by modeling each of the (3^3 choose 2) blurtzes as being the interaction of [...]"
Babble: "...The literature has a couple different models of blurtzes, all with various open questions. Here's one more. It's not obviously right, but it's another promising direction to go."

This sequence doesn't feel like (1) or (2) to me. Is it (3), or something else?

[-]Steven Byrnes2mo40

Thanks! My perspective for this kind of thing is: if there’s some phenomenon in psychology or neuroscience, I’m not usually in the situation where there are multiple incompatible hypotheses that would plausibly explain that phenomenon, and we’d like to know which of them is true. Rather, I’m usually in the situation where I have zero hypotheses that would plausibly explain the phenomenon, and I’m trying to get up to at least one.

There are so many constraints from what I (think I) know about neuroscience, and so many constraints from what I (think I) know about algorithms, and so many constraints from what I (think I) know about everyday life, that coming up with any hypothesis at all that can’t be easily refuted from an armchair is a huge challenge. And generally when I find even one such hypothesis, I wind up in the long term ultimately feeling like it’s almost definitely true, at least in the big picture. (Sometimes there are fine details that can’t be pinned down without further experiments.)

It’s interesting that my outlook here is so different from other people in science, who often (not always) feel like the default should be to have multiple hypotheses from the get-go for any given phenomenon. Why the difference? Part of it might be the kinds of questions that I’m interested in. But part of it, as above, is that I have lots of very strong opinions about the brain, which are super constraining and thus rule out almost everything. I think this is much more true for me than almost anyone else in neuroscience, including professionals. (Here’s one of many example constraints that I demand all my hypotheses satisfy.)

So anyway, the first goal is to get up to even one nuts-and-bolts hypothesis, which would explain the phenomenon, and which is specific enough that I can break it down all the way down to algorithmic pseudocode, and then even further to how that pseudocode is implemented by the cortical microstructure and thalamus loops and so on, and that also isn’t immediately ruled out by what we already know from our armchairs and the existing literature. So that’s what I’m trying to do here, and it’s especially great when readers point out that nope, my hypothesis is in fact already in contradiction to known psychology or neuroscience, or to their everyday experience. And then I go back to the drawing board. :)

[-]Optimization Process2mo40

I see! Thanks for the thoughtful response. I think my problem is caused by not having brought enough neuroscience and psychology textbooks to my armchair, leaving me in too-many-plausible-hypotheses-land, rather than your too-few-. I'll take another stab at this sequence if/when I collect more background knowledge!

[-]Dalcy10mo40

Curious about the claim regarding bistable perception as the brain "settling" differently on two distinct but roughly equally plausible generative model parameters behind an observation. In standard statistical terms, should I think of it as: two parameters having similarly high Bayesian posterior probability, but the brain not explicitly representing this posterior, instead using something like local hill climbing to find a local MAP solution—bistable perception corresponding to the two different solutions this process converges to?

If correct, to what extent should I interpret the brain as finding a single solution (MLE/MAP) versus representing a superposition or distribution over multiple solutions (fully Bayesian)? Specifically, in which context should I interpret the phrase "the brain settling on two different generative models"?

[-]Steven Byrnes10mo30

should I think of it as: two parameters having similarly high Bayesian posterior probability, but the brain not explicitly representing this posterior, instead using something like local hill climbing to find a local MAP solution—bistable perception corresponding to the two different solutions this process converges to?

Yup, sounds right.

to what extent should I interpret the brain as finding a single solution (MLE/MAP) versus representing a superposition or distribution over multiple solutions (fully Bayesian)?

I think it can represent multiple possibilities to a nonzero but quite limited extent; I think the superposition can only be kinda local to a particular subregion of the cortex and a fraction of a second. I talk about that a bit in §2.3.

in which context should I interpret the phrase "the brain settling on two different generative models"

I wrote "your brain can wind up settling on either of [the two generative models]", not both at once.

…Not sure if I answered your question.

[-]Dalcy10mo40

I wrote "your brain can wind up settling on either of [the two generative models]", not both at once.

Ah that makes sense. So the picture I should have is: whatever local algorithm oscillates between multiple local MAP solutions over time that correspond to qualitatively different high-level information (e.g., clockwise vs counterclockwise). Concretely, something like the metastable states of a Hopfield network, or the update steps of predictive coding (literally gradient update to find MAP solution for perception!!) oscillating between multiple local minima?

[-]justinpombrio1y41

This is fantastic! I've tried reasoning along these directions, but never made any progress.

A couple comments/questions:

Why "veridical" instead of simply "accurate"? To me, the accuracy of a map is how well it corresponds to the territory it's trying to map. I've been replacing "veridical" with "accurate" while reading, and it's seemed appropriate everywhere.

Do you see the Spinning Dancer going clockwise? Sorry, that’s not a veridical model of the real-world thing you’re looking at. [...] after all, nothing in the real world of atoms is rotating in 3D.

I think you're being unfair to our intuitive models here.

The GIF isn't rotating, but the 3D model that produced the GIF was rotating, and that's the thing our intuitive models are modeling. So exactly one of [spinning clockwise] and [spinning counterclockwise] is veridical, depending on whether the graphic artist had the dancer rotating clockwise or counterclockwise before turning her into a silhouette. (Though whether it happens to be veridical is entirely coincidental, as the silhouette is identical to the one that would have been produced had the dancer been spinning in the opposite direction.)

If you look at the photograph of Abe Lincoln from Feb 27, 1860, you see a 3D scene with a person in it. This is veridical! There was an actual room with an actual person in it, who dressed that way and touched that book. The map's territory is 164 years older than the map, but so what.

(My favorite example of an intuitive model being wildly incorrect is Feynman's story of learning to identify kinds of galaxies from images on slides. He asks his mentor "what kind of galaxy is this one, I can't identify it", and his mentor says it's a smudge on the slide.)

[-]Steven Byrnes1y40

This is fantastic!

Thanks! :)

Why "veridical" instead of simply "accurate"?

Accurate might have been fine too. I like “veridical” mildly better for a few reasons, more about pedagogy than anything else.

One reason is that “accurate” has a strong positive-valence connotation (i.e., “accuracy is good, inaccuracy is bad”), which is distracting, since I’m trying to describe things independently of whether they’re good or bad. I would rather find a term with a strictly neutral vibe. “Veridical”, being a less familiar term, is closer to that. But alas, I notice from your comment that it still has some positive connotation. (Note how you said “being unfair”, suggesting a frame where I said the intuition was non-veridical = bad, and you’re “defending” that intuition by saying no it’s actually veridical = good.) Oh well. It’s still a step in the right direction, I think.

Another reason is I’m trying hard to push for a two-argument usage (“X is or is not a veridical model of Y“), rather than a one-argument usage (“X is or is not veridical”). I wasn’t perfect about that. But again, I think “accurate” makes that problem somewhat worse. “Accurate” has a familiar connotation that the one-argument usage is fine because of course everybody knows what is the territory corresponding to the map. “Veridical” is more of a clean slate in which I can push people towards the two-argument usage.

Another thing: if someone has an experience that there’s a spirit talking to them, I would say “their conception of the spirit is not a veridical model of anything in the real world”. If I said “their conception of the spirit is not an accurate model of anything in the real world”, that seems kinda misleading, it’s not just a matter of less accurate versus more accurate, it’s stronger than that.

The GIF isn't rotating, but the 3D model that produced the GIF was rotating, and that's the thing our intuitive models are modeling. So exactly one of [spinning clockwise] and [spinning counterclockwise] is veridical, depending on whether the graphic artist had the dancer rotating clockwise or counterclockwise before turning her into a silhouette.

It was made by a graphic artist. I’m not sure their exact technique, but it seems at least plausible to me that they never actually created a 3D model. Some people are just really good at art. I dunno. This seems like the kind of thing that shouldn’t matter though! :)

Anyway, I wrote “that’s not a veridical model of the real-world thing you’re looking at” to specifically preempt your complaint. Again see what I wrote just above, about two-argument versus one-argument usage :)

[-]justinpombrio1y30

I like “veridical” mildly better for a few reasons, more about pedagogy than anything else.

That's a fine set of reasons! I'll continue to use "accurate" in my head, as I already fully feel that the accuracy of a map depends on which territory you're choosing for it to represent. (And a map can accurately represent multiple territories, as happens a lot with mathematical maps.)

Another reason is I’m trying hard to push for a two-argument usage

Do you see the Spinning Dancer going clockwise? Sorry, that’s not a veridical model of the real-world thing you’re looking at.

My point is that:

The 3D spinning dancer in your intuitive model is a veridical map of something 3D. I'm confident that the 3D thing is a 3D graphical model which was silhouetted after the fact (see below), but even if it was drawn by hand, the 3D thing was a stunningly accurate 3D model of a dancer in the artist's mind.
That 3D thing is the obvious territory for the map to represent.
It feels disingenuous to say "sorry, that's not a veridical map of [something other than the territory map obviously represents]".

So I guess it's mostly the word "sorry" that I disagree with!

By "the real-world thing you're looking at", you mean the image on your monitor, right? There are some other ways one's intuitive model doesn't veridically represent that such as the fact that, unlike other objects in the room, it's flashing off and on at 60 times per second, has a weirdly spiky color spectrum, and (assuming an LCD screen) consists entirely of circularly polarized light.

It was made by a graphic artist. I’m not sure their exact technique, but it seems at least plausible to me that they never actually created a 3D model.

This is a side track, but I'm very confident a 3D model was involved. Plenty of people can draw a photorealistic silhouette. The thing I think is difficult is drawing 100+ silhouettes that match each other perfectly and have consistent rotation. (The GIF only has 34 frames, but the original video is much smoother.) Even if technically possible, it would be much easier to make one 3D model and have the computer rotate it. Annnd, if you look at Nobuyuki Kayahara's website, his talent seems more on the side of mathematics and visualization than photo-realistic drawing, so my guess is that he used an existing 3D model for the dancer (possibly hand-posed).

[-]Steven Byrnes1y30

I think we’re in agreement on everything.

By "the real-world thing you're looking at", you mean the image on your monitor, right?

Yup, or as I wrote: “2D pattern of changing pixels on a flat screen”.

I'm very confident a 3D model was involved

For what it’s worth, even if that’s true, it’s still at least possible that we could view both the 3D model and the full source code, and yet still not have an answer to whether it’s spinning clockwise or counterclockwise. E.g. perhaps you could look at the source code and say “this code is rotating the model counterclockwise and rendering it from the +z direction”, or you could say “this code is rotating the model clockwise and rendering it from the -z direction”, with both interpretations matching the source code equally well. Or something like that. That’s not necessarily the case, just possible, I think. I’ve never coded in Flash, so I wouldn’t know for sure. Yeah this is definitely a side track. :)

Nice find with the website, thanks.

[-]justinpombrio1y40

I think we’re in agreement on everything.

Excellent. Sorry for thinking you were saying something you weren't!

still not have an answer to whether it’s spinning clockwise or counterclockwise

More simply (and quite possibly true), Nobuyuki Kayahara rendered it spinning either clockwise or counterclockwise, lost the source, and has since forgotten which way it was going.

[-]Kaj_Sotala1y40

On the topic of bistable perception, this is one of my favorite examples:

(Animated version)

[-]Gunnar_Zarncke1y40

…So that’s all that’s needed. If any system has both a capacity for endogenous action (motor control, attention control, etc.), and a generic predictive learning algorithm, that algorithm will be automatically incentivized to develop generative models about itself (both its physical self and its algorithmic self), in addition to (and connected to) models about the outside world.

Yes, and there are many different classes of such models. Most of them boring because the prediction of the effect of the agent on the environment is limited (small effect or low data rate) or simple (linear-ish or more-is-better-like).

But the self-models of social animals will quickly grow complex because the prediction of the action on the environment includes elements in the environment - other members of the species - that themselves predict the actions of other members.

You don't mention it, but I think Theory of Mind or Emphatic Inference play a large role in the specific flavor of human self-models.

[-]Rafael Harth1y40

1.6.2 Are explanations-of-self-reports a first step towards understanding the “true nature” of consciousness, free will, etc.?

Fwiw I've spent a lot of time thinking about the relationship between Step 1 and Step 2, and I strongly believe that step 1 is sufficient or almost sufficient for step 2, i.e., that it's impossible to give an adequate account of human phenomenology without figuring out most of the computational aspects of consciousness. So at least in principle, I think philosophy is superfluous. But I also find all discussions I've read about it (such as the stuff from Dennett, but also everything I've found on LessWrong) to be far too shallow/high-level to get anywhere interesting. People who take the hard problem seriously seem to prefer talking about the philosophical stuff, and people who don't seem content with vague analogies or appeals to future work, and so no one -- that I've seen, anyway -- actually addresses what I'd consider to be the difficult aspects of phenomenology.

Will definitely read any serious attempt to engage with step 1. And I'll try not be biased by the fact that I know your set of conclusions isn't compatible with mine.

[-]Paradiddle1y32

I strongly believe that step 1 is sufficient or almost sufficient for step 2, i.e., that it's impossible to give an adequate account of human phenomenology without figuring out most of the computational aspects of consciousness.

Apologies for nitpicking, but your strong belief that step 1 is (almost) sufficient for step 2 would be more faithfully re-phrased as: it will (probably) be possible/easy to give an adequate account of human phenomenology by figuring out most of the computational aspects of consciousness. The way you phrased it (viz., "impossible...without") is equivalent to saying that step 1 is necessary for step 2, an importantly different claim (on this phrasing, something besides the computational aspects may be required). Of course, you may think it is both necessary and sufficient, I'm just pointing out the distinction.

[-]Rafael Harth1y20

Mhh, I think "it's not possible to solve (1) without also solving (2)" is equivalent to "every solution to (1) also solves (2)", which is equivalent to "(1) is sufficient for (2)". I did take some liberty in rephrasing step (2) from "figure out what consciousness is" to "figure out its computational implementation".

[-]Stepan3mo30

Thank you for that series! Learnt about it from Scott's book review, and decided to read the original.

The first half of this post is the conventional basic knowledge from neuroscience, as I understand it. I was following and nodding along and thinking "yeah this is cool, makes sense" until section 1.4, where the solid before logic started breaking down a bit, or at least it seems so to me.

Before that, when you were talking about predicting, you were talking about predicting sensory input. There is some suspiciously car-shaped sensory input on my retina, then I get engine-and-tires-shaped sensory input in my ears. I would be less surprised to hear "wrrrr" after I see something car-shaped if I develop a "car" concept and learn to invoke it when I see something car-shaped, which is most likely a car.

Then, if I see a road, I would be less surprised when I hear "wrrrr" if I learn to invoke the "car" concept even before seeing a car, in a situation where cars are likely to appear. "Less surprised" in a technical sense, obviously: I assign more probability to hearing "wrrrr" when I see a road, because of the "car" model being active. There is a learned connection between "road-shaped sensory input" -> " 'road' concept" -> " 'car' concept" -> "prediction of car-shaped sensory input" because the car-shaped sensory input just follows the road-shaped sensory input. When I observe one, I actually expect the other.

Then you introduce the distinction between the model being active for "exogenous" and "endogenous" reasons, and start talking about how predicting when a model will be active for endogenous reasons is good for predicting... what? It kind of feels like we've lost the "sensory data" in "predicting sensory data", and now just predicting when the concept itself is active became good for some reason. Predicting when the "car" concept will be active for exogenous reasons was good because it's likely that the sensory data predictable by the car concept will follow. Is it at all the case with the "endogenous" reasons? Let's look at the examples in the post:

I’m thinking about screws right now [? not sure]

I’m worried about the screws [NO?]

I can never remember where I left the screws [YES? --- you would kind of go looking for screws and then maybe find them?]

Maybe the “car” concept is active in my mind because it spontaneously occurred to me that it would be a good idea to go for a drive right about now [YES]

Or maybe it’s on my mind because I’m anxious about cars [NO]

It kind of goes either way, and if predicting the sensory input was the actual end goal of the predictive algorithm, wouldn't the distinction between the two cases be very important, and wouldn't it be worth predicting only one of them?

I think it would help if you clarified what we are actually predicting by predicting when some concept will be active, and why we are doing that.

[-]intiluha1mo40

We don't need to explain why/whether predicting "endogenous" activations is good, if we accept hypothesis that that's how brain is wired - it runs prediction learning by default. It makes sense, because the affected cluster of neurons doesn't know if this activation is exo or endo.
Prediction learning for endo activations is conceptually the learning of shortcuts: if screw model activation predictably leads through a chain of intermediate steps to "being worried" model, then a good predictor would learn to activate the latter model right away after seeing screw.

[-]Stepan1mo30

That makes sense, thanks!

[-]Paradiddle1y32

Section 1.6 is another appendix about how this series relates to Philosophy Of Mind. My opinion of Philosophy Of Mind is: I’m against it! Or rather, I’ll say plenty in this series that would be highly relevant to understanding the true nature of consciousness, free will, and so on, but the series itself is firmly restricted in scope to questions that can be resolved within the physical universe (including physics, neuroscience, algorithms, and so on). I’ll leave the philosophy to the philosophers.

At the risk of outing myself as a thin-skinned philosopher, I want to push back on this a bit. If we are taking "philosophy of mind" to mean, "the kind of work philosophers of mind do" (which I think we should), then your comment seems misplaced. Crucially, one need not be defending particular views on "big questions" about the true nature of consciousness, free will, and so on to be doing philosophy of mind. Rather, much of the work philosophers of mind do is continuous with scientific inquiry. Indeed, I would say some philosophy of mind is close to indistinguishable from what you do in this post! For example, lots of this work involves trying to carve up conceptual space in a way that coheres with empirical findings, suggests avenues for further research, and renders fruitful discussion easier. Your section 1.3 in this post features exactly the kind of conceptual work that is the bread-and-butter of philosophy. So, far from leaving philosophy to the philosophers, I actually think your work would fit comfortably into the more empirically informed end of contemporary philosophy of mind. To end on a positive note, I think it's really clearly written, fascinating, and fun to read. So thanks!

[-]Steven Byrnes1y40

Thanks for the kind words!

The thing you quoted was supposed to be very silly and self-deprecating, but I wrote it very poorly, and it actually wound up sounding kinda judgmental. Oops, sorry. I just rewrote it. I agree with everything you wrote in this comment.

[-]Satya Benson1y10

Your brain has a giant space of possible generative models^[2] that map from underlying states of the world (e.g. “there’s a silhouette dancer with thus-and-such 3D shape spinning clockwise against a white background etc.”) to how the photoreceptor cells would send signals into the brain (“this part of my visual field is bright, that part is dark, etc.”)

How do you argue that the models are really implemented backwards like this in the brain?

^{^}

In case you’re wondering, this series will centrally involve probabilistic inference, but will not involve “active inference”. I think most “active inference” discourse is baloney (see Why I’m not into the Free Energy Principle), and indeed I’m not sure how active inference ever became so popular given the obvious fact that things can be plausible but not desirable, and that things can be desirable but not plausible. I think “plausibility” involves probabilistic inference, while “desirability” involves valence—see my Valence series.

^{^}

I think it would be a bit more conventional to say that the brain has a (singular) generative model with lots of adjustable parameters / settings, but I think the discussion will sound more intuitive and flow better if I say that the brain has a whole space of zillions of generative models (plural), each with greater or lesser degrees of a priori plausibility. This isn’t a substantive difference, just a choice of terminology.

^{^}

We can’t read Aristotle’s mind, so we don’t actually know for sure what Aristotle’s intuitive model of the sun was; it’s technically possible that Aristotle was saying things about the sun that he found unintuitive but nevertheless intellectually believed to be true (see §1.3.2.2). But I think that’s unlikely. I’d bet he was describing his intuitive model.

^{^}

The idea that a simpler generative model can’t predict the behavior of a big complicated algorithm is hopefully common sense, but for a related formalization see “Computational Irreducibility” (more discussion here).

^{^}

Reinforcement Learning (RL) is obviously indirectly relevant to the formation of generative models that don’t involve actions. For example, if I really like clouds, then I might spend all day watching clouds, and spend all night imagining clouds, and I’ll thus wind up with unusually detailed and accurate generative models of clouds. RL is obviously relevant in this story: RL is how my love of clouds influences my actions, including both attention control (thinking about clouds) and motor control (looking at clouds). And those actions, in turn, influence the choice of data that winds up serving as a target for predictive learning. But it’s still true that my generative models of clouds are updated only by predictive learning, not RL.

^{^}

“The Standard Model of Particle Physics including weak-field quantum general relativity (GR)” (I wish it was better-known and had a catchier name) appears sufficient to explain everything that happens in the solar system (ref). Nobody has ever found any experiment violating it, despite extraordinarily precise and varied tests. This theory can’t explain everything that happens in the universe—in particular, it can’t make any predictions about either (A) microscopic exploding black holes or (B) the Big Bang. Also, (C) the Standard Model happens to include 18 elementary particles (depending on how you count), because those are the ones we’ve discovered; but the theoretical framework is fully compatible with other particles existing too, and indeed there are strong theoretical and astronomical reasons to think they do exist. It’s just that those other particles are irrelevant for anything happening on Earth—so irrelevant that we’ve spent decades and billions of dollars searching for any Earthly experiment whatsoever where they play a measurable role, without success. Anyway, I think there are strong reasons to believe that our universe follows some set of orderly laws—some well-defined mathematical framework that elegantly unifies the Standard Model with all of GR, not just weak-field GR—even if physicists don’t know what those laws are yet. (I think there are promising leads, but that’s getting off-topic.) …And we should strongly expect that, when we eventually discover those laws, we’ll find that they shed no light whatsoever into how consciousness works—just as we learned nothing whatsoever about consciousness from previous advances in fundamental physics like GR or quantum field theory.

^{^}

Maybe Tor Norretranders’s The User Illusion (1999) belongs in this category, but I haven’t read it.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

99

[Intuitive self-models] 1. Preliminaries

99

99

1.1 Summary & Table of Contents

1.1.1 Summary & Table of Contents—for the whole series

1.1.2 Summary & Table of Contents—for this first post in particular

1.2 Generative models and probabilistic inference

1.2.1 Example: bistable perception

1.2.2 Probabilistic inference

1.2.3 The thing you “experience” is the generative model (a.k.a. “intuitive model”)

1.2.4 Explanation of bistable perception

1.2.5 Teaser: Unusual states of consciousness as a version of bistable perception

1.3 Casting judgment upon intuitive models

1.3.1 “Is the intuitive model real, or is it fake?”

1.3.2 “Is the intuitive model veridical, or is it non-veridical?”

1.3.2.1 Non-veridical intuitive models are extremely common and unremarkable

1.3.2.2 …But of course it’s good if you’re intellectually aware of how veridical your various intuitive models are

1.3.3 “Is the intuitive model healthy, or is it pathological?”

1.4 Why does the predictive learning algorithm build generative models / concepts related to what’s happening in your own mind?

1.4.1 Further notes on the path from predictive learning algorithms to intuitive self-models

1.5 Appendix: Some terminology I’ll be using in this series

1.5.1 Learning algorithms and trained models

1.5.2 Concepts, models, thoughts, subagents

1.6 Appendix: How does this series fit into Philosophy Of Mind?

1.6.1 Introspective self-reports as a “straightforward” scientific question

1.6.2 Are explanations-of-self-reports a first step towards understanding the “true nature” of consciousness, free will, etc.?

1.6.3 Related work

1.7 Conclusion