Yes.
In my head I usually think of it as non-myopic in spacetime (as opposed to just time), but the version which is (somewhat) justified by the Toy Coherence Theorem is non-myopia over time.
My standard answer to the kinds of points you're making here is in A Simple Toy Coherence Theorem, specifically this section.
Those are useful examples, thanks. I'm gonna come at this from a different angle, but first to answer this question:
Maybe put another way: under your model, can the guy in the neckbeard meme dress well?
Unsure, either "no" or "it would be hard enough that I have no idea how he'd do it". I do not think that makes this a model of attractiveness per se, rather I think every outfit has some level of attractiveness required to pull it off (which, for the worst outfits, may be beyond the attractiveness of any actual human). As the relevant section of the post said, attractiveness and charisma are necessary conditions.
Now back to the main cruxy part.
Suppose the neckbeard you linked to looks in the mirror and, in complete self-honesty, gets the vibe that he looks awesome. Like, he imagines someone else who looks just like that, and his gut response is "that guy looks awesome". Well, as long as he's being honest with himself... his own reaction is in fact extremely strong evidence that lots of other people would agree that he looks awesome. It's very unlikely that he is so unique in his tastes (or lack thereof) that nobody else shares them. There may be lots of people who hate it, but at that point it is plausibly just correct to respond "well you can't please everyone, and anyone who stands out is gonna bother some people".
Personally, at this point I would tell him that he's farther than he thinks from the best he could be doing, but yes there are probably many people who will in fact get a pretty good vibe from that outfit (again, assuming the premise that his own honest read was that it looks awesome). So at that point it's down to numbers: just how many people would think it looks awesome, and how many would be driven away? In that guy's case, I'd guess the "awesome" camp would be in the minority, but not so rare that it would never make sense to take the tradeoff ("better to be loved by a few and hated by many, than for everyone to feel nothing" as the saying goes).
This is importantly different from the fat guy in the meme! For that guy, the number of people who think he looks awesome is going to be very close to zero, and probably even in his own head he needs to motivatedly-ignore some things about the guy in the mirror in order to think he looks good.
Main point of all that: insofar as one is honest with oneself, one's own reaction/vibe to a mirror is damn strong evidence of other peoples' reaction/vibe. Probably at least a large minority will feel the same reaction/vibe, even if one lacks the majority's taste.
And for most people, whatever reaction/vibe they get will basically match the reaction/vibe most other people would get; by definition most people are in the majority. If most people have the relevant taste, then most peoples' honest vibe from the mirror will reflect that, and in that case the neckbeard is in the minority. If most people don't have the relevant taste, then most peoples' honest vibe from the mirror will reflect that, and in that case it's fashion twitter which would be in the minority.
I think the neckbeard really truly believes that their outfit is vibe-appropriate.
I buy that, but I think the cause is more like a motivated blindspot than lack of taste.
Let's look at the classic meme:
This guy's problem is not combining an ironic graphic tee with a visor and a fedora and a katana. The guy's outfit is not the main problem. The guy's taste is not the main problem. The problem is he weighs like 300 pounds. What he needs is Ozempic.
Now, what does that guy think when he looks in the mirror?
When that guy looks in the mirror, I think he's telling himself that the outfit looks great, and he's motivatedly-not-thinking about how overweight he is. He's telling himself that the outfit puts him in the same bucket as a better-looking guy wearing that same outfit and facial hair.
That's not a problem of lack of taste, that's a problem of lack of self-honesty. He needs to be honest with himself about what he sees in the mirror.
Maybe a concrete deviation: It seems like you see fashion largely through the lens of offensive vs. nonoffensive? At least this seems to be the main failure mode you describe, when talking about how an outfit can fail. [...] Is it your intuition that people fail to be fashionable by failing to be appropriately noisy? This would explain why bravery is the main requisite virtue to you while being nearly irrelevant to me.
I wouldn't say that's the main lens I use in my head, but that is an accurate description of a major failure mode, plausibly the most common failure mode.
(Aside: I'd prefer the term "loud" over "noisy", because "noisy" is easily confused with noise in the sense of signal vs noise, and signal vs noise is a very central concept here.)
By choosing colors and silhouettes and layers that make sense to me on a particular day, though, I can somewhat-reliably show my friends that I'm feeling energetic and we should do something interesting, or I'm feeling cozy and we should stay in and laugh together, and I can interpret the same from their outfits.
I want to dig in on this. You of course have much more visibility into your own experience than I do, but just going by that description, my first guess would be that there's a different thing actually going on.
Notably, "how energetic is <person> feeling" is one of the easiest things to read off of body language and speech patterns. There's usually a pretty clear energy level being broadcast.
My guess would be:
In particular, one prediction this model makes is that if you dress low-energy but feel/act high energy, your friends will mostly pick up high energy vibes rather than low energy vibes. They might not even notice the clothes mismatching the vibes.
Curious to hear how well that does/doesn't feel like it fits your experiences.
There's a relevant selection effect: plenty of people would benefit from just trying to emulate an archetype, but these are mostly people who are good at picking up non-verbal communication, so will learn a lot by just trying an outfit, noticing how they are perceived, repeating. [...]
I disagree with this model in multiple ways. I don't currently have a short summary of that disagreement, so I'll hit a bunch of points which might gesture in the right direction:
When I imagine someone who "understands the relevant things but hasn't been brave enough to use it", a central example would be someone who can look in the mirror and accurately assess whether they've got the vibe they want, but has mostly optimized their wardrobe based on valence-feedback from other people, without a coherent story/role/narrative. I want to tell that person "pick a strong archetype, make the person in the mirror vibe that archetype, be honest when assessing yourself, then be brave and go rock it".
Channeling an archetype very hard without deeply understanding that archetype makes you look like somebody who really wants to be perceived in a particular way...
I mean, yes, obviously you need to understand the archetype deeply in order to channel it real hard? Ok, I suppose the existence of neckbeards indicates that that is not obvious and I should be explicit about it. But yeah, as with most things, one can certainly shoot oneself in the foot by not understanding what one is doing.
I guess you're imagining that the advice would be "disastrous in basically all cases" because basically everyone who tries it will execute incompetently rather than competently? I'm not sure why you think that, "basically all cases" seems way too strong here. There are, for example, probably people who already understand the relevant social signals and are capable of competent execution, but either haven't thought through what would actually benefit them most or haven't been brave enough to actually do what would benefit them most.
Regarding your four specific points:
On a meta note: I'm responding to most of your points because I find this discussion interesting. But I know a dense thread like this can be tiring, so please don't feel too obligated to respond if you find it draining.
If you want to get better at fashion because you want to unify external perception, I expect this to fail unless you want your perception to be a stereotype (scene clothes, suit-and-tie, fedora and neckbeard).
That would be roughly my recommendation: pick a universally-recognizable archetype and channel that archetype real hard. (This generalizes beyond "coolness" to other goals.) Otherwise, you're just sending messages which are in-practice far more ambiguous and weak than you think to an audience which is small to begin with, and probably the number of people who will actually recognize what you intended the clothes to say is approximately zero.
The second one: the AIs will be producing legible-and-verifiable breakthroughs in other fields, but those same AIs will be producing slop in the case of alignment.
This should be reasonable on priors alone: hard-to-verify problems are a systematically different class than easy-to-verify problems, so it should not be a huge surprise if generalization failure occurs across that gap.
It also makes sense from the perspective of "what's incentivized by RL?". In hard to verify areas, evaluators have systematic and predictable biases; outputs which play to those biases can and sometimes will outscore actually-good outputs. So we should expect an RL'd system will be actively selected to produce slop which plays to those biases (as opposed to straightforwardly good things), e.g. sycophancy.
And we already see this phenomenon on easier problems today. Today's AI already do the most impressive things in e.g. math and programming competitions, the easiest cases for verification. The harder verification gets, the more they spit out slop. So just extrapolate what we already see to the more extreme regimes of stronger AI.
(To be clear, my median guess is that we're about one transformers-level paradigm shift away from strong AI still, and it makes a lot less sense to extrapolate today's AI to a different paradigm. But insofar as one expects LLMs to hit criticality, especially using RL, one should expect that they'll produce systematically more slop in harder to verify areas.)
Ah, yes, that is almost correct. You need redundancy over TWO distinct observables (i.e. the subset must be at least size two), not just one, but otherwise yes. With just one observable, you don't have two branches to dangle a Λ′ off of in the graphical proof, so we can't get Λ between two copies of Λ′.