Optimization Process

[Intuitive self-models] 1. Preliminaries

I see! Thanks for the thoughtful response. I think my problem is caused by not having brought enough neuroscience and psychology textbooks to my armchair, leaving me in too-many-plausible-hypotheses-land, rather than your too-few-. I'll take another stab at this sequence if/when I collect more background knowledge!

[Intuitive self-models] 1. Preliminaries

Optimization Process11d60

I've read about half of this sequence, and it's certainly the most palatable, well-founded-seeming discussion of consciousness I've ever encountered.

But... I've kind of run aground on the question: how would I tell if this is true? (Or, you know, all models are false etc., but how would I tell if this is useful?)

Three examples of how a theory can useful: "Hey, I came up with this new theory of blurtzian phenomena! ...

Make predictions: "...The literature has catalogued 347 kinds of blurtz, but under this model, there should be at least two more, with the following characteristics: [...]"
Distill: "...The literature has catalogued 351 kinds of blurtz with various complicated characteristics, but under this model, all those complicated characteristics are pretty closely retrodicted by modeling each of the (3^3 choose 2) blurtzes as being the interaction of [...]"
Babble: "...The literature has a couple different models of blurtzes, all with various open questions. Here's one more. It's not obviously right, but it's another promising direction to go."

This sequence doesn't feel like (1) or (2) to me. Is it (3), or something else?

Optimization Process's Shortform

Optimization Process2mo20

Heuristic: distrust any claim that's much memetically fitter than its retraction would be. (Examples: "don't take your vitamins with {food}, because it messes with {nutrient} uptake"; "Minnesota is much more humid than prior years because of global-warming-induced corn sweat"; "sharks are older than trees"; "the Great Wall of China is visible from LEO with the naked eye")

Histograms are to CDFs as calibration plots are to...

Optimization Process5mo10

It sounds like you're assuming you have access to some "true" probability for each event; do I misunderstand? How would I determine the "true" probability of e.g. Harris winning the 2028 US presidency? Is it 0/1 depending on the ultimate outcome?

Histograms are to CDFs as calibration plots are to...

Optimization Process5mo10

(Hmm. Come to think of it, if the y-axis were in logits, the error bars might be ill-defined, since "all the predictions come true" would correspond to +inf logits.)

Histograms are to CDFs as calibration plots are to...

Optimization Process5mo20

Ah-- I took every prediction with p<0.50 and flipped 'em, so that every prediction had p>=0.50, since I liked the suggestion "to represent the symmetry of predicting likely things will happen vs unlikely things won't."

Thanks for the close attention!

Histograms are to CDFs as calibration plots are to...

Optimization Process5mo60

I like the idea, but with n>100 points a histogram seems better, and for few points it's hard to draw conclusions. e.g., I can't work out an interpretation of the stdev lines that I find helpful.

Nyeeeh, I see your point. I'm a sucker for mathematical elegance, and maybe in this case the emphasis is on "sucker."

I'd make the starting point p=0.5, and use logits for the x-axis; that's a more natural representation of probability to me. Optionally reflect p<0.5 about the y-axis to represent the symmetry of predicting likely things will happen vs unlikely things won't.

(same predictions from my last graph, but reflected, and logitified)

Hmm. This unflattering illuminates a deficiency of the "cumsum(prob - actual)" plot: in this plot, most of the rise happens in the 2-7dB range, not because that's where the predictor is most overconfident, but because that's where most of the predictions are. A problem that a normal calibration plot wouldn't share!

(A somewhat sloppy normal calibration plot for those predictions:

Perhaps the y-axis should be be in logits too; but I wasn't willing to figure out how to twiddle the error bars and deal with buckets where all/none of the predictions came true.)

Histograms are to CDFs as calibration plots are to...