A self-experiment in training "noticing confusion"

[-]ntroPi12y110

I find the qualitative reflections most enlightening, especially your remark: "But never in the course of this experiment did I count something that turned out to be unimportant."

Your under-confidence on that point may be very common, leading to thoughts like: "Yeah, noticing confusion is all nice, but I usually do that already. I'm fairly certain that I'm only missing some irrelevant confusion." Your experience suggests that there is no such thing as irrelevant confusion. The art is to notice as many confusions as humanly possible instead of just some.

I have never read a better motivation to go and actively try to notice confusion than this sentence. Thanks.

[-]VincentYu12y80

This is a well-written post! Upvoted.

A nitpick:

If I naïvely say that Week 1 establishes a true distribution for averaged weekly counts, then being more than 1σ above the mean for three weeks would have a probability of about p = (0.16)^3 = 0.0041 if that true count distribution remained constant.

Unfortunately, this p-value is poorly calibrated because the sampling errors in the estimates of the weekly means and σ are non-negligible compared to the value of σ.* We can obtain an accurate p-value by simulation. Under the null hypothesis of no change in counting frequency, the count for each day follows a Poisson distribution with mean = 150 counts / 35 days (I got 150 from adding up all the counts in the plot; there is some sampling error in this estimate, but its effect on the estimated p-value is negligible). From simulating 10^5 samples, I found 8296 samples with week 3–5 means that are all greater than the sum of the week 1 mean and week 1 SD. This gives p = 0.083.
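For readers who want to try this themselves, here is a minimal sketch of the kind of simulation described above, using only the Python standard library. It is not the original script. The function names, the seed, and the Knuth-style Poisson sampler are illustrative choices. In particular, the comment does not say whether "week 1 SD" means the sample SD of the seven daily counts or the standard error of the week 1 mean, so the sketch assumes the latter by default and exposes a flag for the former.

```python
import math
import random
import statistics


def poisson_draw(rng, lam):
    """Draw one Poisson(lam) variate using Knuth's multiplication method."""
    limit = math.exp(-lam)
    k, prod = 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def simulate_p_value(total=150, days=35, n_samples=100_000, seed=0,
                     use_standard_error=True):
    """Estimate P(week 3-5 means all exceed week 1 mean + week 1 spread)
    under the null of a constant daily Poisson rate of total / days.

    ASSUMPTION: "spread" is the week 1 sample SD divided by sqrt(7) when
    use_standard_error is True, and the raw sample SD of the seven daily
    counts otherwise. The source comment does not specify which was used.
    """
    rng = random.Random(seed)
    lam = total / days
    weeks = days // 7
    hits = 0
    for _ in range(n_samples):
        counts = [poisson_draw(rng, lam) for _ in range(days)]
        week_means = [statistics.fmean(counts[7 * w:7 * w + 7])
                      for w in range(weeks)]
        spread = statistics.stdev(counts[:7])
        if use_standard_error:
            spread /= math.sqrt(7)
        threshold = week_means[0] + spread
        # Weeks 3, 4 and 5 are indices 2, 3 and 4.
        if all(m > threshold for m in week_means[2:5]):
            hits += 1
    return hits / n_samples
```

Calling `simulate_p_value()` runs 10^5 simulated five-week periods and returns the fraction that meet the criterion. Whether that fraction lands near the 0.083 reported above depends on the SD convention, which is exactly the part this sketch has to guess.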

An alternative (and standard) way to get a p-value here is to use Kendall tau as a test statistic, which gives a non-parametric rank-based test for monotone association. The single-tailed Kendall tau gives p = 0.076.
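A rank-based test of this kind can be sketched as follows. This is an illustration under stated assumptions, not a reconstruction of how the 0.076 figure was obtained: the comment does not say which tau variant or which p-value method was used, so the sketch computes tau-a and swaps in a Monte Carlo permutation test for the one-sided p-value. All function names are hypothetical.

```python
import random


def kendall_s(x, y):
    """Number of concordant pairs minus discordant pairs (ties count zero)."""
    s = 0
    n = len(x)
    for i in range(n):
        for j in range(i + 1, n):
            prod = (x[j] - x[i]) * (y[j] - y[i])
            s += (prod > 0) - (prod < 0)
    return s


def kendall_tau_a(x, y):
    """Kendall tau-a: S divided by the total number of pairs."""
    n = len(x)
    return kendall_s(x, y) / (n * (n - 1) / 2)


def one_sided_permutation_p(x, y, n_perm=2000, seed=0):
    """Monte Carlo p-value for an increasing trend of y in x.

    ASSUMPTION: a permutation test stands in for whatever exact or
    asymptotic method the source comment used.
    """
    rng = random.Random(seed)
    observed = kendall_s(x, y)
    shuffled = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if kendall_s(x, shuffled) >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_perm + 1)
```

To apply it to the experiment, `x` would be the day index (1 to 35) and `y` the daily counts read off the plot in the post, which are not reproduced here.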

* ETA: Let me add more explanation for any reader who is not sure what's going on. The p-value in the post is (exactly) correct if the weekly mean and SD under the null hypothesis can be determined without any error. Unfortunately, we cannot do that—the best we can do is to estimate the weekly mean and SD using the week 1 mean and SD, so our estimates contain sampling errors. Often, we do not care about sampling errors when we are working with large samples because these errors are negligibly small compared to the SD. However, in this case, our sample has only n = 7, so sampling errors are non-negligible compared to the SD. This becomes a problem when we work with p-values because the null hypothesis is dependent on our estimates, but the errors in these estimates are not taken into consideration when we calculate the p-value. A common way to work around this is to use simulations, as I did. Alternatively, because our null hypothesis is rather simple, it might be feasible to use analytic methods to calculate a correct p-value.

[-]whales12y60

Thanks! Hardly a nitpick, I should really know better. It looks especially bad that my laziness/carelessness led to overstated results. 150 is the correct number of counts, and I agree with your calculation. Embarrassingly, I also screwed up the p-value for the sleep correlation, [EDIT] which I retracted briefly but now have fixed.

[-]LoganStrohl11y30

This is one of the most valuable things I've read in months. Thank you!

[-]whales11y20

Thanks, I'm glad you liked it!

Did someone link this recently? It seems to have gotten a new burst of votes.

[-]Thecommexokid11y20

Yes, Brienne herself posted it to Facebook (commenting "This post does not have nearly as many upvotes as it deserves") and Eliezer liked her post.

[-]WingedViper12y20

I have a (kind of) meta question: What's up with the "zir" and "zirself" in the text? I've never heard/read that word before and from context I'd infer that it should be "their" and "themselves". Would you clear that up?

[-]Creutzer12y80

"ze/zir/zirself" is an artificial gender-neutral third person singular pronoun.

[-]WingedViper12y20

Thanks for clearing that up. That was my guess, I was just confused that it suddenly popped up without me ever having heard about it. Is it popular/well-known? When I googled it, there were no hits for an explanation.

[-]fubarobfusco12y50

For more explanation, see Wikipedia.

[-]Creutzer12y40

I don't know, I've only ever seen it here and on Yvain's blog. Ironically, it doesn't really work for me as far as disrupting intuitive connections and depictions is concerned because it sounds pretty much like the German third person singular feminine pronoun. ;-)

[-]Thecommexokid11y00

While there have been many attempts at a set of such pronouns and none ever became standard, this is the set I see by far most commonly. Several non-gender-binary-identifying people I know use ze/zir/zirs as their preferred pronouns. They definitely crop up in many more places than just here and SlateStarCodex, as someone else replied, but it tends to be mostly in communities that have a particular focus on gender identity.

[-][anonymous]12y00

When I see a word I've never seen before, I google it. Here you go.

[This comment is no longer endorsed by its author]

[-]Thecommexokid11y10

At first, I didn't seem to exercise this skill on days where I wasn't doing cognitively demanding work, or when most of my work was not in an academic context (typically weekends). Over time, I began doing so more, although still less than on demanding academic days.

I know quite a bit of time has passed since you posted this, but do you recall any specific instances of non-cognitively-demanding weekend-type confusions you could share?

[-]whales11y10

I wrote down a handful as I was doing this, but not all of them. There were a couple about navigation (where rather than say "well, I don't know where I am, I'll just trust the group" I figured out how I was confused about different positions of landmarks). I avoided overbaking my cookies when the recipe had the wrong time written down. Analytics for a site I run pointed to a recent change causing problems for some people, and I saw the (slight) pattern right away but ignored it until it got caught on my confusion hook. It's also a nice hook for asking questions in casual conversations. People are happy to explain why they like author X but not the superficially similar author Y I've heard them complain about before, for example.

[-]katydee11y10

Extremely good post. I'd love to see more content like this on LessWrong.

[-]Colombi12y-10

Sorry for being nit-picky, but one thing here really bugs me.

I would recommend extreme caution when recording data you remember from the experience of a lucid dream. Even though you may have been conscious that you were unconscious, being in a dream-like state could distort what you remember. While I personally have little (okay, no) experience with lucid dreaming, it seems safe to assume that you might forget details of the dream after waking up and trying to recall it, especially if you wait days before trying to remember it. Obviously this is often the case with regular dreams, and while you could make the case that lucid dreams are more vivid and thus easier to remember, it's still too sketchy for me to take that as evidence without being heavily skeptical.

Otherwise, well done.

[-]glomerulus12y30

a) In my experience, lucid dreams are more memorable than normal dreams

b) You seem to assume that Whales completely forgot about the dream until they wrote this blog post, which is unlikely, because obviously they'd be thinking about it as soon as they woke up, and probably taking notes.

c) Whales already said that it hardly even constitutes evidence
