Explanations as Hard to Vary Assertions

[-]lsusr4y70

Modern telescopes contain automatic mechanisms that continuously change the shape of the mirror to compensate for the shimmering of the Earth's atmosphere.

Source? I've never heard of such a telescope. All the modern telescopes I've ever used have solid, fixed mirrors. I thought the way to get around atmospheric distortions is to put your telescope in orbit.

[-]Alexander4y150

Source (emphasis added by me):

Large ground based telescopes can make images as sharp as or sharper than the Hubble Space Telescope, but only if atmospheric blurring is corrected. Previously, the deformable mirrors available to do this were small, flat, and relatively inflexible. They could be used only as part of complex instruments attached to conventional telescopes.
But in this new work, one of the two mirrors that make up the telescope optics is used to make the correction directly. The new secondary mirror makes the entire correction with no other optics required, making for a more efficient and cleaner system.
Like other secondary mirrors, this one is made of glass over 2 feet in diameter and is a steeply curved dome shape. But under the surface, it is like no other. The glass is less than 2 millimeters thick (less than eight-hundredths of an inch). It literally floats in a magnetic field and changes shape in milliseconds, virtually real-time. Electro-magnetically gripped by 336 computer-controlled "actuators" that tweak it into place, nanometer by nanometer, the adaptive secondary mirror focuses star light as steadily as if Earth had no atmosphere. Astronomers can study precisely sharpened objects rather than blurry blobs of twinkling light.

[-]lsusr4y60

Cool!

[-]TAG4y50

As I read through Rationality: A-Z, I kept seeing similarities to David Deutsch’s worldview.

Deutsch is really opposed to induction , though.

[-]Alexander4y70

Thank you for pointing this out, by the way. This is an important nuance. I just read this: Simple refutation of the ‘Bayesian’ philosophy of science.

By ‘Bayesian’ philosophy of science I mean the position that (1) the objective of science is, or should be, to increase our ‘credence’ for true theories, and that (2) the credences held by a rational thinker obey the probability calculus. However, if T is an explanatory theory (e.g. ‘the sun is powered by nuclear fusion’), then its negation ~T (‘the sun is not powered by nuclear fusion’) is not an explanation at all. Therefore, suppose (implausibly, for the sake of argument) that one could quantify ‘the property that science strives to maximise’. If T had an amount q of that, then ~T would have none at all, not 1-q as the probability calculus would require if q were a probability.
Also, the conjunction (T₁ & T₂) of two mutually inconsistent explanatory theories T₁ and T₂ (such as quantum theory and relativity) is provably false, and therefore has zero probability. Yet it embodies some understanding of the world and is definitely better than nothing.
Furthermore if we expect, with Popper, that all our best theories of fundamental physics are going to be superseded eventually, and we therefore believe their negations, it is still those false theories, not their true negations, that constitute all our deepest knowledge of physics.
What science really seeks to ‘maximise’ (or rather, create) is explanatory power.

And I am now really confused and conflicted. I would love it if someone could enlighten me on how Deutsch's definition of explanation (hard-to-vary assertions about reality) and Bayesian probability conflict with each other. I am missing something very subtle here.

For context, I am aware of Popper and falsification, but wouldn't a theory eventually become practically falsified within Bayesian updating if there is enough evidence against it?

[-]Yoav Ravid4y60

I read that too a some time ago and he makes a really basic error, which made me lose some respect for him (If I was able to catch that error surely he should have, and if he didn't, then he should have heard a correction and corrected it by now).

The error is the assumption that what Bayes does is compare between H and !H, or to take his example, ‘the sun is powered by nuclear fusion’ VS ‘the sun is not powered by nuclear fusion’. What the math really says you should do, is compare all possible hypothesis, so the term !H isn't itself an explanation/hypothesis, it's the sum of all other explanations/hypotheses.

I think Abram Demski (which, unlike me, is actually qualified to talk about this stuff) talked about this error in Bayes' Law is About Multiple Hypothesis Testing (though not directly referring to Deutsch).

I don't know if Bayes and Deutsch view of explanation actually conflict. It feels to me like he kinda wants them to conflict.

[-]Alexander4y*60

Wow, this is honestly baffling. It sounds as if Deutsch doesn't know about the generalised form of Bayes' theorem (I'm sure he does know, which makes me feel worse).

You make an excellent point. Bayes' theorem can be applied to all possible hypotheses, not just $H$ and $\neg H$ .

If a top physicist can be this biased, then I cannot be surprised by anything anymore.

Thank you very much for your response Yoav Ravid.

[-]Vyacheslav Ladischenski (Slava)8mo32

Deutsch's objection is not to Bayes' theorem itself but to the idea that updating numbers is what science is about. In his Popperian picture, knowledge grows through explanatory creativity and critical elimination, and the notion that evidence confirms or raises the probability of a sweeping theory is, literally, impossible.

[-]TAG8mo20

Partly. But you can use Bayes to support induction, which is another problem for Popperians.

[-]TAG4y*30

Bayes can explain why negative, disconfirmatory evidence counts more than positive support, and so sport a version of falsificationism. But it can't rule out positive support, so doesn't imply the more extreme Popperian doctrine that there is no justification.

A hard-to-vary explanation is a minimal explanation, one with no redundant parts. So hardness-to-vary is a simplicity criterion, a form of Occam's razor. Compared to the simplicity criterion favoured by Bayesians, programme length, it is rather subjective. Neither criterion answers the hard problem,the problem of why simplicity implies truth. But Deutsch is more interested in Knowledge , which is left very vaguely defined.

In theory, Bayes is is about adjusting the credences of Every Possible Hypothesis. In practice, you don't know every possible hypothesis, so there is some truth to Deutch's claim that not-H is a blob ... you might be able to locate some hypotheses other than H, but you have no chance of specifying all infinity.

Bayesians tend to be incurious about where hypotheses come from. That's one of Chapman's criticisms, that Bayes isn't a complete epistemology because it can't generate hypotheses. Popperians , by contrast, put a lot of emphasis on hypothesis-formation as a an informal, non-mechanistic process.

[-]Alexander4y30

Good points. There were several chapters in Rationality: A-Z dedicating to this. According to Max Tegmark's speculations, all mathematically possible universes exist, and we happen to be in one described by a simple Standard Model. I suspect that this question about why simple explanations are so effective in this universe is unanswerable but still fun to speculate about.

Good points about the lack of emphasis on hypothesis-formation within the Bayesian paradigm. Eliezer talks about this a little in Do Scientists Already Know This Stuff?

Sir Roger Penrose—a world-class physicist—still thinks that consciousness is caused by quantum gravity. I expect that no one ever warned him against mysterious answers to mysterious questions—only told him his hypotheses needed to be falsifiable and have empirical consequences.

I long for a deeper treatment on hypothesis-formation. Any good books on that?

[-]TAG4y10

. I suspect that this question about why simple explanations are so effective in this universe is unanswerable but still fun to speculate about.

What does "effective" mean? If you are using a simplicity criterion to decide between theories that already known to be predictive , as in Solomonoff induction, then simplicity doesn't buy you any extra predictiveness.

[-]Vyacheslav Ladischenski (Slava)8mo10

According to DD evidence doesnt "confirm" anything. It never justifies belief or increases probability of theory being right.

Evidence can only falsify a theory outright. Or it can fail to find a flaw, leaving the theory "unrefuted for now."

There is no middle state in which evidence makes the theory more likely true.

[-]Alexander4y*40

Oh yes, I didn't mention the differences between the worldview presented in Rationality: A-Z and that of David Deutsch.

For example, Deutsch is strongly opposed to the dogmatic nature of Empiricism, which is the sixth virtue of rationality in the LessWrong worldview. My take is that Deutsch believes that explanatory theories are more foundational to our understanding of reality than our experiences or observations. He asserts that we interpret our experiences and observations of reality through explanatory theories. He further asserts that experiences and observations are not the sources of our theories. For example, Einstein came up with Relativity with no direct observational data, Einstein didn’t use the perihelion precession of Mercury. Instead, experiences and observations are what we use to judge competing explanatory theories.

I don't feel too strongly either way at this point in my journey. I think Deutsch makes a good point, but so does Eliezer. I will probably start to feel more strongly about this in one direction or the other as I study more science.

[-]AnthonyC4y30

Whenever I find myself in a situation where I'm around people arguing about -isms or definitions, I usually find that the meaningful parts of the disagreement get hidden in the small words in the sentences. Like when I try to find a concise definition of empiricism, I'm told it's that "all knowledge is derived from sense-experience." Well, what does "derived from" mean? That phrase can easily include all of epistemic rationality. What does "all" mean? Obviously some level of information comes from our genes instead, but is that "knowledge"? And is knowledge quantitative or categorical? What is sense-experience? Does it include every bit of physical or chemical information that affects our biology from the moment of conception, or only what registers to our conscious awareness through the traditional five senses, or something else?

In other words, I'm saying it's very important that EY labeled the sixth virtue "empiricism," and not "Empiricism." That capital "E" can hide a lot of assumptions. And, of course, the he labeled empiricisim the sixth virtue, after argument and four others. I'm also saying that in many of the cases where the structure of language forces us to use words as if they drew fairly firm boundaries, the underlying reality is often continuous and nebulous.

[-]Alexander4y*40

In a literal sense, Eliezer said, "The roots of knowledge are in observation." If we took this statement in isolation to Deutsch, he would vehemently disagree and tell us, "No, we interpret observations through explanatory theories." However, I don't think Eliezer and Deutsch disagree here. Both agree that there is a map and a territory and that the map comprises models, i.e., explanatory theories.

[-]dedz4y*40

If there is some evidence E that the assertion A can't explain, then the likelihood P(E|A) will be tiny. Thus, the numerator P(E|A)P(A) will also be tiny, and likewise the posterior probability P(A|E). Updating on the near impossibility of evidence E has driven the probability of the assertion A

This isn't quite right. The tiny probability of an observation given the hypothesis does not imply that the posterior of the hypothesis will be low. Suppose there's a lottery with 10 million tickets. We have very good reasons to believe the lottery is fair. Still, whoever the winner X is, P(X is the winner|The lottery is fair) = 1/10000000. The reason P(The lottery is fair|X is the winner) is not low is that the alternative hypothesis "The lottery is not fair" also does a poor job at predicting the result (why rigged in favor of X specifically and not the other 9999999 people?) and the prior on P(The lottery is not fair) is very low. Ok, but what about the hypothesis "The lottery is 100% rigged in favor of X"? The probability that X is the winner given this alternative is 1. But the prior on that hypothesis is basically zero, so it doesn't matter. (Things are different if we have reasons to think X is suspicious. Then the fact that X won is a good reason to suspect the lottery isn't fair.)

tl;dr: The posterior P(H1|E) is tiny iff P(H1)P(E|H1) is tiny relative to all other P(Hi)P(E|Hi).

[-]Alexander4y30

I agree with you here. I made a mistake but on the bright side, I learnt a lot about the generalised form of Bayes' theorem which applies to all possible hypotheses. This was also how Eliezer explained this relationship between the posterior and the numerator in Decoherence is Falsifiable and Testable. I was trying to simplify the relationship between Bayes' theorem and Deutsch's criterion for good explanations for the sake of the post but I oversimplified too much.

I still think that Bayes' theorem and Deutsch's criterion for good explanation are compatible and in a practical sense, one can be explained in terms of the other but, using the generalised form of Bayes is necessary.

I updated my post to explain that this part is slightly incorrect.

[-]dedz4y30

It seems that he makes the same mistake in that post (though he makes it clear in the rest of the essay that alternatives matter). You paraphrased him right.

Incidentally, Popper also thought that you couldn't falsify a theory unless we have a non-ad hoc alternative that explains the data better.

[-]Alexander4y30

Incidentally, Popper also thought that you couldn't falsify a theory unless we have a non-ad hoc alternative that explains the data better.

This is so interesting. Do you know where I can read more about this? Conjectures and Refutations?

[-]Charlie Steiner4y40

I think focusing on the phenomenon of "explanation" is pretty helpful not just in science but also in philosophy - there are lots of places where people say they want an explanation for this or that thing, but what they mean can vary from case to case and person to person. But for this more general sort of explanation, I don't think the definition of "hard to vary model of the world" works, there needs to be more of a social / psychological perspective.

We are not special; we share more than half our genes with a banana.

This is more apparent for some people I know than others. (As the joke goes)

[-]Alexander4y*30

I tend to agree. It isn't easy to generalise what entails a successful explanation, especially as one goes higher up the layers of abstraction (as you've put it) or further out to the more infeasibly testable realm.

What do you think is an elegant way to define the phenomenon of explanation that is more general than "hard-to-vary assertions about reality"?

[-]Charlie Steiner4y40

I'm not sure there's a neat form. Consider the explanation of why a mirror flips left and right but not up and down. Maxwell's equations predict mirrors just fine, but it's certainly not what people (well, most people) want from this explanation. Even if we try to be elegant we'll probably have yo say complicated words like "the listener's understanding".

[-]Yoav Ravid4y40

[-]Alexander4y*40

This is a fascinating critique of David Deutsch ~~and The Beginning of Infinity~~ by one of his former colleagues.

It is ironic that Deutch sees himself as an expert on counter-dogma, yet he is dogmatic about his convictions. Cultish Countercultishness springs to mind.

[-]Yoav Ravid4y20

That link seems much more a critique of Deutsch than The Beginning of Infinity. Except the part on misquotations, which is actually its own post.

[-]Alexander4y30

I agree, it is more a critique of Deutsch as a person than of the book. I still think it is a good book overall.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

17

Explanations as Hard to Vary Assertions

17

17

Background

Overview

Good Explanations

Bad Explanations

Dogma

Prejudice

Modes of Explanations

Science and Humanity

Conclusion