Occam's Razor, Complexity of Verbal Descriptions, and Core Concepts

[-]Manfred15y90

Your description of Occam/Ockham's razor is wrong - "entities must not be multiplied beyond necessity" is one common statement. This would give equal chances to both storms and sea monsters (barring, e.g. the separate observation of storms and the lack of observation of sea monsters), though it gives a greater chance to sea monsters than green scaly sea monsters.

Modern science uses a few variations on Occam's razor that add the requirement that you don't pull any information out of thin air, mostly captured by the Einstein quote "Make everything as simple as possible, but not simpler."

And here at LW we often use a quantitative measurement of simplicity called Kolmogorov complexity, which is how complicated a computer has to be before it can output your hypothesis. Not in natural language, but in terms of actual properties.

The reason it makes sense to act as if natural language is how we should describe things is because when natural language reflects things we've already seen, it's simpler (in terms of properties) to make hypotheses about the whole universe that reuse parts, rather than hypotheses that have lots of new parts all the time - each of your mini-hypotheses is really part of the bigger hypothesis "what is the universe like?"

But since the correspondence between natural language and "stuff we've already seen" isn't perfect, this breaks down in places. For example, in natural language, the hypothesis "god did it" is almost unsurpassed in simplicity. The fossil record, rainbows, why light things fall as fast as heavy things in a vacuum. The reason Occam's razor does not suggest that "god did it" is the best explanation for everything is because god is a very complicated concept despite being a short word. So when you use something like Kolmogorov complexity that measures the size of concepts rather than the number of letters, you get evolution, diffraction, and gravity.

[-]DanielLC15y50

The reason Occam's razor does not suggest that "god did it" is the best explanation for everything is because god is a very complicated concept despite being a short word.

It's more because "it" is a very complicated concept.

[-]Johnicholas15y50

Kolmogorov complexity is not used at LessWrong; it is not used anywhere because it is uncomputable. Approximations of Kolmogorov complexity (replacing the Turing machine in the definition with something weaker) do not have the same nigh-magical properties that Kolmogorov complexity would have, if it were available.

[-]endoself15y10

Kolmogorov complexity is computable for some hypotheses, just not all (for each formal axiomatic system, there is an upper bound to the complexity of hypotheses that can have their complexity determined by the system). Anyways, while we can never use Kolmogorov complexity to analyze all hypotheses, I believe that Manfred merely meant that we use it as an object of study, rather than to implement full Solomonoff induction.

[-]TCB15y10

I am aware that my definition of Occam's razor is not the "official" definition. However, it is the definition which I see used most often in discussions and arguments, which is why I chose it. The fact that this definition of Occam's razor is common supports my claim that humans consider it a good heuristic.

Forgive me for my ignorance, as I have not studied Kolmogorov complexity in detail. As you suggest, it seems that human understanding of a "simple description" is not in line with Kolmogorov complexity.

I think the intention of my post may have been unclear. I am not trying to argue that natural language is a good way of measuring the complexity of statements. (I'm also not trying to argue that it's bad.) My intention was merely to explore how humans understand the complexity of ideas, and to investigate how such judgements of complexity influence the way typical humans build models of the world.

The fact that human understanding of complexity is so far from Kolmogorov complexity indicates to me that if an AI were to model its environment using Kolmogorov complexity as a criterion for selecting models, the model it developed would be different from the models developed by typical humans. My concern is that this disparity in understanding of the world would make it difficult for most humans to communicate with the AI.

[-]Manfred15y00

As you suggest, it seems that human understanding of a "simple description" is not in line with Kolmogorov complexity.

Rather than this, I'm suggesting that natural language is not in line with complexity of the "minimum description length" sort. Human understanding in general is pretty good at it, actually - it's good enough to intuit, with a little work, that gravity really is a simpler explanation than "intelligent falling, " and that the world is simpler than solipsism that just happens to replicate the world. Although humans may consider verbal complexity "a good heuristic," humans can still reason well about complexity even when the heuristic doesn't apply.

[-]prase15y80

The Swadesh list isn't aimed to provide the most basic concepts, but rather words that are likely to survive without changes of meaning. For not so closely related languages it may be difficult to establish what words are actually cognates; cf. German haben and Latin habere with the same meaning aren't in fact cognates - the actual German cognate of habere is geben and the Latin cognate of haben is capere. Therefore, when linguists want to establish regular phonological correspondences between two related languages, they have to rely on words which are likely to retain their meaning. Those words are usually numerals, concrete nouns, personal pronouns and some well defined concepts, as "cold" or "big". Actually the list was designed to establish a well-defined measure of the rate of phonological change. Concepts like snake, dog, knee, road, dirty or belly are likely to be expressed by single words and not change their meaning substantially over time, but they are hardly "basic concepts" as an AI designer would probably understand the term.

[-][anonymous]15y30

[-]Vladimir_Nesov15y20

Occam's razor, as it is popularly known, states that "the simplest answer is most likely to be correct"1. It has been noted in other discussion threads that the phrase "simplest description" is somewhat misleading, and that it actually means something along the lines of "description that is easiest to express concisely using natural language".

"A witch did it" seems to qualify. See Occam's Razor.

[-]JoshuaZ15y00

The idea of using the core vocab as a measure of hypothesis complexity is a really interesting one. Like many potentially good ideas it is obvious in hindsight. But, I'm not sure that using such a vocab to communicate with an AI is necessarily a good idea. Many words on the Swadesh list are extremely concrete and thus don't touch much on the really tricky part of communicating with an AI (or at least are less of a problem.) However, others on the list are so complicated that defining them would almost be equivalent to solving FAI and other problems besides. That is, "good", "bad", "because", and "name" are going to be really difficult.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

8

Occam's Razor, Complexity of Verbal Descriptions, and Core Concepts

8

8