Efficient Induction

You know more about this than me - is the three-body problem computable by the technical definition?

Ok. I really want to know why this question got voted down. It seemed like a perfectly reasonable question for someone to ask if they don't know much about these subjects and have heard that the three body problem is unsolvable.

[-]paulfchristiano15y20

You can numerically integrate the three-body problem. So there is a linear time algorithm to approximately compute what will happen after linear time. There just isn't a logarithmic time algorithm (which is what we would normally mean by "closed form").

[-]Manfred15y00

Ah, okay. So all that matters is that the right answer is computable given infinite resources. How do you reconcile this with the apparently finite computational resources of the universe?

[-]Sniffnoy15y40

To be clear - the three-body problem deals with real numbers. Computers can't work directly with real numbers, since real numbers can't be finitely represented. Hence when we speak about the computability of a problem "compute f(x)" that calls for a real number answer, we really mean the computability of the problem "compute f(x) to within epsilon" (where now epsilon is another input to the problem).

[-]paulfchristiano15y00

You can compute what will happen efficiently. If you want to know where the 3 bodies are after t seconds to within some constant error, it only takes you about O(t) steps.

[-]rwallace15y10

But errors accumulate over time. That means if you want the error to be bounded by a constant at the end of the calculation, the larger t is, the more you have to do along the way to avoid or correct error. If that just means you have to use higher precision calculations, that would give O(t log t), but if it means you have to use finer time steps, that would make the computational effort O(t^2) or worse.

[-]JoshuaZ15y00

This is primarily a function of how accurately you can do division and multiplication. Even if it isn't O(t) it is almost probably O(t^(1+epsilon)) for any epsilon>0.

[-]rwallace15y70

Suppose for the sake of argument you had infinite precision division and multiplication. There would still be finite error due to the use of finite time step size. (Runge-Kutta methods etc. give smaller error for a given step size than the more obvious numerical integration methods, but still not zero.) If you want to reduce the error, you need to use a smaller step size. Generally speaking, the error is a polynomial function of the step size (unlike arithmetic error, which decreases exponentially with the number of digits of precision), so you would expect O(t^(1+epsilon)) to be unattainable. Unless there's some method of reducing error exponentially with step size for the three body problem that I'm missing?

[-]paulfchristiano15y40

Runge-Kutta takes O(delta^(-1/4)) time to get an approximation quality of delta, I think. I don't know if we can yet, but I suspect is is possible to get an approximation quality of delta in time O(delta^(-epsilon)) for any epsilon>0 (in the same sense that I suspect it will eventually possible to multiply two nxn matrices in time O(n^(2 + epsilon)) for any epsilon>0, even though its not practical at all). This would probably imply JoshuaZ's stated time bound. It doesn't require exponentially fast error reduction, just arbitrarily good polynomial error reduction.

Anyway, the model I described in the post doesn't actually have this problem. More precision just comes from using a finer discrete system to approximate the universe (if in fact it is continuous, which I would put a probability of less than 50% on) and still using a linear size circuit to do the simulation. You only pay logarithmically for using a finer grid, in any of the schemes I proposed.

[-]rwallace15y20

An infinite sequence of algorithms converging on arbitrarily good polynomial error reduction? Fair enough, I certainly can't rule that out at this stage.

But I don't understand your last point: how can you pay only logarithmically for using a finer grid?

[-]paulfchristiano15y30

The post had a concrete complexity measure, which pays logarithmically for a finer grid (that is, doubling the size of the universe is the same as adding one more bit of complexity). The point is, you can only afford to pay logarithmically in the size of the universe (if you want known physical theories to have good complexity as compared to stupid explanations for our observations). Making the grid twice as fine is just making the universe twice as large, so you only pay 1 more bit: the extra bit needed to describe the larger size of the universe. If you disagree with this then you probably disagree fundamentally with my approach. That is obviously valid; I don't really like my approach that much. But alternative approaches, like the speed prior, seem much worse to me.

[-]rwallace15y00

Oh, sorry, yes, when your cost measure is complexity, then a finer grid incurs at most a logarithmic penalty, I agree. I also agreed the speed prior is a much worse approach -- I would go so far as to say it is flat-out falsified by the observed extreme computational cost of physics.

[-]JoshuaZ15y30

Hmm, yes you are correct. I was being stupid.

[-]Manfred15y-10

Well, it's simple to find a chaotic problem that's not efficient. I was just trying to understand what "the universe is computable" really means since the universe isn't exactly computable.

[-]saturn15y20

It seems like you and some others in this thread are assuming that real numbers describe some actual behavior of the universe, but that's begging the question. If the universe is computable, it implies that all quantities are discrete.

[-]rwallace15y10

Well, if it turns out the universe is continuous, then when we conjecture it to be computable, we typically mean the same thing we mean when we say pi is computable: there exists a fixed length program that could compute it to any desired degree of precision (assuming initial conditions specified to sufficient precision).

[-]Manfred15y00

Continuous quantities are the simplest explanation for the evidence we have - there are some hints that it could be otherwise, but they're only hints.

[-]JoshuaZ15y30

We can sort of see what might cause someone to change their views on what their generic priors should look like by looking at semi-historical examples.

Early on in the theory of computable functions it seemed like all computable functions might be primitive recursive. Presumably, if one didn't know about things llike the Ackermann function, you'd have no issue phrasing a general prior in terms of primitive recursive functions.

Another example is actually in your text, where people realized that quantum systems seemed to be able to do things that non-quantum systems could not (within certain time bounds).

Thus. updating our general framework seems to occur when aspects of the universe around us suggest that modeling them requires a larger class of functions than we anticipated. In the case of primitive recursive functions, it turned out that empirically the human brain could calculate a specific function that wasn't primitive recursive.

For what it is worth, I don't share your confidence that priors won't need to be drastically re-examined. One issue that Eliezer and others have observed with the Solomonoff prior is that it assigns equal probability to different ideas regardless of the computational time. While privileging polynomial time descriptions might help, it isn't clear how one should distinguish between two Turing machines, one which runs in very short time (say degree 2) and another that is long (say degree 20) but the degree 2 has a much smaller number of states. Which one of those is simpler?

[-]Kazuo_Thow15y50

On the problem of distinguishing between Turing machines of the kinds you mentioned, does Jürgen Schmidhuber's idea of a speed prior help at all? Searching for "speed prior" here on Less Wrong didn't really turn up any previous discussion.

[-]timtyler15y00

I discuss that concept here: http://alife.co.uk/essays/the_one_true_razor/

[-]JoshuaZ15y00

Hmm, I had not seen the speed prior before. It seems to make strong testable predictions about how the universe functions. I'll have to look into it.

[-]paulfchristiano15y20

While privileging polynomial time descriptions might help, it isn't clear how one should distinguish between two Turing machines, one which runs in very short time (say degree 2) and another that is long (say degree 20) but the degree 2 has a much smaller number of states. Which one of those is simpler?

I have given one answer in the post, selected somewhat arbitrarily (but with the desirable property of working more or less correctly for physical theories we have seen so far). I think basing things on circuits rather than TM's is clearly a better start.

For one thing, I don't know what "degree 2" means for you. Is that in terms of the size of the universe? The # of the current observation? Both of these have significant problems (the first fails if the universe gets padded with junk, the second fails if your observations don't begin at the beginning of time).

For a second thing, Turing machines are a horrible way to measure the degree of polynomial run-times. There is basically no reason to believe that you get out a reasonable thing, unlike with circuits. Of course, using circuits introduces other issues, which require new solutions. I suggest one.

Of course, the post just contains my best try. I would be happy to see a better go at it, but I would be very surprised if it was based on any inherently uniform model of computation which is known today.

[-]JoshuaZ15y00

I was thinking in terms of number of observations. I agree that this has problems.

[-]taw15y00

You're using phrases like "universe is computable in polynomial time", and I understand your intuition, but such phrases don't really mean anything.

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

7

Efficient Induction

7

7