Sleeping Beauty Resolved?

[-]Radford Neal8y60

Thanks for presenting my take on Sleeping Beauty. Your generalization beyond my assumption that Beauty's observations on Monday/Tuesday are independent and low-probability are interesting.

I'm not as dismissive as you are of betting arguments. You're right, of course, that a betting argument for something having some probability could be disputed by someone who doesn't accept your ideas of decision theory. But since typically lots of people will agree with with your ideas of decision theory, it may be persuasive to some.

Now, I have to... (read more)

4Chris_Leong8y

I don't understand how the halfer makes the wrong bet. If we are talking about probabilities, then a), b) and c) all have a probability of 1/2 of occurring. If we want the probabilities to sum to 1, then we need to do the following: * If heads occurs, Monday always "counts" * If tails occurs, we need to flip a second coin to determine if Monday or Tuesday "counts". So b) is "It's Monday, and the coin landed tails and Monday counts" And c) is "It's Tuesday, and the coin landed tails and Tuesday counts" So b) and c) are exclusive, so c) doesn't override b). On the other hand, if b) and c) aren't exclusive, then then are both 0.5 instead. So b) being ignored wouldn't matter as c) would suffice by itself. The only way we get the wrong answer is if b) and c) overlap and are not 0.5. This makes no sense for the halfer model.

4Charlie Steiner8y

I agree that the Sailor's Child is the correct translation of Sleeping Beauty into a situation with no copies (Do you remember Psy-Kosh's non-anthropic problem?), but I think some people might even deny that any such translation exists.

2Confusion8y

Don't worry about not being able to convince Lubos Motl. His prior for being correct is way too high and impedes his ability to consider dissenting views seriously.

1ksvanhorn8y

In regards to betting arguments: 1. Traditional CDT (causal decision theory) breaks down in unusual situations. The standard example is the Newcomb Problem, and various alternatives have been proposed, such as Functional Decision Theory. The Sleeping Beauty problem presents another highly unusual situation that should make one wary of betting arguments. 2. There is disagreement as to how to apply decision theory to the SB problem. The usual thirder betting argument assumes that SB fails to realize that she is going to both make the same decision and get the same outcome on Monday and Tuesday. It has been argued that accounting for these facts means that SB should instead compute her expected utility for accepting the bet as Pr(H)⋅payoff(H)+2Pr(T)payoff(T). 3. Your own results show that the standard betting argument gets the wrong answer of 1/3, when the correct answer is 1/(3−p(x)). At best, the standard betting argument gets close to the right answer; but if Beauty is sensorily impoverished, or has just awakened, then p(x) can be sufficiently large that the answer deviates substantially from 1/3. BTW, I was a solid halfer until I read your paper. It was the first and only explanation I've ever seen of how Beauty's state of information after awakening on Monday/Tuesday differs from her state of information on Sunday night in a way that affects the probability of Heads. With regards to your "Sailor's Child" problem: It was not immediately obvious to me that this is equivalent to the SB problem. I had to think about it for some time, and I think there are some differences. One is, again the different answers of 1/3 versus 1/(3−p(x)). I've concluded that the SC problem is equivalent to a variant of the SB problem where (1) we've guaranteed that Beauty cannot experience the same thing on both Monday and Tuesday, and (2) there is a second coin toss that determines whether Beauty is awakened on Monday or on Tuesday in the case that the first coin toss comes up Head

[-]Charlie Steiner8y60

rising ways.

Here, you dropped this from the last bullet point at the end :)

A very clear walkthrough of full nonindexical conditioning. Thanks! I think there's still a big glaring warning sign that this could be wrong, which is the mismatch with frequency (and, by extension, betting). Probability is logically prior to frequency estimation, but that doesn't mean I think they're decoupled. If your "probability" has zero application because your decision theory uses "likeliness weights" calculated an entirely different way, I... (read more)

5Chris_Leong8y

MEE constraint?

4Charlie Steiner8y

"mutually exclusive and exhaustive." Usually just means the probabilities of AND-ing them is zero, and the total probability is one.

3ksvanhorn8y

There are no frequencies in this problem; it is a one-time experiment. That's not what I said; I said that probability theory is logically prior to decision theory. Yes; what's gone wrong is that you're misapplying the decision theory, or your decision theory itself breaks down in certain odd circumstances. Exploring such cases is the whole point of things like Newcomb's problem and Functional Decision Theory. In this case, it's clear that Beauty is going to make the same betting decision, with the same betting outcome, on both Monday and Tuesday (if the coin lands Tails). The standard betting arguments use a decision rule that fails to account for this. See my response to Dacyn below ("Classical propositions are simply true or false..."). Classical propositions do not change their truth value over time.

3Charlie Steiner8y

One can do things multiple times. I tried to get at this in the big long paragraph of "'Monday' is an abstraction, not a fundamental." There is no such thing as a measurement of absolute time. When someone says "no, I mean to refer to the real Monday," they are generating an abstract model of the world and then making their probability distributions within that model. But then there still have to be rules that cash your nice absolute-time model out into yucky relative-time actual observables. It's like Solomonoff induction. You have a series of data, and you make predictions about future data. Everything else is window dressing (sort of). But it's not so bad. You can have whatever abstractions you want, as long as they cash out to the right thing. You don't need time to actually pass within predicate logic. You just need to model the passage of time and then cash the results out. It's also like how probability distributions are not about what reality is, they are about your knowledge of reality. "It is Monday" changes truth value depending on the external world. But P(It is Monday | Information)=0.9 is a perfectly good piece of classical logic. In fact, this exactly the same as how you can treat P(H)=0.5, even though classical propositions do not change their truth value when you flip over a coin. I dunno, putting it that way makes it sound simple. I still think there's something important in my weirder rambling - but then, I would.

[-]Chris_Leong8y50

I wrote up a response to this, but I thought it was also worthwhile writing a comment that directly responds to the argument about whether we can update on a random bit of information.

@travisrm89 wrote:

How can receiving a random bit cause Beauty to update her probability, as in the case where Beauty is an AI? If Beauty already knows that she will update her probability no matter what bit she receives, then shouldn't she already update her probability before receiving the bit?

Ksvanhorn responds by pointing out that this assumes that the probabilities a... (read more)

1ksvanhorn7y

That would be a valid description if she were awakened only on one day, with that day chosen through some unpredictable process. That is not the case here, though. What you're doing here is sneaking in an indexical -- "today" is either Monday if Heads, and "today" is either Monday or Tuesday if Tails. See Part 2 for a discussion of this issue. To the extent that indexicals are ambiguous, they cannot be used in classical propositions. The only way to show that they are unambiguous is to show that there is an equivalent way of expressing that same thing that doesn't use any indexical, and only uses well-defined entities -- in which case you might as well use the equivalent expression that has no indexical.

[-]Lukas Finnveden8y50

Insofar as I understand, you endorse betting on 1:2 odds regardless of whether you believe the probability is 1/3 or 1/2 (i.e., regardless of whether you have received lots of random information) because of functional decision theory.

But in the case where you receive lots of random information you assign 1/3 probability to the coin ending up heads. If you then use FDT it looks like there is 2/3 probability that you will do the bet twice with the outcome tails; and 1/3 probability that you will do the bet once with the outcome heads. Therefore, you should b... (read more)

2Lukas Finnveden8y

Actually, I realise that you can get around this. If you use a decision theory that assumes that you are deciding for all identical copies of you, but that you can't affect the choices of copies that has diverged from you in any way, math says you will always bet correctly.

1ksvanhorn7y

Yes, that is shown in Part 2.

1ksvanhorn8y

As I understand it, FDT says that you go with the algorithm that maximizes your expected utility. That algorithm is the one that bets on 1:2 odds, using the fact that you will bet twice, with the same outcome each time, if the coin comes up tails.

1Lukas Finnveden8y

I agree with that description of FDT. And looking at the experiment from the outside, betting at 1:2 odds is the algorithm that maximizes utility, since heads and tails have equal probabilities. But once you're in the experiment, tails have twice the probability of heads (according to your updating procedure) and FDT cares twice as much about the worlds in which tails happens, thus recommending 1:4 odds.

[-]Dacyn8y50

Why do you say that there is no "now", "today", or "here" in classical logic? Classical logic is just a system of logic based on terms and predicates. There is no reason that "now", "today", and "here" can't be terms in the logic. Now presumably you meant to say that such words cause a statement to have different meanings depending on who speaks it. But why is this a problem?

3ksvanhorn8y

Classical propositions are simply true or false, although you may not know which. They do not change from false to true or vice versa, and classical logic is grounded in this property. "Propositions" such as "today is Monday" are true at some times and false at other times, and hence are not propositions of classical logic. If you want a "proposition" that depends on time or location, then what you need is a predicate---essentially, a template that yields different specific propositions depending on what values you substitute into the open slots. "Today is Monday" corresponds to the predicate A(t), where A(t)≜(dayOfWeek(t)=Monday). The closest we can come to an actual proposition meaning "today is Monday" would be ∀t.memories(t)=y⇒A(t) where y is some memory state and memories(t) means your memory state at time t.

7AlexMennen8y

In any particular structure, each proposition is simply true or false. But one proposition can be true in some structure and false in another structure. The universe could instantiate many structures, with non-indexical terms being interpreted the same way in each of them, but indexical terms being interpreted differently. Then sentences not containing indexical terms would have the same truth value in each of these structures, and sentences containing indexical terms would not. None of this contradicts using classical logic to reason about each of these structures. I'm sympathetic to the notion that indexical language might not be meaningful, but it does not conflict with classical logic.

6ksvanhorn8y

The point is that the meaning of a classical proposition must not change throughout the scope of the problem being considered. When we write A1, ..., An |= P, i.e. "A1 through An together logically imply P", we do not apply different structures to each of A1, ..., An, and P. The trouble with using "today" in the Sleeping Beauty problem is that the situation under consideration is not limited to a single day; it spans, at a minimum, both Monday and Tuesday, and arguably Sunday and/or Wednesday also. Any properly constructed proposition used in discussing this problem should make sense and be unambiguous regardless of whether Beauty or the experimenters are uttering the proposition, and whether they are uttering it on Sunday, Monday, Tuesday, or Wednesday.

5Dacyn8y

That's not how I understand the term "classical logic". Can you point to some standard reference that agrees with what you are saying? I skimmed the SEP article I linked to and couldn't find anything similar. You run into the same problems with any sort of pronouns or context-dependent reference, and as far as I know most philosophers consider statements like "the thing that I'm pointing at right now is red" to be perfectly valid in classical logic. The main point of classical logic is that it has a system of deduction based on axioms and inference rules. Are you saying that you think these don't apply in the case of centered propositions? Does modus ponens or the law of the excluded middle not work for some reason? If not, I'm not sure why it matters whether centered propositions are really a part of "classical logic" or not -- you can still use all the same tools on them as you can use for classical logic. Finally, if you accept the MWI then every statement about the physical world is a centered proposition, because it is a statement about the particular Everett branch or Tegmark universe that you are currently in. So classical logic would be pretty weak if it couldn't handle centered propositions!

1TAG8y

If classical logic means propositional calculus, then there are no predicates, and no ability to express time-indexed truths.

3Dacyn8y

At least according to SEP classical logic includes predicates. But in any case if you want to do things with the propositional calculus, then I see no difference between saying "Let P = 'Today is Monday' " and "Let P = 'Sleeping Beauty is awake on Monday' ". Both of them are expressing a proposition in terms of a natural language statement that includes more expressive resources than the propositional calculus itself contains. But I don't see why that should be a problem in one case but not in the other.

1TAG8y

The first case has a truth value that varies with time.

4Dacyn8y

And the second case has a truth value that varies depending on what Everett branch you are in. Does it matter?

1a gently pricked vein8y

There is a relevant distinction: the machinery being used (logical assignment) has to be stable for the duration of the proof/computation. Or perhaps, the "consistency" of the outcome of the machinery is defined on such a stability. For the original example, you'd have to make sure that you finish all relevant proofs within a period in Monday or within a period in NotMonday. If you go across, weird stuff happens when attempting to preserve truth, so banning non-timeless propositions makes things easier. You can't always walk around while doing a proof if one of your propositions is "I'm standing on Second Main". You could, however, be standing still in any one place whether or not it is true. ksvanhorn might call this a space parametrization, if I understand him correctly. So here's the problem: I can't imagine what it would mean to carry out a proof across Everett branches. Each prover would have a different proof, but each one would be valid in its own branch across time (like standing in any one place in the example above). I think a refutation of that would be at least as bizarre as carrying out a proof across space while keeping time still (note: if you don't keep time still, you're probably still playing with temporal inconsistencies), so maybe come up with a counterexample like that? I'm thinking something along the lines of code=data will allow it, but I couldn't come up with anything.

3Dacyn8y

Sure, but I don't think anyone was talking about problems arising from Sleeping Beauty needing to do a computation taking multiple days. The computations are all simple enough that they can be done in one day.

1a gently pricked vein7y

I'd say your reply is at least a little bit of logical rudeness, but I'll take the "Sure, ...". I was pointing specifically at the flaw* in bringing up Everett branches into the discussion at all, not about whether the context happened to be changing here. I wouldn't really mind the logical rudeness (if it is so), except for the missed opportunity of engaging more fully with your fascinating comment! (see also *) It's also nice to see that the followup to OP starts with a discussion of why it's a good/easy first rule to, like I said, just ban non-timeless propositions, even if we can eventually come with a workable system that deals with it well. (*) As noted in GP, it's still not clear to me that this is a flaw, only that I couldn't come up with anything in five minutes! Part of the reason I replied was in the hopes that you'd have a strong defense of "everettian-indexicals", because I'd never thought of it that way before!

1Dacyn7y

Hmm. I don't think I see the logical rudeness, I interpreted TAG's comment as "the problem with non-timeless propositions is that they don't evaluate to the same thing in all possible contexts" and I brought up Everett branches in response to that, I interpreted your comment as saying "actually the problem with non-timeless propositions is that they aren't necessarily constant over the course of a computation" and so I replied to that, not bringing up Everett branches because they aren't relevant to your comment. Anyway I'm not sure exactly what kind of explanation you are looking for, it feels like I have explained my position already but I realize there can be inferential distances.

1TAG7y

It's more “the problem with non-timeless propositions is that they don’t evaluate to the same thing in all possible context AND a change of context can occur in the relevant situation". No one knows whether Everett branches are, or what they are. If they are macroscopic things that remain constant over the course of the SB story, they are not a problem....but time still is, because it doesn't. If branching occurs on coin flips, or smaller scales, then they present the same problem as time indexicals.

1Dacyn7y

Right, so it seems like our disagreement is about whether it is relevant whether the value of a proposition is constant throughout the entire problem setup, or only throughout a single instance of someone reasoning about that setup.

[-]travisrm898y50

This is a very enlightening post. But something doesn't seem right. How can receiving a random bit cause Beauty to update her probability, as in the case where Beauty is an AI? If Beauty already knows that she will update her probability no matter what bit she receives, then shouldn't she already update her probability before receiving the bit?

6ksvanhorn8y

That's a good point, but let's consider where that principle comes from: it derives from the fact that Pr(A∣M)=Pr((A & B1) or … or (A & Bn)∣M)=∑iPr(A & Bi∣M)=∑iPr(A∣Bi,M)Pr(Bi∣M) where B1,…,Bn are mutually exclusive and exhaustive propositions. The second equality above relies on the fact that the Bi are MEE; otherwise we'd have to subtract a bunch of terms for various conjunctions (ANDS) of the Bi. But the set of propositions X2(y)≜R(y,M) or R(y,T), indexed by y, are not mutually exclusive. If y and y′ are remembered perceptions on different days, then both X2(y) and X2(y′) will be true.

3ksvanhorn8y

What I wrote above may be a bit misleading. The issue isn't that you have additional terms for conjunctions of the Bi, but that the weights Pr(Bi∣M) sum to more than 1. In particular, consider the case when AI Beauty gets exactly one bit of input. Then for y=0 or 1, Pr(X2(y)∣M)=12Pr(H∣M)+34Pr(not H∣M)=12⋅12+34⋅12=58 and 5/8+5/8=5/4>1. If we try the same decomposition as in my previous comment, then usingPr(H∣X2(y),M)=1/2.5=2/5, we find Pr(H∣M)=Pr((H & X2(0)) or (H & X2(1))∣M)=Pr(H & X2(0)∣M)+Pr(H & X2(1)∣M)−Pr(H & X2(0) & X2(1)∣M)=Pr(X2(0)∣M)⋅Pr(H∣X2(0),M)+Pr(X2(1)∣M)⋅Pr(H∣X2(1),M)−0=58⋅25+58⋅25=12 and everything is still consistent.

1Charlie Steiner8y

And yet if you just keep them split up as R(y,d) indexed by both y and d, the MEE condition holds. So if Beauty expected to get both the observations and be told the day of those observations, she would expect no net update of P(H). Huh. Does this mean that if being told only the content y makes an agent predictably update towards P(H)<0.5, being told only the day d makes your procedure predictably update towards P(H)>0.5?

[-]Jeff Jo8y40

You mis-characterize what Elga does. He never directly formulates the state M1, where Beauty is awake. Instead, he formulates two states that are derived from information being added to M1. I'll call them M2A (Beauty learns the outcome is Tails) and M2B (Beauty learns that it is Monday). While he may not do it as formally as you want, he works backwards to show that three of the four components of a proper description of state M1 must have the same probability. What he skips over, is identifying the fourth component (whose probability is now zero).

Wha... (read more)

3ksvanhorn8y

Your whole analysis rests on the idea that "it is Monday" is a legitimate proposition. I've responded to this many other places in the comments, so I'll just say here that a legitimate proposition needs to maintain the same truth value throughout the entire analysis (Sunday, Monday, Tuesday, and Wednesday). Otherwise it's a predicate. The point of introducing R(y,d) is that it's as close as we can get to what you want "it is Monday" to mean.

1Jeff Jo6y

Well, I never checked back to see replies, and just tripped back across this. The error made by halfers is in thinking "the entire analysis" spans four days. Beauty is asked for her assessment, based on her current state of knowledge, that the coin landed Heads. In this state of knowledge, the truth value of the proposition "it is Monday" does not change. But there is another easy way to find the answer, that satisfies your criterion. Use four Beauties to create an isomorphic problem. Each will be told all of the details on Sunday; that each will be wakened at least once, and maybe twice, over the next two days based on the same coin flip and the day. But only three will be wakened on each day. Each is assigned a different combination of a coin face, and a day, for the circumstances where she will not be wakened. That is, {H,Mon}, {T,Mon}, {H,Tue}, and {T,Tue}. On each of the two days during the experiment, each awake Beauty is asked for the probability that she will be wakened only once. Note that the truth value of this proposition is the same throughout the experiment. It is only the information a Beauty has that changes. On Sunday or Wednesday, there is no additional information and the answer is 1/2. On Monday or Tuesday, an awake Beauty knows that there are three awake Beauties, that the proposition is true for exactly one of them, and that there is no reason for any individual Beauty to be more, or less, likely than the others to be that one. The answer with this knowledge is 1/3.

[-]Dagon8y40

Interesting, but I disagree. I fully agree that the problem is ambiguous in that it doesn't define what the actual proposition is. I think different assumptions can lead to saying 1/3 or 1/2, but with deconstruction can be shown to always be 1/2. I don't think anything in between is reasonable, and I don't think any information is gained by waking up (which has a prior of 1.0, so no surprise value).

Probability is in the map, not the territory. It matters a lot what is actually being predicted, which is what the "betting" approa... (read more)

4ksvanhorn8y

My intuition rebels against these conclusions too, but if the analysis is wrong, then where specifically is the error? Can you point to some place where the math is wrong? Can you point to an error in the modeling and suggest a better alternative? I myself have tried to disprove this result, and failed.

7Dacyn8y

The whole calculation is based on the premise that Neal's concept of "full non-indexical conditioning" is a reasonable way to do probability theory. Usually you do probability theory on what you are calling "centered propositions", and you interpret each data point you receive as the proposition "I have received this data". Not as "There exists a version of me which has received this data as well as all of the prior data I have received". It seems really odd to do the latter, and I think more motivation is needed for it. (To be fair, I don't have a better alternative in mind.)

[-]Wei Dai8y110

It seems really odd to do the latter, and I think more motivation is needed for it.

This old post of mine may help. The short version is that if you do probability with "centered propositions" then the resulting probabilities can't be used in expected utility maximization.

(To be fair, I don’t have a better alternative in mind.)

I think the logical next step from Neal’s concept of “full non-indexical conditioning” (where updating on one's experiences means taking all possible worlds, assigning 0 probability to those not containing "a version of me which has received this data as well as all of the prior data I have received", then renormalizing sum of the rest to 1) is to not update, in other words, use UDT. The motivation here is that from a decision making perspective, the assigning 0 / renormalizing step either does nothing (if your decision has no consequences in the worlds that you'd assign 0 probability to) or is actively bad (if your decision does have consequences in those possible worlds, due to logical correlation between you and something/someone in one of those worlds). (UDT also has a bunch of other motivations if this one seems insufficient by ... (read more)

3Dacyn8y

Yeah, but the OP was motivated by an intuition that probability theory is logically prior to and independent of decision theory. I don't really have an opinion on whether that is right or not but I was trying to answer the post on its own terms. The lack of a good purely-probability-theory analysis might be a point in favor of taking a measure non-realist point of view though. To make clear the difference between your view and ksvanhorn's, I should point out that in his view if Sleeping Beauty is an AI that's just woken up on Monday/Tuesday but not yet received any sensory input, then the probabilities are still 1/2; it is only after receiving some sensory input which is in fact different on the two days (even if it doesn't allow the AI to determine what day it is) that the probabilities become 1/3. Whereas for decision-theoretic purposes you want the probability to be 1/3 as soon as the AI wakes up on Monday/Tuesday.

1ksvanhorn8y

That is based on a flawed decision analysis that fails to account for the fact that Beauty will make the same choice, with the same outcome, on both Monday and Tuesday (it treats the outcomes on those two days as independent).

1Dacyn8y

So you want to use FDT, not CDT. But if the additional data of which direction the fly is going isn't used in the decision-theoretic computation, then Beauty will make the same choice on both days regardless of whether she has seen the fly's direction or not. So according to this analysis the probability still needs to be 1/2 after she has seen the fly.

1ksvanhorn8y

There are several misconceptions here: 1. Non-indexical conditioning is not "a way to do probability theory"; it is just a policy of not throwing out any data, even data that appears irrelevant. 2. No, you do not usually do probability theory on centered propositions such as "today is Monday", as they are not legitimate propositions in classical logic. The propositions of classical logic are timeless -- they are true, or they are false, but they do not change from one to the other. 3. Nowhere in the analysis do I treat a data point as "there exists a version of me which has received this data..."; the concept of "a version of me" does not even appear in the discussion. If you are quibbling over the fact that Pdt is only the stream of perceptions Beauty remembers experiencing as of time t, instead of being the entire stream of perceptions up to time t, then you can suppose that Beauty has perfect memory. This simplifies things---we can now let Pd simply be the entire sequence of perceptions Beauty experiences over the course of the day, and define R(y,d) to mean "y is the first n elements of Pd, for some n"---but it does not alter the analysis.

8Wei Dai8y

This confuses me. Dacyn's “There exists a version of me which has received this data as well as all of the prior data I have received” seems equivalent to Neal's "I will here consider what happens if you ignore such indexical information, conditioning only on the fact that someone in the universe with your memories exists. I refer to this procedure as “Full Non-indexical Conditioning” (FNC)." (Section 2.3 of Neal2007) Do you think Dacyn is saying something different from Neal? Or that you are saying something different from both Dacyn and Neal? Or something else?

1ksvanhorn8y

None of this is about "versions of me"; it's about identifying what information you actually have and using that to make inferences. If the FNIC approach is wrong, then tell me what how Beauty's actual state of information differs from what is used in the analysis; don't just say, "it seems really odd."

1Dacyn8y

I responded to #2 below, and #1 seems to be just a restatement of your other points, so I'll respond to #3 here. You seem to be taking what I wrote a little too literally. It looks like you want the proposition Sleeping Beauty conditions on to be "on some day, Sleeping Beauty has received / is receiving / will receive the data X", where X is the data that she has just received. (If this is not what you think she should condition on, then I think you should try to write the proposition you think she should condition on, using English and not mathematical symbols.) This proposition doesn't have any reference to "a version of me", but it seems to me to be morally the same as what I wrote (and in particular, I still think that it is really odd to say that that it is the proposition she should condition on, and that more motivation is needed for it).

2Dagon8y

It's a useless and misleading modeling choice to condition on irrelevant data, and even worse to condition on the assumption the unstated irrelevant data is actually relevant enough to change the outcome. That's not what "irrelevant" means, and the argument that humans are bad at knowing what's relevant does _NOT_ imply that all data is equally relevant, and even less does it imply that the unknown irrelevant data has precisely X relevance. Wei is correct that UDT is a reasonable approach that sidesteps the necessity to identify a "centered" proposition (though I'd argue that it picks Sunday knowledge as the center). But I think it's _also_ solvable by traditional means just be being clear what proposition about what prediction is being assigned/calculated a probability.

5ksvanhorn8y

Strictly speaking, you should always condition on all data you have available. Calling some data D irrelevant is just a shorthand for saying that conditioning on it changes nothing, i.e., Pr(A∣D,X)=Pr(A∣X) . If you can show that conditioning on D does change the probability of interest---as my calculation did in fact show---then this means that D is in fact relevant information, regardless of what your intuition suggests. There was no such assumption. I simply did the calculation, and thereby demonstrated that certain data believed to be irrelevant was actually relevant.

[-]JeffJo2y*20

This paper starts out with a misrepresentation. "As a reminder, this is the Sleeping Beauty problem:"... and then it proceeds to describe the problem as Adam Elga modified it to enable his thirder solution. The actual problem that Elga presented was:

Some researchers are going to put you to sleep. During the two days[1] that your sleep will last, they will briefly wake you up either once or twice, depending on the toss of a fair coin (Heads: once; Tails: twice). After each waking, they will put you to back to sleep with a drug that makes you forget that wak

... (read more)

[-]agilecaveman8y20

I think this post is fairly wrong headed.

First, your math seems to be wrong.

Your numerator is ½ * p(y), which seems like a Pr (H | M) * Pr(X2 |H, M)

Your denominator is 1/2⋅p(y)+1/2⋅p(y)(2−q(y)), which seems like

Pr(H∣M) * Pr(X2∣H,M) + Pr(¬H∣M) * Pr(X2∣¬H,M), which is Pr(X2 |M)

By bayes rule, Pr (H | M) * Pr(X2 |H, M) / Pr(X2 |M) = Pr(H∣X2, M), which is not the same quantity you claimed to compute Pr(H∣X2). Unless you have some sort of other derivation or a good reason why you omitted M in your calculations: this isn’t really “solving” anything.

Second,... (read more)

3ksvanhorn8y

That's a typo. I meant to write Pr(H∣X2,M), not Pr(H∣X2). I'll have more to say soon about what I think is the correct betting argument. Until then, see my comment in reply to Radford Neal about disagreement on how to apply betting arguments to this problem. I said logically prior, not chronologically prior. You cannot have decision theory without probability theory -- the former is necessarily based on the latter. In contrast, probability theory requires no reference to decision theory for its justification and development. Have you read any of the literature on how probability theory is either an or the uniquely determined extension of propositional logic to handle degrees of certainty? If not, see my references. Neither Cox's Theorem nor my theorem rely on any form of decision theory. I'll repeat my response to Jeff Jo: The standard textbook definition of a proposition is a sentence that has a truth value of either true or false. The problem with a statement whose truth varies with time is that it does not have a simple true/false truth value; instead, its truth value is a function from time to the set {true,false}. In logical terms, such a statement is a predicate, not a proposition. For example, "Today is Monday" corresponds to the predicate P(t)≜(dayof(t)=Monday). It doesn't become a proposition until you substitute in a specific value for t, e.g. "Unix timestamp 1527556491 is a Monday." You have not considered the possibility that the usual decision analysis applied to this problem is wrong. There is, in fact, disagreement as to what the correct decision analysis is. I will be writing more on this in a future post. In fact, I explicitly said that at the instant of awakening, Beauty's probability is the same as the prior, because at that point she does not yet have any new information. As she receives sensory input, her probability for Heads decreases asymptotically to 1/2. All of this is just standard probability theory, conditioning on the new i

3Jeff Jo8y

You said: "The standard textbook definition of a proposition is a sentence that has a truth value of either true or false. This is correct. And when a well-defined truth value is not known to an observer, the standard textbook definition of a probability (or confidence) for the proposition, is that there is a probability P that it is "true" and a probability 1-P that it is "false." For example, if I flip a coin but keep it hidden from you, the statement "The coin shows Heads on the face-up side" fits your definition of a proposition. But since you do not know whether it is true or false, you can assign a 50% probability to the result where "It shows Heads" is true, and a 50% probability the event where "it shows Heads" is false. This entire debate can be reduced to you confusing a truth value, with the probability of that truth value. * On Monday Beauty is awakened. While awake she obtains no information that would help her infer the day of the week. Later in the day she is put to sleep again. During this part of the experiment, the statement "today is Monday" has the truth value "true", and does not have the truth value "false." So by your definition, it is a valid proposition. But Beauty does not know that it is "true." * On Tuesday the experimenters flip a fair coin. If it lands Tails, Beauty is administered a drug that erases her memory of the Monday awakening, and step 2 is repeated. During this part of the experiment, the statement "today is Monday" has the truth value "false", and does not have the truth value "true." So by your definition, it is a valid proposition. But Beauty dos not know that it is "false." In either case, the statement "today is Monday" is a valid proposition by the standard definition you use. What you refuse to acknowledge, is that it is also a proposition that Beauty can treat as "true" or "false" with probabilities P and 1-P.

9habryka8y

[Moderator Note:] I am reasonably confident that this current format of the discussion is not going to cause any participant to change their mind, and seems quite stressful to the people participating in it, at least from the outside. While I haven't been able to read the whole debate in detail, it seems like you are repeating similar points over and over, in mostly the same language. I think it's fine for you to continue and comment, but I just really want to make sure that people don't feel an obligation to respond and get dragged into a debate that they don't expect to get any value from.

2agilecaveman8y

if the is indeed a typo, please correct it at the top level post and link to this comment. The broader point is that the interpretation of P( H | X2, M) is probability of heads conditioned on Monday and X2, and P (H |X2) is probability of heads conditioned on X2. In the later paragraphs, you seem to use the second interpretation. In fact, It seems your whole post's argument and "solution" rests on this typo. Dismissing betting arguments is very reminiscent of dismissing one-boxing in Newcomb's because one defines "CDT" as rational. The point of probability theory is to be helpful in constructing rational agents. If the agents that your probability theory leads to are not winning bets with the information given to them by said theory, the theory has questionable usefulness. Just to clarify, I have read Probability, the Logic of science, Bostrom's and Armstrong's papers on this. I have also read https://meaningness.com/probability-and-logic. The question of the relationship of probability and logic is not clear cut. And as Armstrong has pointed out, decisions can be more easily determined than probabilities, which means it's possible the ideal relationship between decision theory and probability theory is not clear cut, but that's a broader philosophical point that needs a top level post. In the meantime, Fix Your Math!

7ksvanhorn8y

No, P(H | X2, M) is Pr(H∣X2,M), and not Pr(H∣X2,Monday). Recall that M is the proposed model. If you thought it meant "today is Monday," I question how closely you read the post you are criticizing. I find it ironic that you write "Dismissing betting arguments is very reminiscent of dismissing one-boxing in Newcomb's" -- in an earlier version of this blog post I brought up Newcomb myself as an example of why I am skeptical of standard betting arguments (not sure why or how that got dropped.) The point was that standard betting arguments can get the wrong answer in some problems involving unusual circumstances where a more comprehensive decision theory is required (perhaps FDT). Re constructing rational agents: this is one use of probability theory; it is not "the point". We can discuss logic from a purely analytical viewpoint without ever bringing decisions and agents into the discussion. Logic and epistemology are legitimate subjects of their own quite apart from decision theory. And probability theory is the unique extension of classical propositional logic to handle intermediate degrees of plausibility. You say you have read PTLOS and others. Have you read Cox's actual paper, or any or detailed discussions of it such as Paris's discussion in The Uncertain Reasoner's Companion, or my own "Constructing a Logic of Plausible Inference: A Guide to Cox's Theorem"? If you think that Cox's Theorem has too many arguable technical requirements, then I invite you to read my paper, "From Propositional Logic to Plausible Reasoning: A Uniqueness Theorem" (preprint here). That proof assumes only that certain existing properties of classical propositional logic be retained when extending the logic to handle degrees of plausibility. It does not assume any particular functional decomposition of plausibilities, nor does it even assume that plausibilities must be real numbers. As with Cox, we end up with the result that the logic must be isomorphic to probability theory. In addit

[-]Jeff Jo8y20

You point out that Elga's analysis is based on an unproven assertion; that "it is Monday” and “it is Tuesday” are legitimate propositions. As far as I know, there is no definition of what can, or cannot, be used as a proposition. In other words, your analysis is based on the equally unproven assertion that they are not valid. Can remove the need to decide?

On Sunday, the steps of the following experiment are explained to Beauty, and she is put to sleep with a drug that somehow records her memory state. After she is put to sleep, two coins are flip

... (read more)

1ksvanhorn8y

On the first read I didn't understand what you were proposing, because of the confusion over "If the two coins show the same face" versus "If the two coins are not both heads." Now that it's clear it should be "if the two coins are not both heads" throughout, and after rereading, I now see your argument. The problem with your argument is that you still have "today" smuggled in: one of your state components is which way the nickel is lying "today." That changes over the course of the time period we are analyzing, so it does not give a legitimate proposition. To get a legitimate proposition we'll have to split it up into two propositions: "The nickel lies Heads up on Monday" and "The nickel lies Heads up on Tuesday". So in truth, the actual four possible outcomes are HHT, HTH, THT, and TTH. None of these is ruled out by the mere fact of waking up. Not until Beauty receives sufficient sensory input to provide a label for "today" that is nearly certain to be unique do we arrive at a situation in which your analysis is approximately correct. BTW, is this argument your own? Although I don't think it's right, it is an interesting argument. Is there a citation I should use if I want to reference it in future writing?

1ksvanhorn8y

The standard textbook definition of a proposition is this: (Adapted from https://www.cs.utexas.edu/~schrum2/cs301k/lec/topic01-propLogic.pdf.) The problem with a statement whose truth varies with time is that it does not have a simple true/false truth value; instead, its truth value is a function from time to the set {true,false}. As for the rest of your argument, my request is this: show me the math. That is, define the joint probability distribution describing what Beauty knows on Sunday night, and tell me what additional information she has after awakening on Monday/Tuesday. As I argued in the OP, purely verbal arguments are suspect when it comes to probability problems; it's too easy to miss something subtle. BTW, in one place you say "if the two coins are not both showing Heads," and in another you say "if the two coins show the same face"; which is the one you intended?

1Jeff Jo8y

(Sorry about the typo - I waffled between several isomorphic versions. The one I ultimately chose should have "both showed Heads.") In the OP, you said: Now you say: Are you really claiming that the statement "today is Monday" is not a sentence that is either true or false? That it is not "mutually exclusive" with "today is Tuesday"? Or are you simply ignoring the fact that the frame of reference, within which Beauty is asked to assess the proposition "The coin lands Heads," is a fixed moment in time? That she is asked to evaluate it at the current moment, and not over the entire time frame of the experiment? Let me insert an example here, to illustrate the problem with your assertion about functions. One half of a hidden, spinning disk is white; the other, black. It spins at a constant rate V, but you don't know its position at any previous time. There is a sensor aligned along its rim that can detect the color at the point in time when you press a button. You are asked to assess the probability of the proposition W, that the sensor will detect "white" when you first press the button. This is a valid proposition, even though it varies with time. It is valid because it doesn't ask you to evaluate the proposition at every time, but at a fixed point in time. It does have a simple true/false truth value if you are asked to evaluate it at fixed point in time. Your assertion applies to functions where every value of the dependent variable are considered to be "true" simultaneously. I did give you the math, but I'll repeat it in a slightly different form. Consider the point in time just before (A) in my version, when Beauty is awake and could be interviewed, or (B) in yours, when Beauty could be awakened. At this point in time, there are two valid-by-your-definition propositions: H, the proposition that "the coin lands Heads" and M, the proposition that "today is Monday." Each is asking about a specific moment in time, so your unsupported assertion that we need to

3ksvanhorn8y

Yes. It does not have a simple true/false truth value. Since it is sometimes true and sometimes false, its truth value is a function from time to {true, false}. That makes it a predicate, not a proposition. It is not a fixed moment in time; if it were, the SB problem would be trivial and nobody would write papers about it. The questions about day of week and outcome of coin toss are potentially asked both on Monday and on Tuesday. This makes the rest of your analysis invalid. You keep on asserting that "today is Monday" is evaluated at a fixed moment in time, when in reality it is evaluated at at least two separate moments in time with different answers. The sentence "the sensor detects white" is not a valid proposition; it is a predicate, because it is a function of time. Let's write P(t) for this predicate. But yes, the sentence "the sensor detects white when you first press the button" is a legitimate proposition, precisely because specifies a particular time t for which P(t) is true, and so the truth value of the statement itself does not vary with time. This gets us to the whole point of defining R(y,d): saying "Beauty has a stream of experiences y on day d" is as close as we can get to identifying a specific moment in time corresponding to the "this" in "this is day d". The more nearly that y uniquely identifies the day, the more nearly that R(y,d) can be interpreted to mean "this is day d".

1Jeff Jo8y

It most certainly does. It is true on Monday when Beauty is awake, and false on Sunday Night, on Tuesday whether or not Beauty is awake, and on Wednesday. A better random variable might be D, which takes values in {0,1,2,3} for these four days. What you refuse to deal with, is that its uninformed distribution depends on the stage of the experiment: {1,0,0,0} when she knows it is Sunday, {0,1/2,1/2,0} when she is awakened but not told the experiment is over, and {0,0,0,1} when she is told it is over. Or you could just recognize that the probability space when she awakes is not derived by removing outcomes from Sunday's. Which is how conventional problems in conditional probability work. That a new element of randomness is introduced by the procedures you use in steps 2 and 3. To illustrate this without obfuscation, ignore the amnesia part. Wake Beauty just once. It can happen any day during the rest of the week, as determined by a roll of a six-sided die. When she is awake, "Die lands 3" is just as valid a proposition - in fact, the same proposition - as "today is Wednesday." It has probability 1/6. If you add in the amnesia drug, and roll two dice (re-rolling if you get doubles so that you wake her on two random days), the probability for "a die lands 3" is 1/3, but for "today is Wednesday" it is 1/6. The proposition "coin lands heads" is sometimes true, and sometimes false, as well. In fact, you have difficulty expressing the tense of the statement for that very reason. But, it is a function of the parameters that define how you flip a coin: start position, force applied, etc. What you refuse to deal with, is that in this odd experiment, the time parameter Day is also one of the independent parameters that defines the randomness of Beauty's situation, and not one that makes Monday's state predicated on Sunday's. By being asked about the proposition H, Beauty knows that she is in either step 2 or step 3 of your experiment. This establishes a fixed value of th

3ksvanhorn8y

That's not a simple, single truth value; that's a structure built out of truth values. No, it is not. It has the same truth value throughout the entire scenario, Sunday through Wednesday. On Sunday and Monday it is impossible to know what that truth value is, but it is either true that the coin will land heads, or false that it will land heads -- and by definition, that is the same truth value you'll assign after seeing the coin toss. In contrast, the truth of "it is Monday" keeps on changing throughout the scenario. Likewise, the truth of "the sensor detects white" changes throughout the scenario you are considering in your button-and-sensor example. I don't know what it means to "define the randomness of the situation." In any event, the point you are missing is that Day changes throughout the problem you are analyzing -- not just that there are different possible values for it, and you don't know which is the correct one, but at different points in the same problem it has different values. Things like "today" and "now" are known as indexicals, and there is an entire philosophical literature on them because they are problematic for classical logic. Various special logics have been devised specifically to handle them. It would not have been necessary to devise such alternative logics if they posed no problem for classical logic. You can read about them in the article Demonstratives and Indicatives in The Internet Encyclopedia of Philosophy. Some excerpts: The problem with indexicals is that they have meanings that may change over the course of the problem being discussed. This is simply not allowed in classical logic. In classical logic, a proposition must have a stable, unvarying truth value over the entire argument. I'm going to appeal to authority here, and give you some quotes. Section 3.2, "Meanings of Sentences", in Propositions, Stanford Encyclopedia of Philosophy: (Emphasis added.) The above is telling us that a "proposition" involving an indexical is

-1Jeff Jo8y

(Not in order) Note the clause "in general." Any assertion that applies "in general" can have exceptions in specific contexts. We similarly cannot deduce, in general, that a coin toss which influences the path(s) of an experiment, is a 50:50 proposition when evaluated in the context of only one path. An awake Beauty is asked about her current assessment of the proposition "The coin will/has landed Heads." Presumably, she is supposed to answer on the same day. So, while the content of the expression "today" may change with the changing context of the overarching experiment, that context does not change between asking and answering. So this passage is irrelevant. And the problem with using this argument on the proposition "Today is Monday," is that neither the context, nor the meaning, changes within the problem Beauty addresses. No, it analyzed two specific usages of an indexical, and showed that they represented different propositions. And concluded that, in general, indexicals can represent different propositions. It never said that multiple usages of a time/location word cannot represent the same proposition, or that we can't define a situation where we know they represent the same proposition. So my corner bar can post a sign saying "Free Beer Tomorrow," without ever having to pour free suds. But if it says "Free Beer Today," they will, because the context of the sign is the same as the context when somebody asks for it. Both are indexicals, but the conditions that would make it ambiguous are removed. And over the duration of when Beauty considers the meaning of "today," it does not change. "Today" means the same thing every time Beauty uses it. This is different than saying the truth value of the statement is the same at different points in Beauty's argument; but it is. She is making a different (but identical) argument on the two days. Only if those circumstances might change within the scope of their use. And throughout Beauty's discussion of the pro

1ksvanhorn8y

Now you're really stretching. That duration potentially includes both Monday and Tuesday. This is getting ridiculous. "Today" means a different thing on every different day. That's why the article lists it as an indexical. Going back to the quote, the "discussion" is not limited to a single day. There are at least two days involved. I notice you carefully ignored the quote from Epstein's book, which was very clear that a classical proposition must not contain indexicals.

-1Jeff Jo8y

At any point in the history that Beauty remembers in step 2 of step 3, the proposition has a simple, single truth value. But she cannot determine what it that value is. This is basis for being able to describe its truth value with probabilities. In some instances of the experiment, it is true. In others, it is false. Just like "today is Monday" has the same truth value at any point in the history that Beauty remembers in step 2 of step 3. Your error is in falling to understand that, to an awake Beauty, the "experiment" she sees consists of Sunday and a single day after it. She just doesn't know which. In her experiment, the proposition "today is Monday" has a simple, single truth value. The truth of "it is Monday" never changes in any point of the scenario she sees after being wakened. And the point I am trying to get across to you is that it cannot change at any point of the problem Beauty is asked to analyze. The problem that I am analyzing is the problem that Beauty was asked to analyze. Not what an outside observer sees. She was told some details on Sunday, put to sleep, and is now awake on an indeterminate day. She is asked about a coin that may have been flipped, or has already been flipped, but to her that difference is irrelevant. "Today is Monday" is either true, or false (which means "Today is Tuesday"). She doesn't know which, but she does know that this truth value cannot change within the scope of the problem as she sees it now. No, "time" is an indexical. That means that the value of time can change the context of the problem when you consider different values to be part of the same problem. Not that a problem that deals with only one specific value, and so an unchanging context, has that property. While Beauty is awake, the day does not change. While Beauty is awake, the context of the problem does not change. While Beauty is awake, the other day of the experiment does not exist in her context. So for our problem, this resolves the issue that c

1ksvanhorn8y

No, it doesn't. This boils down to a question of identity. Absent any means of uniquely identifying the day -- such as, "the day in which a black marble is on the dresser" -- there is a fundamental ambiguity. If Beauty's remembered experiences and mental state are identical at a point in time on Monday and another point in time on Tuesday, then "today" becomes ill-defined for her. What instances are you talking about? We're talking about a single experiment. We're talking about epistemic probabilities, not frequencies. You need to relinquish your frequentist mindset for this problem, as it's not a problem about frequentist probabilities. No, it doesn't. She knows quite well that if the coin lands Tails, she will awaken on two separate days. It doesn't matter that she can only remember one of them. Epistemic probabilities are a function, not of the person, but of the available information. Any other person given the same information must produce the same epistemic probabilities. That's fundamental. Go read the quotes again. Are you a greater authority on this subject than the authors of the Stanford Encyclopedia of Philosphy? They're irrelevant. You added an extra layer of randomness on top of the problem. Each of the four card outcomes leads to a problem equivalent to the first. But randomly choosing one of four problems equivalent to the first problem doesn't tell you what the solution to the first problem is. I do not understand why you are so insistent on using "propositions" that include indexicals, especially when there is no need to do so -- we can express the information Beauty has in a way that does not involve indexicals. When we do so, we get an answer that is not quite the same as the answer you get when you play fast and loose with indexicals. Since you've never been able to point out a flaw in the argument -- all you've done is presented a different argument you like better -- you should consider this evidence that indexicals are, in fact, a probl

1Jeff Jo8y

At any point in the history that Beauty remembers when she is in one of those steps, the proposition M, "Today is Monday," has a simple, single truth value. All day. Either day. If she is in step 2, it is "true." If she is in step 3, it is "false." The properties of "indexicals" that you are misusing apply when, within her current memory state, the value of "today" could change. Not within the context of the overarching experiment. This has nothing to do with whether she knows what that truth value is. In fact, probability is how we represent the "fundamental ambiguity" that the simple, single truth value belonging to a proposition is unknown to us. If you want to argue this point, I suggest that you try looking for the forest through the trees. I tell you that I will flip a coin, ask a question, and then repeat the process. If the question is "What is the probability that the coin is showing Heads?", and I require an answer before I repeat the flip, then coin's state has a simple, single truth value that you can represent with a probability. If the question is "What is the probability that the coin is showing Heads?", and I require an answer only at after the second flip, the question only applies to the second since it asks about a current state.But it has a simple, single truth value that you can represent with a probability. If the question is "What is the probability of showing Heads?" then the we have the logical conundrum you describe. "Showing" is an indexical. It can change over time. But it is only an issue if we refer to it in the context of a range of time where it does change. That's why indexicals are a problem in general, but maybe not in a specific case. "Today" is never ill-defined for Beauty. The entirety of the experiment includes Sunday, Wednesday, and two other days. She knows that. The portion that exists in her memory state at the time she is asked to provide an answer consists of Sunday (when she learned it all), which cannot be "Tod

6habryka8y

[Kinda speaking from my experience as a moderator here, but not actually really doing anything super mod-related]: I haven't been able to follow the details from this conversation, and I apologize for that, but from the outside it does really look like you two are talking past each other. I don't know what the best way to fix that is, or even whether I am right, but my guess is that it's better to retire this thread for now and continue some other time. I am also happy to offer some more moderation if either of you requests that. Also feel free to ignore this and just continue with your discussion, but it seemed better to give you two an out, if either of you feels like you are wasting time but are forced to continue talking for some reason or another.

[-]habryka8y20

This seems great! I am interested in reading this in more detail when I have some more time.

[-]JeffJo2y10

Some researchers are going to put you to sleep. During the two days[1] that your sleep will last, they will briefly wake you up either once or twice, depending on the toss of a fair coin (Heads: once; Tails: twice). After each waking, they will put you to back to sleep with a drug that makes you forget that wak

... (read more)

[-][anonymous]8y10

As it stands now, I can't accept this solution, simply because it doesn't inform the right decision.

Imagine you were Beauty and q(y) was 1, and you were offered that bet. What odds would you take?

Our models exist to serve our actions. There is no such thing as a good model that informs the wrong action. Probability must add up to winning.

Or am I interpreting this wrong, and is there some practical reason why taking 1/2 odds actually does win in the q(y) = 1 case?

2ksvanhorn8y

Yes, there is. I'll be writing about that soon.

[-]musicmage41148y00

Beauty's physiological state (heart rate, blood glucose level, etc.) will not be identical, and will affect her thoughts at least slightly. Treating these and other differences as random,

Not all of the differences are random, though. Sleeping Beauty will always have aged by one day if awakened on Monday, and by two days if awakened on Tuesday, and even that much aging has distinguishable consequences. Now, I'm not at all familiar with the math involved, but it seems like this solution hinges on "everything" being random. If not everything is random, does this solution still work?

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

16

Sleeping Beauty Resolved?

16

16

Introduction

The standard framework for solving probability problems

Failure to properly apply probability theory

A red herring: betting arguments

Failure to construct legitimate propositions for analysis

Failure to include all relevant information

Defining the model

Analysis

Conclusion

References