Nathan1123

Are ya winning, son?

Hello, I've been reading through various Newcomblike Problems, in order to get a better understanding of the differences between each Decision Theory. From what I can tell, it seems like each Decision Theory gets evaluated based on whether they are able to "win" in each of these thought experiments. Thus, there is an overarching assumption that each thought experiment has an objectively "right" and "wrong" answer, and the challenge of Decision Theory is to generate algorithms that will guarantee that the agent will choose the "right" answer. However, I am having some trouble in seeing how some of these problems have an objectively "winning" state. In Newcomb's Problem, obviously one can say that one-boxing "wins" because you get way more money than two-boxing, and these are the only two options available. Of course, even here there is some room for ambiguity, as said by Robert Nozick: > To almost everyone it is perfectly clear and obvious what should be done. The difficulty is that these people seem to divide almost evenly on the problem, with large numbers thinking that the opposing half is just being silly. But other Newcomblike Problems leave me with some questions. Take, for example, the Smoking Lesion Problem. I am told that smoking is the winning decision here (as long as we take a suspension of disbelief from the fact that smoking is bad in the real world). But I'm not sure why that makes such a big difference. Yes, the problem states we would prefer to smoke if we could, but our preferences can come from many different dimensions such as our understanding of the environment, not just a spontaneous inner desire. So when EDT says that you shouldn't smoke because it increase the probability of having a cancerous lesion, then one could say that that information has shaped your preference. To use a different analogy, I may desire ice cream because it tastes good, but I may still prefer not to eat it out of my understanding of how it impacts my health and weig

14Aug 9, 2022

Nathan1123

Message

Implication of Uncomputable Problems

Some problems of mathematics like the Halting Problem and the Busy Beaver Problem are uncomputable, meaning that it is mathematically proven that any Turing-complete computer is physically incapable of solving the problem no matter how sophisticated its hardware or software is. Some algorithms on a Turing machine can be used...

Jan 30, 2025-3

Humans don't understand how we do most things

Hello, I find that a fundamental premise towards the development of AI is the ability to dissect all the internal calculations used by the human mind to make rational decisions, and then replicate those calculations in an electronic system. On a surface level, this works very well for many kinds...

Jun 5, 20232

The Stanley Parable: Making philosophy fun

Hello, If you are unaware, The Stanley Parable is a video game originally released in 2013, and later re-released 2022 under the title The Stanley Parable: Ultra Deluxe. The player takes control of the titular character Stanley, as the present and future actions of his life are being dictated by...

May 22, 20236

Is there any literature on using socialization for AI alignment?

Hello, I was recently thinking about the question of how humans achieve alignment with each other over the course of our lifetime, and how that process could be applied to an AGI. For example, why doesn't everyone shop lift from the grocery store? A grocery store isn't as secure as...

Apr 19, 202310

Could the simulation argument also apply to dreams?

> We know that a dream can be real, but who ever thought that reality could be a dream? We exist, of course, but how, in what way? As we believe, as flesh-and-blood human beings, or are we simply parts of someone's feverish, complicated nightmare? > > * Charles Beaumont...

Aug 17, 20226

What is the probability that a superintelligent, sentient AGI is actually infeasible?

Hello, I am not at all adverse to discussing contingencies for things that are either uncertain or unlikely, but I'm curious what the general consensus is for how likely a superintelligent AGI scenario actually is. To be clear, I am certainly aware that advances in AI have made leaps and...

Aug 14, 2022-3

An Uncanny Prison

In reading about proposals for AI Boxing, I notice two excerpts jump out at me. First, is a proposed scenario to uncover an unfriendly AI: > A virtual world between the real world and the AI, where its unfriendly intentions would be first revealed And the second excerpt of interest...

Aug 13, 20223

Load More (7/16)

LESSWRONG
LW

LESSWRONG
LW

Nathan1123

Nathan1123

Nathan1123

Are ya winning, son?

Newcombness of the Dining Philosophers Problem

Is there any literature on using socialization for AI alignment?

The Stanley Parable: Making philosophy fun

Nathan1123

Implication of Uncomputable Problems

Humans don't understand how we do most things

The Stanley Parable: Making philosophy fun

Is there any literature on using socialization for AI alignment?

Could the simulation argument also apply to dreams?

What is the probability that a superintelligent, sentient AGI is actually infeasible?

An Uncanny Prison

Are ya winning, son?

Newcombness of the Dining Philosophers Problem

Is there any literature on using socialization for AI alignment?

The Stanley Parable: Making philosophy fun

Implication of Uncomputable Problems

Humans don't understand how we do most things

The Stanley Parable: Making philosophy fun

Is there any literature on using socialization for AI alignment?

Could the simulation argument also apply to dreams?

What is the probability that a superintelligent, sentient AGI is actually infeasible?

An Uncanny Prison