"Without the step where an extortion ASI actually gets built, this seems closely analogous to Pascal's wager (not mugging). " The problem is, I expect it to be built, and I expect being built to be something instrumentally valuable to it in a way which cannot be inverted without making it much less likely, whereas the idea of a god who would punish those who don't think it exists can be inverted.
"And, this is basically just not possible. " I hope not.
"You do not have anywhere remotely high enough fidelity model of the superintelligence to tell the difference between "it can tell that it needs to actually torture you in the future in order to actually get the extra paperclips" vs "pretend it's going to it <in your simulation>, and then just not actually burn the resources because it knows you couldn't tell the difference."
My concern is that I might not need high fidelity.
"If you haven't done anything that looked like doing math (as opposed to handwavy philosophy), you aren't anywhere close, and the AI knows this, and knows it doesn't actually have to spend any resources to extract value from you because you can't tell the difference."
I hope you're correct about that, but I would like to know why you are confident about it. Eliezer Yudkowsky suggested that it would be rational to cooperate with a paperclip maximizer[1] from another universe in a one-shot prisoners' dilemma. This tells me that someone really intelligent (for a human) thinks that fidelity on its own is not enough to preclude acausal trade, so why should it preclude acausal blackmail?
His comment was 'I didn't say you should defect', if I remember correctly.
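To make the trade intuition concrete, here is a toy one-shot prisoners' dilemma with conventional payoff numbers I've picked purely for illustration (they're not from Yudkowsky's example). Against a fixed opponent, defection dominates, which is the usual causal argument; but if the two decision procedures are correlated closely enough to mirror each other, mutual cooperation is the reachable outcome that pays more, and the same correlation structure is what a blackmail scenario would exploit.

```python
# Toy one-shot prisoners' dilemma; payoff numbers are illustrative placeholders.
# Entries are (my_payoff, their_payoff) indexed by (my_action, their_action).
PAYOFFS = {
    ("cooperate", "cooperate"): (3, 3),
    ("cooperate", "defect"):    (0, 5),
    ("defect",    "cooperate"): (5, 0),
    ("defect",    "defect"):    (1, 1),
}

# Against any fixed opponent action, defecting pays me more (5 > 3 and 1 > 0),
# which is the causal-decision-theory case for defecting.
# If our decision procedures are correlated (we effectively mirror each other),
# only (C, C) and (D, D) are reachable, and (C, C) pays more:
print(PAYOFFS[("cooperate", "cooperate")][0])  # 3
print(PAYOFFS[("defect", "defect")][0])        # 1
```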
"So in your mind the counterpart to lethal misalignment ASI by default is s-risk extortion ASI by default. " Possibly.
"I don't think the arguments about decision making here depend on talking about s-risk as opposed to more mundane worse outcomes."
I agree. It seems like you are not aware of the main reason to expect acausal coordination here. Maybe I shouldn't tell you about it...
You're imagining a very different scenario than I am. I worry that:
From a purely amoral point of view, it's worth its while to simulate a vast number of possible minds which might, in some informationally adjacent regions of a 'mathematical universe', be in a position to create it. This means it doesn't need to simulate them exactly, only to the level of fidelity at which they can't tell whether they're being simulated (and in any case, I don't have the same level of certainty that it couldn't gather enough information about me to simulate me exactly). Maybe I'm an imperfect simulation of another person. I wouldn't know, because I'm not that person.
"the imagined god is angry at my specific actions (or lack thereof) enough to torture me rather than any other value it could get from the simulation." I don't think it needs to be angry, or a god. It just needs to understand the (I fear sound) logic involved, which Eliezer yudkowsky took semi-seriously.
"4) the imagined god has a decision process that includes anger or some other non-goal-directed motivation for torturing someone who can no longer have any effect on the universe."
It wouldn't need to be non-goal-directed.
"no other gods have better things to do with the resources, and stop the angry one from wasting time." What if there are no 'other gods'? This seems likely in the small region of the 'logical/platonic universe containing this physical one.
I agree, but I worry that there won't be that many agents which weren't created by a process that makes basiliskoid minds disproportionately probable in the slice of possible worlds which contains our physical universe. In other words, I mostly agree with the Acausal normalcy idea, but certain idiosyncratic features of our situation, namely that humans are producing potentially the only ASI in this physical universe, seem to mean that things like the basilisk are still a concern.
Maybe there will be an acausal 'bubble' within which blackmail can take place, kind of like the way humans tend to find it moral to allow some animals to prey on others because we treat the 'ecosystem' as a moral bubble.
"I suspect a fairly high neuroticism and irrational failure to limit the sum of their probabilities to 1 of anyone who thinks it's significant." Why? What justifies your infinitesimal value?
I find it very difficult to estimate probabilities like this, but I expect the difference between the probability of something significant happening if I do something in response to the basilisk and the probability of it happening if I don't is almost certainly in excess of 1/1000, or even 1/100. This is within the range where I think it makes sense to take it seriously. (And this is why I asked this question.)
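To illustrate why I treat a probability difference of that size as worth acting on, here is a toy expected-value comparison. The harm and cost figures below are placeholders I've made up purely for illustration, not estimates of anything real:

```python
# Toy expected-value check; all numbers are illustrative placeholders.
p_diff = 1 / 1000           # assumed change in probability of the bad outcome if I act
harm_magnitude = 1e6        # placeholder size of the harm, in arbitrary utility units
cost_of_acting = 10         # placeholder cost of doing something in response

expected_harm_avoided = p_diff * harm_magnitude        # 1000.0
net_benefit_of_acting = expected_harm_avoided - cost_of_acting
print(net_benefit_of_acting)  # 990.0: positive under these made-up numbers, so not negligible
```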
"TDT and FDT are distict from CTD, but they're not actually acausal, just more inclusive of causality of decisions." I agree that the term 'acausal' is misleading; I take it to refer to anything which takes the possibility of being instantiated in different parts of a 'platonic /mathematical universe' into account. That CDT as it's usually referred to does not is the main reason why I find it problematic and why it doesn't allow an agent to profit in Newcomb's problem.
"Moreover so much more that what could exist does." Why would that be?
"For every Basilisk, there could be as likely an angel." I don't think I agree with this. There are reasons to think a basilisk would be more likely than a benevolent intelligence.
"The value of being tortured is negative and large, but finite: there are things that are worth enduring torture." That would depend on the torture in question, and I don't want to consider it.
"If they are threatening to cause harm if you don’t comply, that’s their fault, not yours." Yes, but that doesn't mean they can't cause said harm anyway.
I don't think I can prevent it from being created. But I do have some ability to influence whether it has an acausal incentive to hurt me (if in fact it has one).