Why should ethical anti-realists do ethics?

Looking forward to your next post, but in the meantime:

AI - Seems like it would be easier to build an AI that helps me get what I want, if "what I want" had various nice properties and I wasn't in “crossing that bridge when we come to it” mode all the time.
meta-ethical uncertainty - I can't be sure there is no territory.
ethics/philosophy as a status game - I can't get status from this game if I opt out of it.
morality as coordination - I'm motivated to make my morality have various nice properties because it helps other people coordinate with me (by letting them better predict what I would do in various situations/counterfactuals).

Thus, for example, intransitivity requires giving up on an especially plausible Stochastic Dominance principle, namely: if, for every outcome o and probability of that outcome p in Lottery A, Lottery B gives a better outcome with at least p probability, then Lottery B is better (this is very similar to “If Lottery B is better than Lottery A no matter what happens, choose Lottery B” – except it doesn’t care about what outcomes get paired with heads, and which with tails).

This principle is phrased incorrectly. Taken literally, it would imply that the mixed outcome "utility 0 with probability 0.5, utility 1 with probability 0.5" is dominated by "utility 2 with probability 0.5, utility -100 with probability 0.5". What you probably want to do is to add the condition that the function f mapping each outcome to a better outcome $f (o)$ is injective (or equivalently, bijective). But in that case, it is impossible for $f (o)$ to occur with probability strictly greater than $P (o)$ , since $P (o) \leq P (f (o)) = 1 - \sum o^{'} \neq o P (f (o^{'})) \leq 1 - \sum o^{'} \neq o P (o^{'}) = P (o) .$

[-]Joe Carlsmith3y30

Oops! You're right, this isn't the right formulation of the relevant principle. Will edit to reflect.

[-]Jonathan Claybrough3y30

I generally explain my interest in doing good and considering ethics (despite being anti realist) something like your point 5, and I don't agree with or fully get your refutation that it's not a good explanation, so I'll engage with it and hope for clarifications.

My position, despite anti-realism and moral relativism, is I do happen to have values (which I can "personal values", they're mine and I don't think there's an absolute reason for anyone else to have them, though I will advocate for them to some extent) and epistemics (despite the problem of the criterion) that have initialized in a space where I want to do Good, I want to know what is Good, I want to iterate at improving my understanding and actions doing Good.

A quick question - when you say "Personally, though, I collect stamps", do you mean your personal approach to ethics is descriptive and exploratory (and you're collecting stamps in the sense of physics vs stamp collection image), and that you don't identify as systematizer ?

I wouldn't identify as "systematizer for its sake" either, it's not a terminal value, but it's an instrumental value for achieving my goal of doing Good. I happen to have priors and heuristics saying I can do more Good by systematizing better so I do, and I get positive feedback from it so I continue this.
Re "conspicuous absence of subject-matter" - true for an anti realist considering "absolute ethics", but this doesn't stop an anti realist considering what they'll call "my ethics". There can be as much subject-matter there as in realist absolute ethics, because you can simulate absolute ethics in "my ethics" with : "I epistemically believe there is no true absolute ethics, but my personal ethics is that I should adopt what I imagine would be the absolute real ethics if it existed". I assume this is an existing theorized position but not knowing if it already has another standard name, I call this being a "quasi realist", which is how I'd describe myself currently.

I don't buy Anti realists treating consistency as absolute, so there's nothing to explain. I view valuing consistency as being instrumental and it happens to win all the time (every ethics has it) because of the math that you can't rule out anything otherwise. I think the person who answers "eh, I don’t care that much about being ethically consistent" is correct that it's not in their terminal values, but miscalculates (they actually should value it instrumentally), it's a good mistake to point out.
I agree that someone who tries to justify their intransitivities by saying "oh I'm inconsistent" is throwing out the baby with the bathwater when they could simply say "I'm deciding to be intransitive here because it better fits my points". Again, it's a good mistake to point out.
I see anti realists as just picking up consistency because it's a good property to have for useful ethics, not because "Ethics" forced it onto them (it couldn't, it doesn't exist).

On the final paragraph, I would write my position as : “I do ethics, as an anti-realist, because I have a brute, personal preference to Doing Good (a cluster of helping other people, reducing suffering, anything that stems from Veil of Uncertainty which is intuitively appealing), and that this is self reinforcing (I consider it Good to want to do Good and to improve and doing Good), so I want to improve my ethics. There exists zones of value space where I'm in the dark and have no intuition (eg. population ethics/repugnant conclusion) so I use good properties (consistency, ..) to craft a curve which extends my ethics, not because of personal preference for blah-structural-properties, but by belief that this will satisfy my preferences to Doing Good the best".
If a dilemma comes up pitting object level stakes and some abstract structural constraint, I weigh my belief that my intuition on "is this Good" is correct against my belief that "the model of ethics I constructed from other points is correct" and I'll probably update one or both. Because of the problem of the criterion, I'm neither gonna trust my ethics or my data points as absolute. I have uncertainty on the position of all my points and on the best shape of the curve, so sometimes I move my estimate of the point position because it fits the curve better, and sometimes I move the curve shape because I'm pretty sure the point should be there.

I hope that's a fully satisfying answer to "Why do ethical anti-realists do ethics".
I wouldn't say there's an absolute reason why ethical anti-realists should do ethics.

[-]TAG3y10

If you think morality would be of benefit to you, and you don't think it is pre existing in the territory, then you have a motivation to construct it.l, and therefore to engage with it

That's particularly true of the plural "you". Consider contractualism, one of the main forms of constructivism. It's in everyone's interests to get into arrangements of the form "I agree not to murder you if you agree not to murder me"...the benefits are considerable , and the costs are minor (so long as you are not a sociopath).

If you frame the question as "why should an iindividual , moral anti realist, who doesn't care about interactions or coordination with others, engage with ethics"... then it's much harder to solve .

[-]Noosphere893y10

Re bullet biting as a moral anti-realist: I don't actually agree that you don't bite certain bullets for moral anti-realism.

I think the major bullets you'd have to bite for moral anti-realism are:

Ethics gets personal, irrevocably. That is, there aren't universal principles for everyone, or even within your social group.
Values conflicts are unresolvable by default.
The Orthogonality Thesis is strengthened, in that all moral views are fundamentally right or valid.
When a group makes decisions, or when an individual makes decisions, they always impose their own values on things. Decisions are never value neutral.

[-]TAG3y10

Ethics gets personal, irrevocably. That is, there aren’t universal principles for everyone, or even within your social group.

There aren't unless you make them.

Values conflicts are unresolvable by default.

But you can construct conflict resolution mechanisms, and you are better off with them .

The Orthogonality Thesis is strengthened, in that all moral views are fundamentally right or valid.

There are many versions of anti realism. Some suggest that all moral claims are false or meaningless.

When a group makes decisions, or when an individual makes decisions, they always impose their own values on things.

Contractualism tends towards everyone having an equal say, since people voluntarily adhere to that kind of contract more readily

^{^}

In particular, I'm setting aside some issues to do with understanding what it means to "make a mistake," if you're an anti-realist. But it's also quite a large topic more generally.

^{^}

For some of my gripes with this picture, see here.

^{^}

Granted, one needs to say something about what’s going on with weakness of the will, here. But I expect to be able to do so.

^{^}

I expect this is the source, for example, of “open question” stuff. Thus, suppose we try to say, with some anti-realists, that what you “should do” is constituted by what you “would want to do in blah circumstances.” And suppose I tell you that in such circumstances, you’d want to kill babies. Does that settle the question of whether to kill babies? No. You’ve still got to decide. The territory is one thing. Your response is another.

“Ah,” say the realists. “That’s because you weren’t talking about the right sort of territory. There’s some ‘essentially practical’ territory, which consists of ‘should’-y properties and facts, irreducibly different from the standard story. If I told you that according to this territory, you should kill babies, then this would settle the question of whether to do so.” But would it?

^{^}

See Chapter 2 of Nick Beckstead’s thesis for a nice discussion of this picture.

^{^}

And I’m skeptical that appeals to “idealization” will be enough on their own.

^{^}

Some people go further, and start fetishizing simplicity to some more extreme extent (see e.g. my discussion of “simplicity realism” for some vibes in this broad vicinity). But again: in the context of anti-realist ethics, why such a fetish? And anyway, was that the problem with the slaveholders? That their values required too complex a description?

^{^}

It does seem like ascribing goals/values to other people is importantly tied to predicting their behavior, but it also seems to license attributions of “mistakes.”

^{^}

OK, OK, I exaggerate. Total utilitarianism isn’t actually about aiming at the lizards. Just: being willing to, if the time comes. (Which it won’t!) (We hope.) (But why should we hope that?)

^{^}

If necessary, we could formulate this inconsistency more precisely: i.e., by rephrasing (3) as “If (1) is true, then (2) is false,” or some such.

^{^}

If you don’t like the “morality game,” we can rephrase the example in terms of “I have most reason to,” or whatever. It loses some of its force, but the basic dynamic persists.

^{^}

Here I’m borrowing from Michael Huemer’s “In Defense of Repugnance,” which I recommend to people interested in the Repugnant Conclusion.

^{^}

Here, the name derives from the idea that if, for a fixed population, you have the option to improve the total and the average and the equality of the distribution, then you’d have to be actively against equality to pass on it (since presumably you like improvements to the total and the average, I guess?). I don’t like this name; nor do I think this principle especially obvious, but let’s go with it for now.

^{^}

There's also some question of whether the transitivity of betterness is a conceptual truth; but I don't think that's the best terrain on which to have the debate.

^{^}

Start with the Utopia. Then (per Benign Addition), make the lives of everyone in Utopia better, but also add, off in some distant part of the galaxy, a giant pit filled with a zillion zillion lizards, each living barely-barely worthwhile lives. Better for everyone, right? The utopians all agree: nice move. And the lizards, if you count them, would agree too. But note that you don’t even need to count them. That is, as long as you don’t think that creating the lizards is actively bad, you can just be focusing on the fact that you’re improving the lives of all the utopians, which seems hard to dislike, plus doing something either neutral or positive on the side (i.e., lizard farming).

(And note that if you say that it’s actively bad to create slightly-happy lizards, you can quickly end up saying that it’s betterto create beings with net negative lives than to create blah number of slightly-happy lizards, which also seems rough. And note, too, that to sufficiently advanced or happy beings, the best contemporary human lives might look lizard-like – barely conscious, vaguely pleasant but dismayingly dull, lacking in all but the most base goods. Do you want the aliens thinking of your life as bad to create, because too close to zero? If not, where and why does the line get drawn?)

But if there are enough lizards, then improving the lives of all the lizards by some small amount, plus bringing the lives of the utopians down to lizard-level, will end up mandated by Non-anti-egalitarianism. E.g., if there are 100 utopians all at welfare 100, plus a million lizards all at welfare 1, then putting everyone at 2 instead is a better total (2,000,200 vs. 100,010,000), a better average (2 vs. ~1), and a more equal distribution as well.

So by Transitivity, a sufficiently giant lizard farm is better than Utopia. (We can go further, here, and start adding in arbitrary Hells that get outweighed by the lizards. I’m going to pass on that for now, but people who like the repugnant conclusion should expect to have to grapple with it.)

^{^}

I think this is known as “McTaggart’s conclusion,” but can’t easily find the reference.

^{^}

Suppose you’ve got a great, hundred-year life overall. Now suppose we could make every moment of those hundred years better, plus give you an extra billion years in some slightly-net-positive state – say, some kind of slightly-pleasant nap, or watching some kind of slightly-good TV show that you don’t get bored of, or maybe you’re transformed into a slightly-happy lizard for that time but somehow you’re still yourself. Should you take the trade?

Hum, actually, I dunno. (Am I allowed to kill myself, somewhere in those billion years, if I decide that I want out? Will I still be in a position to make that decision? OK, OK, we’re stipulating that I wouldn’t want to, even from sort of idealized perspective or something. Do I trust that perspective? A billion years is a long time…)

OK, but suppose you do it, on grounds analogous to Benign Addition (i.e., it improves your existing life, plus adds something stipulated to be non-bad). But now suppose you can improve all that nap/TV/lizard time by some small amount, at the one-time, low-low cost of giving up ~everything you loved in your original life and napping/TV-ing/lizard-ing full-time. Higher total! Higher average! (Do we care about equality across the moments of our own life? I don’t; maybe the opposite.) Shouldn’t you do it? Note, for example, that the first hundred years of your life, at this point, are a tiny portion of the overall experience – the equivalent of the first three seconds of your hundred-year thing. Mostly, you’re a lizard. You had your great loves and joys back around the time multi-cellular life was evolving, and you’ve been a lizard ever since. Maybe you should focus on improving the lizard-ness? It’s really the main event…

^{^}

Thanks to Ketan Ramakrishnan for discussion of this point in a different context.

^{^}

These are sometimes called “impossibility results,” but often, “valid arguments” would do just as well.

^{^}

Though Alexander is responding to a formulation of this dilemma that relies on a version of Benign Addition that doesn’t improve the lives of existing people, and which is therefore, in my opinion, less forceful.

^{^}

Suppose we say it’s (slightly) bad to create someone whose life is positive, but below average. Then, at least if doing more bad things is more bad, and bads can trade off in standard ways, then you risk saying that it’s better to create someone with a net-negative life than to create a sufficiently large number of people with net positive lives (a violation of what I think is sometimes called “Anti-sadism”). Alternatively, if you say it’s neutral to create people with net positive but below-average lives, it sure looks like you ought to be accepting my version of Benign Addition above, given that it’s good to improve the lives of the existing people, and the creation of the extra lives in neutral. Also, if it’s neutral or bad to create net-positive life, then why is it OK to risk creating net-negative life when you e.g. have kids? Also, if it’s neutral to create a somewhat happy but below-average child, and neutral to create a more happy but still-below-average child, but better to create the second than the first, how do you deal with the intransitivities this creates? Do you want to start trying to say fancy stuff about incommensurability? Also, any appeal to the “average” is going to implicate “Egyptology” problems, where e.g. you can’t decide whether to have kids until you know what the average welfare was like in ancient Egypt, what it’s like on other planets, etc. In general, I recommend Chapter 4 of Nick Beckstead’s thesis for discussion of the various choice-points in trying to say that potential people don’t matter (or matter less). In Alexander’s follow-up to the review, he revises his position to ““morality prohibits bringing below-zero-happiness people into existence, and says nothing at all about bringing new above-zero-happiness people into existence, we’ll make decisions about those based on how we’re feeling that day and how likely it is to lead to some terrible result down the line.”

^{^}

“But in the end I am kind of a moral nonrealist who is playing at moral realism because it seems to help my intuitions be more coherent. If I ever discovered that my moral system requires me to torture as many people as possible, I would back off, realize something was wrong, and decide not to play the moral realism game in that particular way. This is what’s happening with the repugnant conclusion.”

^{^}

In particular, “just stay at World A” doth not an adequate population ethic make (population ethics aspires to do much more than to rank world A, A+, and Z), he’s going to want to hold on to other intuitive data-points as well; and Alexander doesn’t say what he proposes to do if the intuitive satisfaction he seeks actually is impossible (indeed, there are various proofs in this vicinity, which Alexander is aware of, so I’m a bit confused by his optimism here).

^{^}

Alexander acknowledges that he is bound by the empirical implications of ethical principles he is committed to – for example, if he is committed to “suffering is wrong,” then he has to follow the evidence where it leads re: where there is suffering, including re: animals. That said, it’s not actually clear why this would be the case, on anti-realism: in principle, you could revise your ethical principles once you see that they lead (in conjunction with plausible empirical views), to counter-intuitive places.

^{^}

For what it’s worth: while I think I’m more sympathetic to the repugnant conclusion than the average philosopher (though see: these folks), I’m pretty open to denying these other premises as well (though transitivity is probably last-on-my-list to deny). Indeed: I suspect that, in general, once you start getting more open to bounded-utility-function vibes (which I think we may need to get open to), and to the idea that some parts of your value system might not be willing to sacrifice themselves arbitrarily for other parts (even if the other parts try to jack up the stakes arbitrarily), then the repugnant conclusion will start to look much more optional (and premises like Non-Anti-Egalitarianism much more suspect).

^{^}

Or perhaps, some set of concrete philosophers. MacAskill’s book, for example, has a real-world agenda.

^{^}

At least not for reasonable ways of spelling out what these two principles mean.

^{^}

This is the translation of the drowning child case, above, into policy talk.

^{^}

Here the symbol “≻” means “is preferred to” or “better than” or some “chosen over” or some such.

^{^}

As an example of someone who seems to me to take this sort of argument fairly seriously, see Yudkowsky here.

Yudkowsky’s piece is also tied to a related but distinct discourse about coherence, which has to do with whether philosophical arguments about coherence give us reason to expect that future, powerful AI systems will have whatever property in the vicinity of “goal-directedness” and “consequentialism” causes them (the worry goes) to seek power and maybe kill everyone. I associate this argument mostly closely with Yudkowsky (see here, here, and here; though see also section 2 of Omohundro (2008)), and there’s been a lively debate about it over the years (see e.g. Shah, Ngo, Dai, Grace).

This particular argument is about a certain category of empirical prediction (though exactly what sort of empirical prediction isn’t always clear, given that any particular pattern of real-world behavior can in principle be interpreted as maximizing the expectation of some utility function – see here for more). In the present essay, though, I’m mostly interested in the normative question of whether we (and in particular, the anti-realists amongst us) should be coherent. You could think that the future slides us relentlessly towards a world full of coherent (omnicidal), expected-utility maximizers, forged from all the free power that coherence supposedly pays out, without thinking that you, yourself, should go with the flow (compare with Moloch, evolution, and so on). Perhaps you, in your incoherence, are a dying breed. But does that make it an ignoble tradition? Must you make yourself anew, out of metal and math, into some sort of sleeker and colder machine? Need you succumb to all this … modernity? Rage against the dying…

Still, there are important connections between the empirical and normative debates. In particular, the empirical debate typically appeals to two sorts of processes that could reshape a system into a more coherent form: the system itself (or perhaps, some part of it, or some combination), and something outside the system (for example, a training process; a set of commercial incentives; etc). I won’t be discussing “outside the system” forces very much here (though I’d guess that this is where the strongest empirical arguments will come), but to the extent the system’s own self-modification is supposed to be an empirical force for coherence, the normative question of whether coherence should look, from the perspective of the system itself, like an important target of self-modification becomes quite relevant. That is, one key thrust of the empirical argument is basically supposed to be “coherence is free power, the system itself will want free power, so the system itself will try to become more coherent.” But if the “free power” argument is weak, from a normative perspective, this sort of reasoning looks weaker as well. And indeed, to the extent that “outside-the-system” argument runs along similar lines – “coherence is free power, the outside-the-system process will want the system to be powerful, so the outside-the-system process will modify the system to be more coherent” – this argument might look weaker as well.

^{^}

Here I’m adding in more philosophical content than my typical conversation about this, to include more of what happens in conversations with myself.

^{^}

This is similar to the strategy Alexander pursues in response to MacAskill. See also Ahmed (2017) for a more full-scale defense.

^{^}

See also Huemer: “there are no known cases in which one should make a particular choice to prevent oneself from later making a particular perfectly rational, informed, and correct choice, other than cases that depend on intransitivity.” Using what I expect is Huemer’s notion of “rational” and “correct,” I think Parfit’s Hitchhiker would count as such a case.

^{^}

This is a case I heard Johann Frick give at a talk at NYU in spring of 2017. Note the similarity to the Mere Addition Paradox discussed above.

^{^}

See the discussion of Transitivity here for more.

^{^}

This discussion is inspired by one in Huemer (2008), though he uses a different dominance principle.

^{^}

Though: which do you want?

^{^}

Well, really, it’s: Apples if my choice set is blah, Grapes if my choice set is blah, etc. But trying to pump some intuition re: confusion about what intransitive agents are “trying to do” overall.

^{^}

Is it, maybe, actively counter-productive in this respect, insofar as it requires you to endorse some counter-intuitive and therefore unpopular conclusion?

^{^}

The proof for the utility function version goes through the aggregation theorem in Harsanyi (1955).

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

44

Why should ethical anti-realists do ethics?

44

44

1. Introduction

2. The problem

2.1 Map-making with no territory

2.2 Why curve-fit?

2.3 Who needs ethics if you’re free?

3. Some examples where this stuff comes up

3.1 Drowning children stuff

3.2 Lizard stuff

3.3 Scott Alexander on rejecting the “philosophy game”

4. I’m not trying to turn you into a lizard

5. Some kind of brute preference for consistency and systematization?

6. Money-pumps

6.1 Dialogue with an intransitive agent

6.2 Coherence isn’t free

6.3 Can we even think outside of a rational agent ontology?