Appraising aggregativism and utilitarianism

Thanks for writing this!

I only skimmed the post, so I may have missed something, but it seems to me that this post underemphasizes the fact that both Harsanyi's Lottery and LELO imply utilitarianism under plausible assumptions about rationality. For example, if the social planner satisfies the vNM axioms of expected utility theory, then Harsanyi's Lottery implies that the social planner is utilitarian with respect to expected utilities (Harsanyi 1953). Likewise, if the social planner's intertemporal preferences satisfy a set of normatively plausible axioms, then LELO implies that the social planner is utilitarian with respect to experienced utilities (Fryxell 2024). In my view, it is therefore not clear that it makes sense to compare LELO and Harsanyi's Lottery with utilitarianism.

Also, at least some of the advantages of aggregativism that you mention are easily incorporated into utilitarianism. For example, what is achieved by adopting LELO with exponential time-discounting in Section 2.5.1 can also be achieved by adopting discounted utilitarianism (rather than unweighted total utilitarianism).

A final tiny comment: LELO has a long history, going back to at least C.I. Lewis's "An Analysis of Knowledge and Valuation", though the term "LELO" was coined by my colleague Loren Fryxell (Fryxell 2024). It's probably worth adding citations to these.

Cleo Nardo (1y)

thanks for comments, gustav

I only skimmed the post, so I may have missed something, but it seems to me that this post underemphasizes the fact that both Harsanyi's Lottery and LELO imply utilitarianism under plausible assumptions about rationality.

the rationality conditions are a pretty decent model of human behaviour, but they're only approximations. you're right that if the approximation is perfect then aggregativism is mathematically equivalent to utilitarianism, which does render some of these advantages/objections moot. but I don't know how close the approximations are (that's an empirical question).

i kinda see aggregativism vs utilitarianism as a bundle of claims of the following form:

humans aren't perfectly consequentialist, and aggregativism answers the question "how consequentialist should our moral theory be?" with "exactly as consequentialist as self-interested humans are."
humans have an inaction bias, and aggregativism answers the question "how inaction-biased should our moral theory be?" with "exactly as inaction-biased as self-interested humans are."
humans are time-discounting, and aggregativism answers the question "how time-discounting should our moral theory be?" with "exactly as time-discounting as self-interested humans are."
humans are risk-averse, and aggregativism answers the question "how risk-averse should our moral theory be?" with "exactly as risk-averse as self-interested humans are."
and so on

the purpose of the social zeta function is simply to map social outcomes (the object of our moral attitudes) to personal outcomes (the object of the self-interested human's attitudes) so this bundle of claims type-checks.

Also, at least some of the advantages of aggregativism that you mention are easily incorporated into utilitarianism. For example, what is achieved by adopting LELO with exponential time-discounting in Section 2.5.1 can also be achieved by adopting discounted utilitarianism (rather than unweighted total utilitarianism).

yeah that's true, two quick thoughts:

i suspect exponential time-discounting was added to total utilitarianism because it's a good model of self-interested human behaviour. aggregativism says "let's do this with everything", i.e. we modify utilitarianism in all the ways that we think self-interested humans behave.
suppose self-interested humans do time-discounting, then LELO would approximate total utilitarianism with discounting in population time, not calendar time. that is, a future generation is discounted by the sum of lifetimes of each preceding generation. (if the calendar time for an event is $T$ then the population time for the event is $\int_{-\infty}^{T} N(t) \, dt$ where $N(t)$ is the population size at time $t$. I first heard this concept in this Greaves talk.) if you're gonna adopt discounted utilitarianism, then population-time-discounted utilitarianism makes much more sense to me than calendar-time-discounted utilitarianism, and the fact that LELO gives the right answer here is a case in favour of it.
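A rough sketch of the difference between the two kinds of discounting, assuming discrete periods (so the integral becomes a sum) and exponential discounting. The population figures, the rate, and all function names are made up for illustration.

```python
import math

def population_time(population, T):
    """Discrete stand-in for the integral of N(t) up to T: people-periods lived before T."""
    return sum(population[:T])

def discount_weights(population, rate, use_population_time):
    """Exponential discount weight for each period, in calendar or population time."""
    weights = []
    for T in range(len(population)):
        elapsed = population_time(population, T) if use_population_time else T
        weights.append(math.exp(-rate * elapsed))
    return weights

# Hypothetical population size in each calendar period (assumption).
population = [1, 1, 10, 10]
rate = 0.1  # hypothetical discount rate (assumption)

calendar = discount_weights(population, rate, use_population_time=False)
pop_time = discount_weights(population, rate, use_population_time=True)

for T, (c, p) in enumerate(zip(calendar, pop_time)):
    print(f"period {T}: calendar weight {c:.3f}, population-time weight {p:.3f}")
```

The two schemes agree while the population stays at 1 per period and diverge once it grows: the last period follows 12 people-periods rather than 3 calendar periods, so it is discounted more heavily in population time.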

A final tiny comment: LELO has a long history, going back to at least C.I. Lewis's "An Analysis of Knowledge and Valuation", though the term "LELO" was coined by my colleague Loren Fryxell (Fryxell 2024). It's probably worth adding citations to these.

I mention Loren's paper in the footnote of Part 1. i'll cite him in part 2 and 3 also, thanks for the reminder.

Gustav Alexandrie (1y)

I appreciate the reply!

"the rationality conditions are a pretty decent model of human behaviour, but they're only approximations. you're right that if the approximation is perfect then aggregativism is mathematically equivalent to utilitarianism, which does render some of these advantages/objections moot. but I don't know how close the approximations are (that's an empirical question)."

I'm not sure why we should combine Harsanyi's Lottery (or LELO or whatever) with a model of actual human behaviour. Here's a rough sketch of how I am thinking about it: Morality is about what preference ordering we should have. If we should have preference ordering R, then R is rational (morality presumably does not require irrationality). If R is rational, then R satisfies the vNM axioms. Hence, I think it is sufficient that the vNM axioms work as principles of rationality; they don't need to describe actual human behaviour in this context.

Regarding your two quick thoughts on time-discounting: yes, I basically agree. However, I also want to note that it is a bit unclear how to ground discounting in LELO, because doing so requires that one specifies the order in which lives are concatenated, and I am not sure there is a non-arbitrary way of doing so.

Thanks for engaging!

Cleo Nardo (1y)

If we should have preference ordering R, then R is rational (morality presumably does not require irrationality).

I think human behaviour is straight-up irrational, but I want to specify principles of social choice nonetheless. i.e. the motivation is to resolve carlsmith’s On the limits of idealized values.

now, if human behaviour is irrational (e.g. intransitive, incomplete, nonconsequentialist, imprudent, biased, etc), then my social planner (following LELO, or other aggregative principles) will be similarly irrational. this is pretty rough for aggregativism; I list it as the most severe objection, in section 3.1.

but to the extent that human behaviour is irrational, then the utilitarian principles (total, average, Rawls’ maximin) have a pretty rough time also, because they appeal to a personal utility function to add/average/minimise. idk where they get that if humans are irrational.

maybe you the utilitarian can say: “well, first we apply some idealisation procedure to human behaviour, to remove the irrationalities, and then extract a personal utility function, and then maximise the sum/average/minimum of the personal utility function”

but, if provided with a reasonable idealisation procedure, the aggregativist can play the same move: “well, first we apply the idealisation procedure to human behaviour, to remove the irrationalities, and then run LELO/HL/ROI using that idealised model of human behaviour.” i discuss this move in 3.2, but i’m wary about it. like, how alien is this idealised human? why does it have any moral authority? what if it’s just ‘gone off the rails’ so to speak?

it is a bit unclear how to ground discounting in LELO, because doing so requires that one specifies the order in which lives are concatenated and I am not sure there is a non-arbitrary way of doing so.

macaskill orders the population by birth date. this seems non-arbitrary-ish(?);^[1] it gives the right result wrt our permutation-dependent values; and anything else is subject to egyptologist objections, where to determine whether we should choose future A over B, we need to first check the population density of ancient egypt.

Loren sidesteps the order-dependence of LELO with (imo) an unrealistically strong rationality condition.

^[1] if you’re worried about relativistic effects then use the reference frame of the social planner

Gustav Alexandrie (1y)

Thanks!

i’m wary about it. like, how alien is this idealised human? why does it have any moral authority?

I don't have great answers to these metaethical questions. Conditional on normative realism, it seems plausible to me that first-order normative views must satisfy the vNM axioms. Conditional on normative antirealism, I agree it is less clear that first-order normative views must satisfy the vNM axioms, but this is just a special case of it being hard to justify any normative views under normative antirealism.

In any case, I suspect that we are close to reaching bedrock in this discussion, so perhaps this is a good place to end the discussion.

MichaelStJules (1y)

Harsanyi's theorem has also been generalized in various ways without the rationality axioms; see McCarthy et al., 2020 https://doi.org/10.1016/j.jmateco.2020.01.001. But it still assumes something similar to but weaker than the independence axiom, which in my view is hard to motivate separately.

Elliott Thornley (EJT) (1y)

Another nice article. Gustav says most of the things that I wanted to say. A couple other things:

I think LELO with discounting is going to violate Pareto. Suppose that by default Amy is going to be born first with welfare 98 and then Bobby is going to be born with welfare 100. Suppose that you can do something which harms Amy (so her welfare is 97) and harms Bobby (so his welfare is 99). But also suppose that this harming switches the birth order: now Bobby is born first and Amy is born later. Given the right discount-rate, LELO will advocate doing the harming, because it means making good lives happen earlier. Is that right?
I think a minor reframing of Harsanyi's veil-of-ignorance makes it more compelling as an argument for utilitarianism. Not only is it the case that doing the utilitarian thing maximises the decision-maker's expected welfare behind the veil-of-ignorance; doing the utilitarian thing maximises everyone's expected welfare behind the veil-of-ignorance. So insofar as aggregativism departs from utilitarianism, it means doing what would be worse in expectation for everyone behind a veil-of-ignorance.
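To make the first point concrete, here is a minimal sketch of the Amy/Bobby case. It assumes a simplified model that is not in the post: each life counts as one unit of concatenated time, so the k-th life born is weighted by `discount**k`. The function names are hypothetical.

```python
def lelo_value(welfares_in_birth_order, discount):
    """Discounted value of the concatenated life; the k-th life born gets weight discount**k."""
    return sum(w * discount**k for k, w in enumerate(welfares_in_birth_order))

default = [98, 100]  # Amy is born first (98), then Bobby (100)
harmed = [99, 97]    # both are harmed and the order flips: Bobby (99), then Amy (97)

def prefers_harm(discount):
    """True if discounted LELO ranks the Pareto-dominated outcome above the default."""
    return lelo_value(harmed, discount) > lelo_value(default, discount)

for d in (0.3, 0.5, 0.9):
    print(d, prefers_harm(d))
```

Under this model the harmed outcome wins exactly when 99 + 97d > 98 + 100d, i.e. when the per-life discount factor d is below 1/3. So the violation needs fairly steep discounting with these particular numbers.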

Cleo Nardo (1y)

Is that right?

Yep, Pareto is violated, though how severely it's violated is limited by human psychology.

For example, in your Amy/Bobby scenario, would I desire a lifetime of 98 utils then 100 utils over a lifetime with 99 utils then 97 utils? Maybe, idk. I don't really understand these abstract numbers very much, which is part of the motivation for replacing them entirely with personal outcomes. But I can certainly imagine I'd take some offer like this, violating Pareto. On the plus side, humans are not so imprudent as to accept extreme suffering just to reshuffle different experiences in their life.

Secondly, recall that the model of human behaviour is a free variable in the theory. So to ensure higher conformity to pareto, we could…

1. Use the behaviour of someone with high delayed gratification.
2. Train the model (if it's implemented as a neural network) to increase delayed gratification.
3. Remove the permutation-dependence using some idealisation procedure.

But these techniques (1 < 2 < 3) will result in increasingly "alien" optimisers. So there's a trade-off between (1) avoiding human irrationalities and (2) robustness to 'going off the rails'. (See Section 3.1.) I see realistic typical human behaviour on one extreme of the tradeoff, and argmax on the other.

cubefox (1y)

Is it right to say that aggregativism is, similar to total and average utilitarianism, incompatible with the procreation asymmetry, unlike some forms of person affecting utilitarianism?

Cleo Nardo (1y)

which principles of social justice agree with (i) adding bad lives is bad, but disagree with (ii) adding good lives is good?

total utilitarianism agrees with both (i) and (ii).
average utilitarianism can agree with any of these combinations: both (i) and (ii); neither (i) nor (ii); only (i) and not (ii). which combination holds depends on the existing average utility, because average utilitarianism obliges creating lives above the existing average and forbids creating lives below the existing average.
Rawls' difference principle (maximise minimum utility) can agree with either of these combinations: neither (i) nor (ii); only (i) and not (ii). this is because adding lives is never good (bc it could never increase minimum utility), and adding bad lives is bad iff those lives are below-minimum.
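A small sketch of how the three principles judge adding one life. It assumes welfare is a single number per person and that a life counts as good when its welfare is above 0 and bad when below; the populations and function names are made up for illustration.

```python
def total(welfares):
    return sum(welfares)

def average(welfares):
    return sum(welfares) / len(welfares)

def maximin(welfares):
    return min(welfares)

def verdict(principle, population, new_life):
    """Compare the principle's score before and after adding one life."""
    before = principle(population)
    after = principle(population + [new_life])
    if after > before:
        return "good"
    if after < before:
        return "bad"
    return "neutral"

well_off = [5, 10]  # hypothetical population with a high average
modest = [1, 1]     # hypothetical population with a low positive average

for name, principle in [("total", total), ("average", average), ("maximin", maximin)]:
    print(name, verdict(principle, well_off, 3), verdict(principle, well_off, -3))
```

With the well-off population, average utilitarianism calls both additions bad, which is the "only (i) and not (ii)" combination. With the modest population it calls the good life good and the bad life bad, matching total utilitarianism.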

so you're right that utilitarianism doesn't match those intuitions. none of the three principles discussed reliably endorse (i) and reject (ii).

now consider aggregativism. you'll get asymmetry between (i) and (ii) depending on the social zeta function mapping social outcomes to personal outcomes, and on the model of self-interested human behaviour.

let's examine LELO (i.e. the social zeta function maps a social outcome to the concatenation of all individuals' lives), where our model of self-interested human behaviour is Alice (described below).

suppose Alice expects to live 80 years of comfortable, fulfilling life.

would she pay to live 85 years instead, with 5 of those years in ecstatic joy? probably.
would she pay to avoid living 85 years instead, with 5 of those years in horrendous torture? probably.

there’s probably some asymmetry in Alice’s willingness to pay. i think humans are somewhat more misery-averse than joy-seeking. it’s not a 50-50 symmetry, nor a 0-100 asymmetry, maybe a 30-70 asymmetry? idk, this is an empirical psychological fact.

anyway, the aggregative principle (generated by LELO+Alice) says that the social planner should have the same attitudes towards social outcomes that Alice has towards the concatenation of lives in those social outcomes. so the social planner would pay to add joyful lives, and pay to avoid adding miserable lives, and there should be exactly as much willingness-to-pay asymmetry as Alice (our self-interested human) exhibits.

^{^}

The term LELO originates in Loren Fryxell (2024), "XU", which is where I first encountered the concept. I think Fryxell offers the first formal treatment of the LELO principle. MacAskill (2022), "What We Owe the Future", says this thought experiment comes from Georgia Ray (2018), “The Funnel of Human Experience”, and that the short story Andy Weir (2009), "The Egg", shares a similar premise. Roger Crisp attributes LELO to C.I. Lewis, which would predate both Ray and Weir, but I haven't traced that reference.

^{^}

John C. Harsanyi "Cardinal Utility in Welfare Economics and in the Theory of Risk-Taking" (1953) and "Cardinal Welfare, Individualistic Ethics, and Interpersonal Comparisons of Utility" (1955)

^{^}

See John Rawls (1971), "A Theory of Justice" and Samuel Freeman (2023) "Original Position".

^{^}

Bernard Williams discusses the notion of "unthinkable" options in his critique of utilitarianism.

It could be a feature of a man’s moral outlook that he regarded certain courses of action as unthinkable, in the sense that he would not entertain the idea of doing them: and the witness to that might, in many cases, be that they simply would not come into his head. Entertaining certain alternatives, regarding them indeed as alternatives, is itself something that he regards as dishonourable or morally absurd.
(Bernard Williams, 1973, "Utilitarianism: For and Against").

This is distinct from options that are ruled out by moral side constraints or physical impossibility. As Williams puts it, "it is perfectly consistent, and it might be thought a mark of sense, to believe, while not being a consequentialist, that there was no type of action which satisfied [the condition of being morally prohibited whatever the consequences]" (Williams, 1973).

^{^}

On the flip-side, even if the social context $f : X \to S$ is fixed, we can nonetheless concoct for any option $x \in X$ a utility function $u : S \to R$ such that $x$ is permitted by the utilitarian principle $f \mapsto \operatorname{argmax}_{X} (u \circ f)$. That is, any option will be permitted in any social context, provided the social utility function is sufficiently misspecified, no matter how ludicrous that choice would be.

To prove this, define $u : S \to R$ as the indicator function for $\{f(x)\} \subseteq S$:

$u(s) = \begin{cases} 1 & \text{if } s = f(x) \\ 0 & \text{if } s \neq f(x) \end{cases}$

This $u$ assigns utility 1 to the social outcome of choosing $x$, and 0 to all other outcomes, so a utilitarian planner maximizing this $u$ would be permitted to choose $x$, or any other option that leads to the same outcome as $x$. That is, $x' \in \operatorname{argmax}_{X} (u \circ f) \iff f(x) = f(x')$.
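The construction can be checked on a toy case. This is only a sketch: the options, the outcomes, and the helper names are invented for illustration, and the social context $f$ is represented as a plain dictionary.

```python
def argmax_set(options, score):
    """All options that attain the maximum score."""
    best = max(score(x) for x in options)
    return {x for x in options if score(x) == best}

def indicator_utility(f, x):
    """Utility function that is 1 on the outcome f(x) and 0 on every other outcome."""
    target = f(x)
    return lambda s: 1 if s == target else 0

def permitted(f, options, u):
    """Utilitarian principle: the options maximizing u composed with f."""
    return argmax_set(options, lambda o: u(f(o)))

# Hypothetical social context (assumption): two options share an outcome.
outcomes = {
    "fund clinics": "healthy city",
    "fund hospitals": "healthy city",
    "burn the library": "no library",
}
options = list(outcomes)
f = outcomes.get

for x in options:
    u = indicator_utility(f, x)
    print(x, "->", sorted(permitted(f, options, u)))
```

For every option, the matching indicator utility permits that option together with any option that leads to the same outcome, and nothing else.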

^{^}

Determining whether a statement $θ$ has a proof that is less than $k$ bits long is an NP-complete problem. Even $k = 1000$ would exceed the computational resources of the observable universe.

Firstly, this problem belongs to the complexity class NP because, given a proof $x$ that is less than $k$ bits in length, it is possible to verify each step of the proof to ensure that it adheres to the rules of Peano Arithmetic (PA). The verification process can be completed in polynomial time with respect to the size of the proof.

Moreover, this problem is NP-hard, as it is possible to reduce the Boolean Satisfiability Problem (SAT), which is known to be NP-hard, to our problem. To demonstrate this reduction, consider an instance of SAT with variables $x_{1}, \dots, x_{n}$ and a Boolean formula $ϕ (x_{1}, \dots, x_{n})$ . We can construct a statement $θ$ in the following manner:

$\exists x_{1}, \dots, x_{n} : x_{1} \leq 1 \land \dots \land x_{n} \leq 1 \land ϕ (x_{1}, \dots, x_{n})$

If the original Boolean formula is satisfiable, then this newly constructed formula is provable with a proof that requires only a polynomial number of bits. Furthermore, this reduction can be performed in polynomial time.

Our problem is both in NP and NP-hard, and hence is NP-complete.
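The gap between checking a witness and finding one can be illustrated with SAT itself. The sketch below is a stand-in only: it checks Boolean assignments, not Peano Arithmetic proofs, and the clause encoding (signed integers, `3` for $x_3$ and `-3` for its negation) is a convention chosen for this example.

```python
from itertools import product

def satisfies(clauses, assignment):
    """Check a candidate assignment in time linear in the size of the formula."""
    return all(
        any(assignment[abs(lit) - 1] == (lit > 0) for lit in clause)
        for clause in clauses
    )

def brute_force_sat(clauses, n):
    """Naive search over all 2**n assignments; returns a witness or None."""
    for assignment in product([False, True], repeat=n):
        if satisfies(clauses, assignment):
            return assignment
    return None

# (x1 or not x2) and (x2 or x3) and (not x1 or not x3)
clauses = [[1, -2], [2, 3], [-1, -3]]
witness = brute_force_sat(clauses, n=3)
print(witness, satisfies(clauses, witness))
```

Verification stays cheap as the formula grows, while this naive search doubles with each extra variable. Cleverer solvers exist, but no known algorithm avoids exponential worst-case time.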

^{^}

Strictly speaking, the claim that a social planner cannot implement the utilitarian principle in this scenario relies on two key assumptions:

(1) The social planner's decision-making process is instantiated by a physical system, such as a machine or computer, that exists in our universe and is bound by the laws of physics.

(2) No physically realizable machine can efficiently solve NP-complete problems. In other words, the time required to find a solution grows exponentially with the size of the problem, quickly becoming infeasible for even moderately large instances.

For a discussion of (1), see Scott Garrabrant and Abram Demski's 2018 article "Embedded Agency". For a compelling defence of (2), see Scott Aaronson's 2005 paper "NP-complete Problems and Physical Reality".

^{^}

See Bales, R. E. (1971), "Act utilitarianism: Account of right-making characteristics or decision making procedure", which "stress[es] the importance of maintaining a sharp distinction between (a) decision-making procedures, and (b) accounts of what makes right acts right."

^{^}

We've seen how utilitarianism demands superhuman computational resources from the social planner, in contrast to aggregativism. As I demonstrate below, a similar point can be made about noncomputational resources.

Most humans cannot, I presume, jump exactly 45 cm. It's practically impossible for a typical human to reliably distinguish between jumping 45 cm and 46 cm, as the difference is too small to accurately control or perceive. Hence, in some circumstances, a human might either jump 45 cm or jump 46 cm; in other circumstances, they will surely do neither; but there is no circumstance where a human might jump 45 cm but surely won't jump 46 cm.

Formally, let $X = \{0\text{ cm}, 1\text{ cm}, \dots, 60\text{ cm}\}$ denote all the possible heights that a human might jump. To say that the human cannot distinguish between 45 cm and 46 cm, we mean that $45\text{ cm} \in Π(g) \iff 46\text{ cm} \in Π(g)$ for all personal contexts $g : X \to P$.

Now, the aggregative principles satisfy a property called 'indistinguishable-options consistency'. Namely, if the aggregative principle permits (resp. forbids) jumping 45 cm in some social context, then it must also permit (resp. forbid) jumping 46 cm in that same context. The social planner is never permitted to jump 45 cm while forbidden to jump 46 cm, nor vice-versa.

More generally, if $x_{1} \in Π (g) ⟺ x_{2} \in Π (g)$ for all personal contexts $g : X \to P$ , then $x_{1} \in Π (ζ \circ f) ⟺ x_{2} \in Π (ζ \circ f)$ for all social contexts $f : X \to S$ .

In contrast, utilitarian principles violate indistinguishable-options consistency. If $u : S \to R$ is any non-constant utility function, with $u (s^{-}) < u (s^{+})$ , then we can define the social context $f : X \to S$ as follows:

$f(x) = \begin{cases} s^{+} & \text{if } x = 45\text{ cm} \\ s^{-} & \text{if } x \neq 45\text{ cm} \end{cases}$

The utilitarian planner maximizing $u$ would be obligated to jump exactly 45 cm, and forbidden to jump 46 cm, even though distinguishing between these two options is physically impossible.
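Here is a toy version of the contrast. Everything in it is an assumption made for illustration: the human model $Π$ is a made-up policy with coarse motor control that treats neighbouring heights as one "bucket", and personal outcomes are represented by numbers standing in for how much the human values them.

```python
HEIGHTS = range(0, 61)  # jump heights in cm

def bucket(x):
    """Hypothetical coarse motor control: 45 cm and 46 cm fall in the same bucket."""
    return (x + 1) // 2

def human_policy(g, options):
    """Toy model of Pi: permits every option in a bucket that contains a best personal outcome."""
    best = max(g(x) for x in options)
    best_buckets = {bucket(x) for x in options if g(x) == best}
    return {x for x in options if bucket(x) in best_buckets}

def argmax_set(score, options):
    best = max(score(x) for x in options)
    return {x for x in options if score(x) == best}

def f(x):
    """Social context from the text: only jumping exactly 45 cm yields s+."""
    return "s+" if x == 45 else "s-"

zeta = {"s+": 1, "s-": 0}.get  # toy social zeta function into numeric personal outcomes
u = {"s+": 1, "s-": 0}.get     # non-constant social utility function

aggregative = human_policy(lambda x: zeta(f(x)), HEIGHTS)
utilitarian = argmax_set(lambda x: u(f(x)), HEIGHTS)
print(sorted(aggregative), sorted(utilitarian))
```

Because the toy $Π$ can only respond to buckets, the aggregative principle $f \mapsto Π(ζ \circ f)$ permits 45 cm and 46 cm together, whereas argmax singles out 45 cm alone.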

^{^}

Philosophers like Bernard Williams (1981) rejected the codification of ethics into simple theories such as Kantianism or utilitarianism. “There cannot be any very interesting, tidy or self-contained theory of what morality is… nor… can there be an ethical theory, in the sense of a philosophical structure which, together with some degree of empirical fact, will yield a decision procedure for moral reasoning.”

^{^}

See Eliezer Yudkowsky on The Hidden Complexity of Wishes, Not for the Sake of Happiness (Alone), and Fake Utility Functions.

^{^}

I've been persuaded by Brian Tomasik's writings, in particular "The Horror of Suffering" (2017) and "Preventing Extreme Suffering Has Moral Priority" (2016, video presentation, warning: disturbing content).

^{^}

In "Three Types of Negative Utilitarianism", Brian Tomasik uses a LELO-esque argument to support lexical-threshold negative utilitarianism. This position states that a small minority facing extreme suffering cannot be compensated by a minuscule benefit to a sufficiently large majority. He justifies this on the grounds that a self-interested human wouldn't desire the concatenation of those lives:

A day in hell could not be outweighed by happiness:
I would not accept a day in hell in exchange for any number of days in heaven. Here I'm thinking of hell as, for example, drowning in lava but with my pain mechanisms remaining intact for the whole day. Heaven just wouldn't be worth it, no matter how long. It seems like there's no comparison.

^{^}

To formalize this, let $Φ : (X \to R) \to P(X)$ be any $R$-choice principle and let $(R, \prec)$ be any binary relation over the payoffs $R$. We say that $Φ$ respects $\prec$ if, for all contexts $f : X \to R$ and all options $x \in Φ(f)$, there exists no $x' \in X$ such that $f(x) \prec f(x')$. In plain terms: if $\prec$ represents strict preference, then $Φ$ never permits choosing a strictly dispreferred option. Moreover, we say that $Φ$ has transitive preferences if it respects some transitive relation $\prec$.

It's straightforward to show that $\operatorname{argmax}_{X} : (X \to R) \to P(X)$ has transitive preferences: it respects the usual 'less than' ordering $(R, <)$ on real numbers, which is transitive. Furthermore, if $u : S \to R$ is any function and the $R$-choice principle $Φ$ has transitive preferences, then so does the composite principle $Ψ : (X \to S) \to P(X)$ defined by $Ψ(f) := Φ(u \circ f)$. Indeed, if $Φ$ respects a relation $(R, \prec)$, then $Ψ$ respects the relation $(S, \dot{\prec})$ defined by $s \mathrel{\dot{\prec}} s' \iff u(s) \prec u(s')$, and $\dot{\prec}$ is transitive if $\prec$ is. Combining these observations: since $\operatorname{argmax}_{X}$ has transitive preferences, so does a social planner following any utilitarian principle $f \mapsto \operatorname{argmax}_{X} (u \circ f)$.

However, the human behaviour model $Π : (X \to P) \to P (X)$ may lack transitive preferences. If so, then a social planner following the aggregative principle $f \mapsto Π (ζ \circ f)$ , for some social zeta function $ζ : S \to P$ , may also lack transitive preferences. This exposes the planner to 'ethical money pumps': a sequence of choices that leads to a strictly worse outcome than where they started, by exploiting their intransitive preferences. For example, the planner might trade policy A for B, B for C, and C back to A, each time accepting a small 'ethical cost' that compounds to a large overall loss.
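A minimal sketch of such a money pump, assuming a planner with the cyclic strict preferences B over A, C over B, and A over C, who pays a fixed fee for every swap they strictly prefer. The policies, the fee, and the function names are hypothetical.

```python
# Pairs are (better, worse): B beats A, C beats B, A beats C.
strictly_prefers = {("B", "A"), ("C", "B"), ("A", "C")}

def is_transitive(relation):
    """True if (a, b) and (b, d) in the relation always imply (a, d) in the relation."""
    return all(
        (a, d) in relation
        for (a, b) in relation
        for (c, d) in relation
        if b == c
    )

def run_money_pump(start, offers, fee):
    """Accept each offered policy that is strictly preferred to the current one, paying the fee."""
    holding, paid = start, 0
    for offer in offers:
        if (offer, holding) in strictly_prefers:
            holding, paid = offer, paid + fee
    return holding, paid

holding, paid = run_money_pump("A", ["B", "C", "A"], fee=1)
print(holding, paid)
```

The planner ends up holding policy A again after paying three fees, so each individually attractive trade adds up to a pure loss. Repeating the cycle makes the loss as large as you like.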

^{^}

See e.g. "Affective Forecasting" (Gilbert and Wilson, 2003)


1. Introduction

2. Advantages of aggregativism

2.1. Avoids excessive permissions

2.2. Avoids excessive obligations

2.3. Computationally tractable

2.4. Retains utilitarian spirit

2.4. Lower description complexity

2.6. Avoids counterintuitive implications

2.5.1. Repugnant conclusion

2.5.2. Extreme suffering

2.7. More concrete and relatable

3. Objections to aggregativism

3.1. Inherits human irrationality

3.2. Requires model of human behaviour

3. Conclusion