LESSWRONG
is fundraising!
LW

Aggregative principles approximate utilitarian principles — LessWrong

1. Introduction

Utilitarianism is the view that a social planner should choose options which maximise the social utility of the resulting social outcome. The central object in utilitarianism is the social utility function which assigns a real value $u (s) \in R$ to each social outcome $s \in S$ . This function typically involves variables such as the well-being, preferences, and mental states of individuals, distributional factors like inequality, and other relevant factors such as justice, social cohesion, and freedoms. Utilitarianism is a broad class of social choice principles, one corresponding to each function $u : S \to R$ .

In my previous article, I introduced aggregative principles, which state that a social planner should make decisions as if they will face the aggregated personal outcomes of every individual in the population. The central object in aggregativism is the function $ζ : S \to P$ , represented with the the greek letter zeta, which assigns a personal outcome $ζ (s) \in P$ to each social outcome $s \in S$ . This function typically aggregates the collection of personal outcomes facing the entire population into a single personal outcome. Aggregativism is a broad class of social choice principles, one corresponding to each function $ζ : S \to P$ .

We examined three well-known aggregative principles:

Live Every Life Once (LELO), where $ζ (s)$ is the concatenation of every individual's life.
Harsanyi's Lottery (HL), where $ζ (s)$ is a uniform lottery over every individual's life.
Rawls' Original Position (ROI), where $ζ (s)$ is Knightian uncertainty over every individual's life.

I'm interested in aggregative principles because they avoid many theoretical pitfalls of utilitarian principles. Unlike utilitarianism, aggregativism doesn't require specifying a social welfare function, which is notoriously intractable. Moreover, it seems less prone to counterintuitive conclusions such as the repugnant conclusion or the violation of moral side constraints.^[1] In this article, I will show that, under natural conditions of human rationality, aggregative principles approximate utilitarian principles. Therefore, even though aggregativism avoids these theoretical pitfalls, we should nonetheless expect aggregativism to generate roughly-utilitarian recommendations in practical social contexts, and thereby retain the most appealing insights from utilitarianism.

The rest of the article is organized as follows. Section 2 formalises social choice principles as functions of type $(X \to S) \to P (X)$ . Section 3 demonstrates the structural similarity between two strategies to specifying such principles, namely the aggregative and utilitarian strategies. Section 4 proves that under natural conditions about human rationality, the aggregative and utilitarian principles are mathematically equivalent. This theorem is the key contribution of the article. Sections 5, 6, and 7 applies the theorem to LELO, HL, and ROI respectively.

2. Social choice principles

Suppose you are a social planner choosing from a set of options $X = {x_{1}, \dots, x_{n}}$ . The set $X$ might be the set of available tax rates, environmental policies, military actions, political strategies, neural network parameters, or whatever else is being chosen by the social planner. Now, your choice will presumably depend on the social consequences of the options, even if you also consider non-consequentialist factors. We can model the social consequences with a function $f : X \to S$ , where $S$ is the set of social outcomes. In particular, if you choose an option $x \in X$ , then the resulting social outcome would be $f (x) \in S$ .

We call $f : X \to S$ the "social context". As a concrete example, suppose the options are different tax rates (say 10%, 20%, and 30%), and the social outcomes are characterized by variables like total tax revenue, income inequality, and unemployment rate. Then the social context is the function $f : [0, 1] \to S$ which maps each tax rate $x \in [0, 1]$ to the resulting values of these social outcome variables.

A social choice principle should say, for each social context, which options are acceptable. Formally, a social choice principle is characterised by some function $Ψ : (X \to S) \to P (X)$ , which takes a social context $f : X \to S$ as input and returns a subset of the options $Ψ (f) \subseteq X$ as output. Specifically, $Ψ (f) \subseteq X$ consists of exactly those options which satisfy the principle in the social context $f : X \to S$ .

Note that $(X \to S)$ denotes the set of all functions from $X$ to $S$ , so $Ψ$ is a higher-order function, meaning it receives another function as input. Additionally $P (X)$ denotes the powerset of $X$ , i.e. the set of subsets of $X$ . We use the powerset $P$ to allow for the fact that multiple options may satisfy a principle: if a principle $Ψ$ permits only options $x_{1}$ and $x_{2}$ in a context $f : X \to S$ then $Ψ (f) = {x_{1}, x_{2}}$ . Finally, the powerset $P (X)$ includes the empty set $\emptyset$ , which allows for the case $Ψ (f) = \emptyset$ . Informally, $Ψ (f) = \emptyset$ means that the social planner, following principle $Ψ$ and faced with context $f : X \to S$ , has no acceptable options, which allows for principles that aren't universally satisfiable.

Here are some examples of social choice principle:

Context-independence
Let $X_{0} \subseteq X$ be any fixed subset of the options, and consider the principle $Ψ : f \mapsto X_{0}$ , which returns $X_{0}$ regardless of the input $f$ . Whether an option $x \in X$ satisfies this principle depends only on whether $x \in X_{0}$ , and is independent of the social context. At one extreme, there's a trivial principle $Ψ : f \mapsto X$ which never constrains the social planner, and at the other extreme, there's a principle $Ψ : f \mapsto \emptyset$ which is always unsatisfiable. When $X_{0} = {x_{0}}$ consists of a single option, the principle $Ψ$ states that the social planner must choose $x_{0} \in X$ regardless of the context.^[2]
Targets
Let $S_{target} \subseteq S$ be any fixed subset of the social outcomes, whose elements we'll call targets. There is a principle which says that the social planner should choose an option which achieves a target. This principle is characterised by the function $Ψ : f \mapsto f^{- 1} (S_{target})$ , where $f^{- 1} (S_{target}) \subseteq X$ denotes the preimage of $S_{target}$ , i.e. $f^{- 1} (S_{target}) := {x \in X : f (x) \in S_{target}}$ . Note that if $f (x) \notin S_{target}$ for all $x \in X$ then $Ψ (f) = \emptyset$ , i.e. if no option would achieve a target then the principle says all options are unacceptable.
Impact minimisation
We can also characterise more unusual principles as functions $(X \to S) \to P (X)$ . For example, consider the principle that says a social planner should choose an option if most other options would've led to the same social outcome. Intuitively, this captures some notion of impact minimisation. Formally, this principle is characterised by the function $Ψ (f) := {x \in X ∣ # [x]_{f} > \frac{# X}{2}}$ , where $# S$ denote the cardinality of a set and $[x]_{f} \subseteq X$ denotes the $f$ -equivalency class of $x \in X$ , i.e. $[x]_{f} = {y \in X ∣ f (x) = f (y)}$ .

These examples illustrate the diversity of conceivable social choice principles. The key point is that they can all be represented by functions $Ψ : (X \to S) \to P (X)$ . I've found this a productive way to think about principles of decision-making, and agency more generally.^[3] Finding compelling social choice principle is the central problem in social ethics, and different normative frameworks will propose different principles.

3. Two strategies for specifying principles

3.1. Utilitarian principles

Utilitarianism and aggregativism are two strategies for specifying a social choice principle $Ψ : (X \to S) \to P (X)$ . The utilitarian strategy specifies a social choice principle using two components:

A social utility function $u : S \to R$ that assigns a real-valued utility $u (s) \in R$ to each social outcome $s \in S$ .
The ${argmax}_{X}$ operator, which maps a real-valued function $r : X \to R$ to the set of points that maximize it. Formally, ${argmax}_{X} (r) := {x \in X ∣ \forall x^{'} \in X : r (x^{'}) \leq r (x)}$ . Note that ${argmax}_{X} (r)$ is a subset of $X$ , possibly containing multiple points in case of ties, or no points in the case of unbounded functions.

Given the social utility function $u : S \to R$ and the operator ${argmax}_{X} : (X \to R) \to P (X)$ , the utilitarian principle is defined by $Ψ (f) := {argmax}_{X} (u \circ f)$ . Note that if $f : X \to S$ is the social context, then the composition $u \circ f : X \to R$ calculates the social utility resulting from each option, thereby providing a real-valued function $r : X \to R$ . The utilitarian principle $f \mapsto {argmax}_{X} (u \circ f)$ says that the social planner should choose an option that maximizes this function.

As a simplistic example, consider a social utility function $u : S \to R$ that measures the gross world product of a social outcome. The resulting utilitarian principle $f \mapsto {argmax}_{X} (u \circ f)$ would oblige maximizing gross world product. In practice, utilitarians typically endorse more nuanced utility functions that account for factors like individual well-being, fairness, and existential risk.

3.2. Aggregative principles

Aggregativism offers an alternative strategy to specifying social choice principle. Like utilitarianism, it defines the principle $Ψ : (X \to S) \to P (X)$ using two components:

A function $ζ : S \to P$ that assigns a personal outcome $ζ (s) \in P$ to each social outcome $s \in S$ . We call $ζ$ the social zeta function.
A model of a self-interested human, characterised by a function $Π : (X \to P) \to P (X)$ , explained below.

The function $Π : (X \to P) \to P (X)$ should model a self-interested human in the following sense: for each personal context $g : X \to P$ the subset $Π (f) \subseteq X$ should contain the options that the hypothetical human might choose in that context. A personal context $g : X \to P$ is an assignment of a personal outcome to each of the options, analogously to a social context. For example, if $g : X \to P$ maps some options to finding a dollar and the remaining options to drowning in a swamp, then presumably $Π (g)$ contains only the former options.

Given the social zeta function $ζ : S \to P$ and a model of self-interested human $Π : (X \to P) \to P (X)$ , the aggregationist principle is defined by $Ψ (f) := Π (ζ \circ f) .$ Note that if $f : X \to S$ is the social context, then the composition $ζ \circ f : X \to P$ calculates the hypothetical personal outcome resulting from each option, thereby providing a personal context $g : X \to P$ . The aggregative principle $f \mapsto Π (ζ \circ f)$ says that the social planner should choose an option a self-interested human might choose in this personal context.

For example, consider a social zeta function $ζ : S \to P$ that maps each social outcome $s$ to the personal outcome of living every individual's life in sequence, starting with the earliest-born humans. The resulting aggregative principle $f \mapsto Π (ζ \circ f)$ obliges affecting society such that living the concatenated lives is personally desirable.

3.3. Structural similarity between the two strategies

This comparison reveals the structural similarity between utilitarianism and aggregativism. Both strategies specify the principle $Ψ$ using a two components:

A function mapping social outcomes to a different space, either $R$ (in the case of utilitarianism) or $P$ (in the case of aggregativism).
A choice principle in that different space, either maximization (in the case of utilitarianism) or a model of a self-interested human (in the case of aggregativism).

Both $Π$ , the model of a self-interested human, and the ${argmax}_{X}$ operator are choice principles: $Π$ is a personal choice principle, it 'chooses' one the options based on their associated personal outcomes, and ${argmax}_{X}$ is a real choice principle, it 'chooses' one of the options based on their associated real value. (Of course, ${argmax}_{X}$ doesn’t literally choose anything, it’s simply a mathematical operator, but so too is $Π$ .)

In general, for any space $R$ , let's say an $R$ -context is any function with type-signature $X \to R$ , and an $R$ -choice principle is any function with type-signature $(X \to R) \to P (X)$ . That is, an $R$ -choice principle $Φ$ , when provided with an $R$ -context $r : X \to R$ , identifies some subset $Φ (f) \subseteq X$ of the options which are 'acceptable'.

How might one use an $R$ -choice principle $Φ$ to specify a social choice principle $Ψ$ ? Well, what's needed is some function $σ : S \to R$ from social outcomes to elements of $R$ . This function $σ$ will extends any social context $f : X \to S$ to an $R$ -context $σ \circ f : X \to R$ , which can then be provided to the $R$ -choice principle to identify the acceptable options. Formally, $Ψ : f \mapsto Φ (σ \circ f)$ . This is how utilitarianism and aggregativism succeed in defining social choice principles. The key difference is that utilitarianism uses real numbers while aggregativism uses personal outcomes.

4. Equivalence between aggregativism and utilitarianism

4.1. Three conditions for equivalence

Despite their differences, there are natural conditions under which the utilitarian and aggregative principles are equivalent, in the sense that a social planner is permitted to choice an option, under the utilitarian principle, if and only if they are permitted to choice the same option under the aggregative principle.

Formally, let $Ψ_{u}$ denote the utilitarian principle $Ψ_{u} : f \mapsto {argmax}_{X} (u \circ f)$ and let $Ψ_{a}$ denote the aggregative principle $Ψ_{a} : f \mapsto Π (ζ \circ f)$ ; under what conditions does $Ψ_{u} (f) = Ψ_{a} (f)$ for all social contexts $f : X \to S$ ?

In the previous article, we showed that LELO, HL, and ROI each employ social zeta functions which aggregates the personal outcomes across all individuals in the population. Formally, $ζ (s) := α (γ (-, s)^{M} (π))$ where $I$ is a fixed set of individuals; $γ : I \times S \to P$ is a fixed function mapping a social outcome $s \in S$ and an individual $i \in I$ to the personal outcome $γ (i, s) \in P$ that $i$ faces when $s$ obtains; $M$ is the monad capturing a notion of 'collection'; $π \in M (I)$ be a fixed collection of individuals impartially representing the population; and $α : M (P) \to P$ is an $M$ -algebra specifying how to aggregate collections of personal outcomes into a single personal outcome.

Supposing $ζ$ has the general form above, and the three conditions below are satisfied, then the utilitarian principle $Ψ_{u}$ and the aggregative principle $Ψ_{a}$ are mathematically equivalent:

A self-interested human maximises personal utility.

Formally, the first condition states that the function $Π : (X \to P) \to P (X)$ has the form $Π (f) = {argmax}_{X} (v \circ f)$ for some personal utility function $v : P \to R$ which assigns a real-valued utility $v (p) \in R$ to each personal outcome $p \in P$ . Even by itself, this condition is quite strong. It implies that if, for some personal context $f : X \to P$ , two options $x_{1}$ and $x_{2}$ result in the same personal outcome, i.e. $f (x_{1}) = f (x_{2})$ , then the human might choose $x_{1}$ if and only if they might choose $x_{2}$ . Hence, this condition precludes nonconsequential considerations.

Let's call this condition "Humans Maximise Personal Utility" (HMPU).
Personal utility is 'rational', in a technical sense defined below.

Let $α : M (P) \to P$ denote an $M$ -algebra on personal outcomes, describing how to aggregate a collection of personal outcomes into a single personal outcome. Let $β : M (R) \to R$ denote an $M$ -algebra on real numbers, describing how to aggregate a collection of real numbers into a single real number. The second condition states that $v \circ α = β \circ v^{M}$ . Informally, this condition means that the personal utility of an aggregate of personal outcomes is the aggregate of the personal utilities of each personal outcome being aggregated. In mathematical jargon, the personal utility function $v : P \to R$ must be a homomorphism between the $M$ -algebras $(P, α)$ and $(R, β)$ , which means it preserves the algebraic structure on $P$ and $R$ .

Let's call this condition "Rationality of Personal Utility" (RUP).
Social utility is the aggregate of personal utilities across all individual in the population.

Formally, the third condition states that $u (s) = β ((v \circ γ) (-, s)^{M} (π))$ , where $v : P \to R$ is the personal utility function introduced in HMPU, $γ : I \times S \to P$ is the function assigning personal outcomes to each individual in each social outcome, $π \in M (I)$ is the distinguished collection of individuals representing the population, and $β : M (R) \to R$ is the $M$ -algebra describing how to aggregate a collection of real numbers into a single real number. Informally, this condition states that the social utility of a social outcome is the aggregate of the personal utilities of the personal outcomes faced by all individuals in the population.

Let's call this condition "Social Utility Aggregates Personal Utilities" (SUAPU).

The aggregative principle (when our model of a self-interested human is a rational personal utility maximiser) is equivalent to the utilitarian principle (when social utility is the impartial aggregation of personal utility over each individual). The full proof is elementary and uninsightful.^[4]

Now, these three conditions are only approximately true, and they fail in systematic ways. However, the theorem will help elucidate exactly the extent to which the aggregative principle approximates the corresponding utilitarian principle. Namely, the aggregative principle will approximate the utilitarian principle to the degree that these conditions hold.

Because RPU and SUAPU depend on the specific monad $M$ under discussion, I will spell out the details for three paradigm examples: the list monad $List$ (representing finite sequences), the distribution monad $Δ$ (representing probability distributions), and the nonempty finite powerset monad $P_{f}^{+}$ (representing nonempty finite sets).

5. Equivalence between LELO and longtermist total utilitarianism

The previous section proved an equivalence, under certain conditions, between aggregative principles and utilitarian principles. This section will apply that theorem to the monad $List$ , which is used to formalise Live Every Life Once (LELO). We will see that LELO is equivalent to longtermist total utilitarianism.

5.2. Monoidal rationality of personal utility?

The real numbers admit a concatenation operator in the obvious way, i.e., there exists a function $sum : List (R) \to R$ defined by $sum ([r_{1}, \dots, r_{k}]) := 0 + r_{1} + \dots + r_{k}$ . This is simply the well-known summation operator, which sends a list of real values to their sum.

Let's unpack RPU, which formally states that $v \circ conc = sum \circ v^{List}$ . In other words, for any list of personal outcomes $[p_{1}, \dots, p_{n}]$ , we have equality between $v \circ conc ([p_{1}, \dots, p_{n}])$ and $0 + v (p_{1}) + \dots + v (p_{n})$ . Informally, the personal utility of a concatenated outcome equals the sum of the personal utilities of the outcomes being concatenated. This 'monoidal' rationality condition constrains how humans must value the concatenation of different personal outcomes.

In the previous article, we saw that the concatenation operator $conc : List (P) \to P$ can be equivalently presented by a binary operator $▹$ and a constant $ϵ \in P$ , with the intended interpretation $p ▹ p^{'} := conc ([p_{1}, p_{2}])$ and $ϵ := conc ([])$ . We can restate the RPU condition in terms of $▹$ and $ϵ$ with two equations: $v (ϵ) = 0$ and $v (p ▹ p^{'}) = v (p) + v (p^{'})$ for all $p, p^{'} \in P$ .

How realistic is the RPU condition? That is, supposing humans do maximise a personal utility function, how monoidally rational is it? I think this condition is approximately true, but unrealistic in several ways. I'll assume here that $p ▹ p^{'}$ is interpreted as facing $p$ and then facing $p^{'}$ in sequence, rather than some exotic notion of concatenation.

Firstly, RPU rules out permutation-dependent values. It precludes a personal utility function $v : P \to R$ such that $v (p_{1} ▹ p_{2}) \neq v (p_{2} ▹ p_{1})$ . Informally, RPU assumes human values must be invariant to the ordering of experiences: they cannot value saving the best till last, nor saving the worse till last. In particular, RPU assumes that humans values are time-symmetric, which seems unrealistic, as illustrated by the following examples. Compare the process of learning, i.e. ending with better beliefs than one started with, with the process of unlearning, i.e. ending with worse beliefs than one started with. Humans seem to value learning above unlearning, but such time-asymmetric values are precluded by RPU. Similarly, humans seem to value a history of improvement over a history of degradation, even if both histories are different permutations of the same list of moments, but such values are precluded by RPU.

Secondly, RPU rules out time-discounted values. Under exponential time-discounting, a common assumption in economics, the personal utility function $v : P \to R$ obeys the equation $v (p_{1} ▹ p_{2}) = v (p_{1}) + (1 + δ)^{- duration (p_{1})} \cdot v (p_{2})$ . Here $duration : P \to R^{\geq 0}$ gives the duration of each outcome and $δ > 0$ is the discount rate. This discounting formula weights the first outcome $p_{1}$ more than the second outcome $p_{2}$ , with the difference growing exponentially with the duration of $p_{1}$ . For instance, let $p_{1}$ and $p_{1}^{'}$ be equally valuable experiences lasting different durations, like a minute of ecstasy and a week of contentment respectively. Time-discounting implies that $v (p_{1} ▹ p_{2})$ depends more on $v (p_{2})$ than $v (p_{1}^{'} ▹ p_{2})$ does. However, RPU precludes this possibility, as it requires that $δ = 0$ , i.e. that humans are equally concerned with all life stages, not discounting future rewards relative to present ones

Thirdly, RPU rules out path-dependent values. Informally, whether I value a future $p$ more than a future $q$ must be independent of my past experiences. But this is an unrealistic assumption about human values, as illustrated in the following examples. If $p$ denotes reading Moby Dick and $q$ denotes reading Oliver Twist, then humans seem to value $p ▹ p$ less than $p ▹ q$ but value $q ▹ p$ more than $q ▹ q$ . This is because humans value reading a book higher if they haven't already read it, due to an inherent value for novelty in reading material. Alternatively, if $p$ and $q$ denote being married to two different people, then humans seem to value $p ▹ p$ more than $p ▹ q$ but value $q ▹ p$ less than $q ▹ q$ . This is because humans value being married to someone for a decade higher if they've already been married to them, due to an inherent value for consistency in relationships.^[5] But RPU would precludes such path-dependent values.

5.2. Social utility sums personal utility?

Now let's unpack SUAPU, which formally states that $u (s) = sum ((v \circ γ (-, s))^{List} (l))$ . In other words, the social utility function is the sum of personal utilities over the individuals in the distinguished list representing the population. That is, if $l = [i_{1}, \dots, i_{n}] \in List (I)$ is a list of individuals representing the entire population impartially, then for any social outcome $s \in S$ , its social utility $u (s)$ is given by $v (γ (i_{1}, s)) + \dots + v (γ (i_{n}, s))$ .

How realistic is the SUAPU condition? The answer depends on one's axiological theory. Indeed, SUAPU is a statement of longtermist total utilitarianism. This is a strong assumption that precludes a social utility function from exhibiting certain properties, analogous to how RPU constrains the personal utility function. Specifically, SUAPU precludes social utility functions with the following features:

Permutation-dependence: The social utility depends not only on the final sequence of personal utilities, but also on the specific ordering of the individuals who faced those personal utilities. For example, history where humanity starts well-informed and grows more ignorant generation-to-generation has higher value than a history where humanity starts poorly-informed and grows less ignorant generation-to-generation, if the latter simply reverses the former.
Time-discounting: The social utility discounts the personal utilities of future individuals relative to those of present individuals.
Path-dependence: The value added by the future depends on the past. For example, it is inherently valuable if future generations preserve certain traditions, or inherently valuable if future generations explore novel personal outcomes.

Nonetheless, I think that HMPU, RPU, and SUAPU are useful approximations, even if they aren't perfectly true. To the extent that these assumptions do hold, Live Every Life Once (LELO) and longtermist total utilitarianism will be roughly equivalent. This explains why MacAskill appeals to LELO to argue for longtermist utilitarianism in his book "What We Owe The Future" (2022). Indeed, MacAskill's implicit argument can be summarized as follows:

LELO is a compelling principle of social justice, stating that a social planner should make decisions as if they will live out every individual's life in sequence.
Humans can approximately be modelled as maximising a monoidally rational personal utility function.
The social utility function, stipulated by MacAskill, sums the personal utility over all individuals in society.
Therefore, longtermist total utilitarianism is a compelling principle of social justice, where longtermist total utilitarianism is the utilitarian principle employing MacAskill's stipulated social utility function.

6. Equivalence between HL and average utilitarianism

We've seen how to apply the general equivalence, under certain conditions, between aggregative principles and utilitarian principles, e.g. between LELO and longtermist total utilitarianism. This section will apply that theorem to the monad $Δ$ , which is used to formalise Harsanyi's Lottery (HL). We will see that HL is equivalent to average utilitarianism.

6.1. Convex rationality of personal utility?

The real numbers admit an interpolation operator in the obvious way, i.e., there exists a function $mean : Δ (R) \to R$ defined by $mean (⟨ r_{1} : λ_{1} ∣ \dots ∣ r_{n} : λ_{n} ⟩) := λ_{1} \cdot r_{1} + \dots + λ_{n} \cdot r_{n}$ . This is simply the well-known mean-value operator, which sends a distribution of real values to their weighted average.

Let's unpack RPU, which formally states that $v \circ E = mean \circ v^{Δ}$ . In other words, for any distribution of personal outcomes $⟨ p_{1} : λ_{1} ∣ \dots ∣ p_{n} : λ_{n} ⟩$ , we have equality between $v (E [⟨ p_{1} : λ_{1} ∣ \dots ∣ p_{n} : λ_{n} ⟩])$ and $λ_{1} \cdot v (p_{1}) + \dots + λ_{n} \cdot v (p_{n})$ . Informally, the personal utility of an interpolated outcome is the average of the personal utilities outcomes being interpolated. This 'convex' rationality condition constrains how humans must value the interpolation of different personal outcomes.

In the previous article, we saw that the interpolation operator $E : Δ (P) \to P$ can be equivalently presented by a family of binary operators ${+_{λ} : λ \in (0, 1)}$ , with the intended interpretation $p +_{λ} p^{'} = E (⟨ p : λ ∣ p^{'} : 1 - λ ⟩)$ . We can restate the RPU condition in terms of $+_{λ}$ with the family of equations: $v (p +_{λ} p^{'}) = λ \cdot v (p) + (1 - λ) \cdot v (p^{'})$ .

How realistic is the RPU condition? That is, supposing humans do maximise a personal utility function, how convexly rational is it? I think this condition is approximately true, but unrealistic in several ways. I'll assume here that $p +_{λ} p^{'}$ is interpreted as a lottery between $p$ with likelihood $λ$ and $p^{'}$ with likelihood $1 - λ$ .

Firstly, RPU rules out valuing determinacy. Informally, a lottery can't be valued below each determinate outcomes. But perhaps this is an unrealistic assumption, as illustrated in the following examples. If $p_{1}$ denotes dying on Monday and $p_{2}$ denotes dying on Tuesday, then humans might value both determinate outcomes over the lottery between them, e.g. $v (p_{1}) \approx v (p_{2})$ but $v (p_{1} +_{0.6} p_{2}) < v (p_{1}), v (p_{2})$ . This is because humans may inherently value determinacy about the day of their death. But RPU precludes valuing determinacy.

Secondly, RPU rules out valuing randomness. Informally, a lottery can't be valued above each determinate outcomes. But perhaps this is an unrealistic assumption, as illustrated in the following examples. If $p_{1}$ and $p_{2}$ denotes marrying two different people, then humans might value both determinate outcomes less than the lottery between them, e.g. $v (p_{1}) \approx v (p_{2})$ but $v (p_{1} +_{0.6} p_{2}) > v (p_{1}), v (p_{2})$ . This is because humans may inherently value randomness about whom they marry. Again, RPU precludes valuing randomness.

Thirdly, RPU rules out rules out values discontinuous in the underlying likelihoods. Formally, if $λ_{i} \to λ_{\infty}$ is a convergent sequence in $(0, 1)$ , then RPU implies $v (p +_{λ_{i}} p^{'}) \to v (p +_{λ} p^{'})$ . Moreover, if $λ_{i} \to 0$ then $v (p +_{λ} p^{'}) \to v (p^{'})$ and if $λ_{i} \to 1$ then $v (p +_{λ} p^{'}) \to v (p)$ . But perhaps this is an unrealistic assumption, as illustrated in the following examples. If $p_{okay}$ denotes an okay outcome and $p_{cata}$ denotes a catastrophic outcome, then humans might value the lottery $p_{cata} +_{λ} p_{okay}$ substantially less than $p_{okay}$ for all $λ \in (0, 1)$ , i.e. ${lim}_{λ \to 0} v (p_{cata} +_{λ} p_{okay}) < v (p_{okay})$ . Informally, the human values the zeroness catastrophe's likelihood. Analogously, if $p_{defeat}$ denotes a terrible defeat and $p_{victory}$ denotes a great victory, then humans might value the lottery $p_{victory} +_{λ} p_{defeat}$ substantially more than $p_{defeat}$ for all $λ \in (0, 1)$ , i.e. ${lim}_{λ \to 0} v (p_{victory} +_{λ} p_{defeat}) > v (p_{defeat})$ . Informally, the human values the nonzeroness victory's likelihood. But RPU precludes valuing either zeroness or nonzeroness of likelihoods, because this value would be discontinuous in the underlying likelihoods.

That being said, I think human values approximate convex rationality far better than they approximate monoidal rational. In fact, while mainstream economics does not assume monoidal rationality (e.g. via time-discounting) it does assume convex rationality. Convex rationality is a straightforward application of von Neumann-Morgenstern (VNM) expected utility theory. Hence, I accept convex rationality of human values, at least when interpolation $p +_{λ} p^{'}$ is interpreted as a lottery between $p$ and $p^{'}$ .^[6]

6.2. Social utility averages personal utility?

Now let's unpack SUAPU, which formally states that $u (s) = mean ((v \circ γ (-, s))^{Δ} (π))$ . In other words, the social utility function is the weighted average of personal utility over the individuals in distinguished distribution representing the population. That is, if $π = ⟨ i_{1} : λ_{1} ∣ \dots ∣ i_{n} : λ_{n} ⟩ \in Δ (I)$ is a distribution of individuals representing the entire population impartially, then for any social outcome $s \in S$ , its social utility $u (s)$ is given by $λ_{1} \cdot v (γ (i_{1}, s)) + \dots + λ_{n} \cdot v (γ (i_{n}, s))$ .

How realistic is the SUAPU condition? The answer depends on one's axiological theory. Indeed, SUAPU is a statement of average utilitarianism. This is a strong assumption that precludes a social utility function from exhibiting certain properties, analogous to how RPU constrains the personal utility function. Specifically, SUAPU precludes social utility functions with the following features:

Homogeneity values: A society where half the population faces personal outcome $p$ and half faces $p^{'}$ has lower social utility than both the society where everyone faces $p$ and a society where everyone faces $p^{'}$ . For instance, a society with a mix of catholics and protestants is worse than both a fully-catholic society or a fully-protestant society.
Heterogeneity values: A society where half the population faces personal outcome $p$ and half faces $p^{'}$ has higher social utility than both the society where everyone faces $p$ and a society where everyone faces $p^{'}$ . For instance, a society with a mix of Dickensians and Shakespeareans is worse than both a fully-Dickensian society and a fully-Shakespearean society.
Permutation-dependence: A society where one group $A$ faces $p$ and another group $B$ faces $p^{'}$ has higher value than a society where $p$ and $p^{'}$ are reversed, even though both groups $A$ and $B$ have equal total weight, i.e. $\sum_{i \in A} π (i) = \sum_{i \in B} π (i)$ where $π : I \to [0, 1]$ is the distinguished distribution over individuals. Note that permutation-dependence doesn't necessarily follow from time-discounting, because $π$ may assign early humans higher weights than later humans.
Discontinuity in weights: A society with a slight majority facing $p$ and a slight minority facing $p^{'}$ is valued substantially higher than a society with slight minority facing $p$ and slight majority facing $p^{'}$ . For instance, a society where the majority is happy is substantially better than a society where the minority is unhappy, no matter how marginal the majority.

Nonetheless, I think that HMPU, RPU, and SUAPU are reasonable approximations, even if not perfectly true. To the extent that these assumptions hold, Harsanyi's Lottery (HL) and average utilitarianism will be roughly equivalent. This explains why Harsanyi appeals to HL to argue for average utilitarianism in his paper 'Cardinal Welfare, Individualistic Ethics, and Interpersonal Comparisons of Utility' (1955).

Harsanyi's implicit argument can be summarized as follows:

HL is a compelling principle of social justice, stating that a social planner should make decisions as if they face a lottery over the individuals in population.
Humans can approximately be modelled as maximising a convexly rational personal utility function.
The social utility function, stipulated by Harsanyi, averages the personal utility over all individuals in society.
Therefore, average utilitarianism is a compelling principle of social justice, where average utilitarianism is the utilitarian principle employing Harsanyi's stipulated social utility function.

7. Equivalence between ROI and difference principle

We've seen how to apply the general equivalence, under certain conditions, between aggregative principles and utilitarian principles, e.g. between LELO and longtermist total utilitarianism, or between HL and average utilitarianism. This section will apply that theorem to the monad $P_{f}^{+}$ , which is used to formalise Rawls' Original Position (ROI). We will see that ROI is equivalent to the difference principle.

7.1. Semilatticial rationality of personal utility?

The real numbers admit a fusion operator in the obvious way, i.e., there exists a function $min : P_{f}^{+} (R) \to R$ where $min ({r_{1}, \dots, r_{n}})$ is the largest $r^{*} \in R$ satisfying $r^{*} \leq r_{i}$ for each $i = 1, \dots, n$ . This is simply the well-known minimisation operator, which sends a nonempty finite subset of the real values to their minimum.

Let's unpack RPU, which formally states that $v \circ ⨁ = min \circ v^{P_{f}^{+}}$ . In other words, for any nonempty finite subset of personal outcome ${p_{1}, \dots, p_{n}}$ , we have equality between $v \circ ⨁ ({p_{1}, \dots, p_{n}})$ and $min {v (p_{1}), \dots, v (p_{n})}$ . Informally, the personal utility of a fused outcome is the minimum of the personal utilities of the outcomes being fused. This 'semilatticial' rationality condition constrains how humans must value the fusion of different personal outcomes.

In the previous article, we saw that the fusion operator $⨁ : P_{f}^{+} (P) \to P$ can be equivalently presented by a single binary operator $\oplus$ , with the intended interpretation $p \oplus p^{'} = ⨁ ({p, p^{'}})$ . We can restate the RPU condition in terms of $\oplus$ with the single equation $v (p \oplus p^{'}) = min {v (p), v (p^{'})}$ .

How realistic is the RPU condition? That is, supposing humans do maximise a personal utility function, how semilatticially rational is it? I think this condition is approximately true, but unrealistic in several ways. I'll assume here that $p \oplus p^{'}$ is interpreted as a Knightian uncertainty between facing $p$ and facing $p^{'}$ . Then semilatticial rationality requires that humans are pessimistic, i.e. they value the disjunction of different outcomes no greater than the worst, as if the alternative will be selected by an adversary.

Firstly, RPU rules out valuing determinacy or indeterminacy. Informally, a disjunction can't be valued below each determinate outcome, nor higher. But perhaps this is an unrealistic assumption: humans might value the disjunction $p_{1} \oplus p_{2}$ lower than either $p_{1}$ or $p_{2}$ because humans inherently value determinacy in this case. Or humans might value the disjunction $p_{1} \oplus p_{2}$ lower than either $p_{1}$ or $p_{2}$ because humans inherently value indeterminacy in this case. But RPU precludes such values.

Second, RPU rules out non-pessimistic considerations. Informally, adding additional possibilities can never increase the value of a disjunction. But this is an unrealistic assumption about human values, as illustrated in the following examples. If $p_{okay}$ is a typical comfortable life and $p_{bad}$ is a life of horrific torture, then humans may value the outcome $p_{okay}$ higher than $p_{bad}$ and value the outcome $p_{bad} \oplus p_{okay}$ higher than the outcome $p_{bad}$ . In particular, the outcome $p_{bad} \oplus p_{okay}$ gives a possibility of being fine, while $p_{bad}$ is certain to result in torture. However, RPU precludes such values.

Overall, I think human values approximate semilatticial rationality. Indeed, suppose you face genuine Knightian uncertainty between a set of possibilities $p_{1}, \dots, p_{n}$ , with personal utilities $v (p_{1}), \dots, v (p_{n})$ respectively. What's the ex-ante value? There's not much to done to construct the ex-ante personal value for $p_{1} \oplus \dots \oplus p_{n}$ other than minimising over the possibilities, i.e. $v (p_{1} \oplus \dots \oplus p_{n})$ equals $min ({v (p_{1}), \dots, v (p_{n})})$ . The only alternative is to deny that the ex-ante value of $p_{1} \oplus \dots \oplus p_{n}$ depends solely on the ex-post value of the constituent possibilities, or else employ a different semilattice on $R$ .^[7] Moreover, Wald's maximin model, and robust optimisation more generally, are popular principles of decision-making. These principles involve maximising a semilatticially rational utility function. Hence, I accept semilattical rationality of human values, at least when the fusion $p_{1} \oplus p_{2}$ is interpreted as Knightian uncertainty between $p_{1}$ and $p_{2}$ , a mode of ignorance which is rarely encountered.^[8]

7.2. Social utility minimises personal utility?

Now let's unpack SUAPU, which formally states that $u (s) = min ((v \circ γ (-, s))^{P_{f}^{+}} (A))$ . In other words, the social utility function to the minimum of personal utilities over the individuals in the distinguished nonempty subset representing the population. That is, if $A = {i_{1}, \dots, i_{n}} \in P_{f}^{+} (I)$ is a nonempty subset of individuals representing the entire population impartially, then for any social outcome $s \in S$ , the social utility $u (s)$ is given by $min ({v (γ (i_{1}, s)), \dots, v (γ (i_{n}, s)})$ .

How realistic is the SUAPU condition? The answer depends on one's axiological theory. Indeed, SUAPU is a statement of Rawls' difference principle. This is a strong assumption that precludes a social utility function from exhibiting certain properties, analogous to how RPU constrains the personal utility function. Specifically, SUAPU precludes social utility functions with the following features:

Permutation-dependence: The social utility depends not only on the nonempty subset of personal outcome, but also who faces each particular outcome. For example, a society where one group $G_{1}$ faces $p_{1}$ and another group $G_{2}$ faces $p_{2}$ has higher value than a society where $p_{1}$ and $p_{2}$ are reversed, even though $G_{1}$ and $G_{2}$ are both contained by the distinguished subset $A \in P_{f}^{+} (I)$ representing the population. Note that this form of permutation-dependence necessarily follows from time-discounting future generations, if future generations are contained in $A$ .
Scope-sensitivity: The social utility depends not only on the nonempty subset of personal outcomes, but also how many individuals face each outcome. For example, a society where 99% face torture and 1% face comfort is better than the converse.
Considerations for the fortunate: The social utility depends on how happy the fortunate are, even if there are unfortunate individuals facing much worse outcomes. That is, if two possible societies both contain one person facing horrific torture, then the improving the lives of a billion other people from bad-but-not-torturous to good adds no value.

Nonetheless, I think that HMPU, RPU, and SUAPU are somewhat reasonable approximations, though probably the least plausible in the ROI context than the LELO context or HL context. To the extent that these assumptions hold, Rawls' Original Position (ROI) and his difference principle will be roughly equivalent. This explains why Rawls appeals to ROI to argue for difference utilitarianism in his book "The Theory of Justice" (1973).

Rawls' implicit argument can be summarized as follows:

ROI is a compelling principle of social justice, stating that a social planner should make decisions as if they were ignorant about which individual in society they will be, with no basis for assigning probabilities to the possible alternatives.
Humans can approximately be modelled as maximising a semilatticially rational personal utility function.
The social utility function, stipulated by Rawls, minimises the personal utility over all individuals in society.
Therefore, Rawls' difference principle is a compelling principle of social justice, where Rawls' difference principle is the utilitarian principle employing Rawls' stipulated social utility function.

8. Conclusion

To summarise, I first formalised social choice principles using functions of type-signature $(X \to S) \to P (X)$ . This allowed me to define the utilitarian principle corresponding to a given social utility function, and the aggregative principles corresponding to a given social zeta function. As discussed in my previous article, this social zeta function maps a social outcome to the aggregated personal outcomes of each individual. Using the formalism, I proved that, under three natural conditions, the aggregative principle is mathematically equivalent a corresponding utilitarian principle. Because these conditions are approximately true, aggregativism approximates utilitarianism. even though aggregativism avoids the theoretical pitfalls of utilitarianism, we should nonetheless expect aggregativism to generate roughly-utilitarian recommendations in practical social contexts, and thereby retain the most appealing insights from utilitarianism. Moreover, this explains why MacAskill, Harsanyi, and Rawls each appeal to aggregative principles to defend their respective utilitarian principles.

In this next article, I will enumerate the theoretical pitfalls that face utilitarianism, and how aggregativism overcomes them.

^{^}
See Appraising aggregativism and utilitarianism for a thorough defence.
^{^}
In fact, the function mapping each option $x \in X$ to the principle $Ψ : f \mapsto {x}$ is a canonical embedding of the space of options into the space of social choice principle.
^{^}
See Game Theory without Argmax.
^{^}
The aggregative principle is $f \mapsto Π (ζ \circ f)$ , where $f : X \to S$ is a social context, $Π$ is the human model, and $ζ : S \to P$ is the social zeta function. This means a social planner should choose an option if a self-interested human would choose the associated personal outcome. By HMPU, $Π$ has the form $f \mapsto {argmax}_{X} (v \circ f)$ , where $v : P \to R$ is the personal utility function. This means a self-interested human will choose an option that maximizes personal utility. Hence, aggregativism is the principle $f \mapsto {argmax}_{X} (v \circ ζ \circ f)$ . Intuitively, this means a social planner should choose an option which maximizes the personal utility of the associated personal outcome.
The social zeta function $ζ : S \to P$ is defined by $ζ (s) := α (γ (-, s)^{M} (π))$ , where $α : M (P) \to P$ is the aggregation function for personal outcomes, $γ : I \times S \to P$ assigns personal outcomes to individuals, and $π \in M (I)$ is the distinguished collection representing the population. Intuitively, this means the personal outcome associated to a social outcome is the aggregate of the personal outcomes across all individuals in society.
Now, RPU asserts that $v \circ α = β \circ v^{M}$ , i.e. that the personal utility of the aggregate of personal outcomes is the aggregate of personal utilities of each outcome. Given $ζ (s) := α (γ (-, s)^{M} (π))$ , we obtain $(v \circ ζ) (s) = (β \circ v^{M}) (γ (-, s)^{M} (π))$ . Intuitively, this means the personal utility of the personal outcome associated to a social outcome is the aggregate of the personal utilities of the personal outcomes faced by each individual in society.
Now, SUAPU asserts that $u (s) = β ((v \circ γ) (-, s)^{M} (π))$ , where $u : S \to R$ is the social utility function, $β : M (R) \to R$ is the aggregation function for real numbers, $v : P \to R$ is the personal utility function, $γ : I \times S \to P$ assigns personal outcomes to individuals, and $π \in M (I)$ is the distinguished collection representing the population. Intuitively, this means the social utility of a social outcome is the aggregate of the personal utilities of the personal outcomes faced by each individual in society.
This entails that $u = v \circ ζ$ . To see this, note that the right-hand-sides of the equations $(v \circ ζ) (s) = (β \circ v^{M}) (γ (-, s)^{M} (π))$ and $u (s) = β ((v \circ γ) (-, s)^{M} (π))$ are identical: $(β \circ v^{M}) (γ (-, s)^{M} (π)) = β ((v \circ γ) (-, s)^{M} (π))$ . Indeed, this follows from the functorality of the lifting operator. Therefore, $v \circ ζ (s) = u (s)$ for all $s \in S$ . Intuitively, this means the social utility of a social outcome is the personal utility of its associated personal outcome.
Hence, the aggregative principle is $f \mapsto {argmax}_{X} (u \circ f)$ . To see this, note that ${argmax}_{X} (v \circ ζ \circ f) = {argmax}_{X} (u \circ f)$ because $v \circ ζ = u$ . Intuitively, this means a social planner following the aggregative principle should choose an option which maximizes the social utility of the resulting social outcome. The utilitarian principle is $f \mapsto {argmax}_{X} (u \circ f)$ . Hence the aggregative principle is equivalent to the utilitarian principle conditional on HMPU, RPU, and SUAPU. $□$
^{^}
Of course, whether these particular cases violate RPU depends on which function $Π : (X \to P) \to P_{f}^{+} (X)$ models the self-interested human, and which personal utility function $v : P \to R$ is used to characterise $Π$ . Nonetheless, I think that any reasonable $Π$ or $v : P \to R$ will contain both examples of novelty value and of consistency value.
^{^}
We might also ask: are human values convexly rational with respect to other convex algebras on personal outcomes? Recall that, in my previous article, we examined a novel interpretation of $p +_{λ} p^{'}$ as the direct interpolation in some high-dimensional vector space $R^{d}$ . To obtain semantically meaningful vector representations of personal outcomes, we might leverage the activation space of a large language model like GPT-3. The interpolation $p +_{λ} p^{'}$ of two vector representations $p, p^{'} \in R^{d}$ is simply $λ \cdot p + (1 - λ) \cdot p^{'}$ . Under this interpretation of $+_{λ}$ , the RPU condition says that personal utility is a linear probe. Formally, RPU requires the personal utility function $v : R^{d} \to R$ to satisfy the equation $v (λ \cdot p + (1 - λ) \cdot p^{'}) = λ \cdot v (p) + (1 - λ) \cdot v (p^{'})$ for all vectors $p, p \in R^{d}$ and interpolation weights $λ \in (0, 1)$ . Whether RPU holds in this setting depends on the specific vector representation of outcomes.
^{^}
The real numbers admit another fusion operator, $max : P_{f}^{+} (R) \to R$ , which we could consider. But the semilattice $(R, max)$ will generate a condition of semilatticial rational which is even less plausible than that generated by the semilattice $(R, min)$ . Namely, it requires $v (p_{1} \oplus p_{2}) = max {v (p_{1}), v (p_{2})}$ , e.g. humans would value Knightian uncertainty between horrific torture and a comfortable life no higher than certainty in a comfortable life.
^{^}
In my previous article, we examined a conjunctive interpretation of the fusion of personal outcomes, in contrast to Rawls' disjunctive interpretation. In particular, if $p_{1}$ and $p_{2}$ are personal outcomes then $p_{1} \oplus p_{2}$ is the personal outcome of facing $p_{1}$ and $p_{2}$ simultaneously. How should we understand semilatticial rationality, which formally states that for any nonempty finite subset of personal outcome ${p_{1}, \dots, p_{n}}$ , we have equality between $v \circ ⨁ ({p_{1}, \dots, p_{n}})$ and $min {v (p_{1}), \dots, v (p_{n})}$ ? Under this fusion operator, semilatticial rationality requires that humans are "glass half-empty". Informally, the value of facing outcomes $p_{1}, \dots, p_{n}$ simultaneously is no greater than the value of the worst constituent outcome. That is, $v (p_{1} \oplus \dots \oplus p_{n}) = min {v (p_{1}), \dots, v (p_{n})}$ .
Here's how this rationality condition might arise naturally: Imagine a set of "catastrophes", such as being bored, being cold, being dead. Each catastrophe is represented with a personal outcome $p$ and a value $v (p) \in R$ . For example, $v (bored) = - 5$ , $v (cold) = - 10$ , and $v (dead) = - 1000$ . Moreover, the utility of a complex personal outcome, such as being bored and cold simultaneously, is determined by the worst catastrophe. That is, $v (bored \oplus cold) = - 10$ . It implies that facing multiple catastrophes, which are equally disastrous, is no worse than facing only one such catastrophe, i.e. if $v (hungry) = - 10$ then $v (cold \oplus hungry) = - 10$ .

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

28

Aggregative principles approximate utilitarian principles

28

28

1. Introduction

2. Social choice principles

3. Two strategies for specifying principles

3.1. Utilitarian principles

3.2. Aggregative principles

3.3. Structural similarity between the two strategies

4. Equivalence between aggregativism and utilitarianism

4.1. Three conditions for equivalence

5. Equivalence between LELO and longtermist total utilitarianism

5.2. Monoidal rationality of personal utility?

5.2. Social utility sums personal utility?

6. Equivalence between HL and average utilitarianism

6.1. Convex rationality of personal utility?

6.2. Social utility averages personal utility?

7. Equivalence between ROI and difference principle

7.1. Semilatticial rationality of personal utility?

7.2. Social utility minimises personal utility?

8. Conclusion