
In 1955, John Harsanyi published a paper demonstrating that anyone who accepts certain reasonable assumptions must be a total utilitarian. The paper is somewhat technical, but the result is relatively easy to understand. I’ve been unable to find a non-technical summary of this result and so, because it is one of the more compelling arguments for utilitarianism, I decided to write one up.

Background

Suppose a group of friends are deciding where to eat. Each individual person has some preference (say, one person most prefers Chinese, then Italian, then Japanese; another prefers Italian, then Chinese, then Japanese) but there is no clear restaurant which everyone thinks is best. How should they choose a place?

One solution is to have each person attach a numeric score to how much they would enjoy a given restaurant. If you really like Chinese food, then maybe you give it 10 points; if you’re lukewarm then you give it 2, and if you really hate Chinese then maybe it’s -5.

Once each person has voted, you simply add up all the scores, and then the group goes to whichever restaurant had the highest total score.

This method is (a simplified form of) “total” utilitarianism, and Harsanyi demonstrated that it is the only “reasonable” way that groups can make a decision.
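
As a rough sketch of this scoring-and-summing rule in code (the diners, restaurants, and numbers below are invented for illustration):

```python
# Minimal sketch of the "add up everyone's scores" rule described above.
# Diners, restaurants, and scores are made up for the example.
scores = {
    "Alice": {"Chinese": 10, "Italian": 4, "Japanese": 2},
    "Bob":   {"Chinese": -5, "Italian": 8, "Japanese": 6},
    "Cara":  {"Chinese": 3,  "Italian": 5, "Japanese": 7},
}

restaurants = ["Chinese", "Italian", "Japanese"]

# The group's score for each restaurant is the sum of the individual scores.
group_scores = {r: sum(person[r] for person in scores.values()) for r in restaurants}

print(group_scores)                             # {'Chinese': 8, 'Italian': 17, 'Japanese': 15}
print(max(group_scores, key=group_scores.get))  # Italian
```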

Theorem

Concretely, assume:

  1. Each individual in the group is rational (for a commonly used but technical definition of “rational”, hereafter referred to as “VNM-rational”)[1][2]

  2. The group as a whole is VNM-rational[3][4]

  3. If every individual in the group is indifferent between two options, then the group as a whole is indifferent between those two options

The theorem proves that total utilitarianism is the only method which satisfies these three assumptions.

Note that this theorem just demonstrates that, if there is some way of saying that certain things are better or worse for individuals, then the way to determine whether those things are better or worse for groups is to add up how good it is for the individuals in those groups. It doesn't say anything about the way in which things can be better or worse for individuals. I.e. you could be adding up each individual's happiness (hedonistic utilitarianism), something related to their preferences (preference utilitarianism), or something more exotic.

Example

The above is somewhat abstract, so here is a concrete example demonstrating why anything other than total utilitarianism fails these axioms. (This is my best attempt at creating a simple example; perhaps others in the comments can create even simpler ones.)

Consider a population consisting of 2 people. Because they are VNM-rational, they have utility functions, and therefore we can represent states of the world as a vector of numbers. E.g. the vector (5, 7) is a world in which the first person has utility 5 and the second has utility 7.

Let’s prove that the world (1, 1) must be as good as the world (2, 0).

Consider a lottery in which there is a one-half chance we end up with the world (2, 0) and a one-half chance that we end up with the world (0, 2). Because we are indifferent between who has the 2 and who has the 0,[5] and the group is an expected utility maximizer, these are equally valuable:[6]

(2, 0) ~ 50% chance of (2, 0), 50% chance of (0, 2)

We can write this from the perspective of each individual in society: for each person, the lottery is a 50% chance of utility 2 and a 50% chance of utility 0.

Because VNM-rational agents are expected utility maximizers, we can just multiply the probabilities through:[7]

(2, 0) ~ (0.5 · 2 + 0.5 · 0, 0.5 · 2 + 0.5 · 0) = (1, 1)

QED.

The key insight here is that each individual is indifferent between the “50% chance of 2, 50% chance of 0” lottery and a guaranteed 1 (on account of being VNM-rational). Because each individual is indifferent, the group is also forced to be indifferent (on account of the third assumption).
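
As a quick numerical check of the example (a sketch, not part of Harsanyi's proof): under the 50/50 lottery over (2, 0) and (0, 2), each individual's expected utility is 1, so each is indifferent between the lottery and the sure world (1, 1), and the group total is 2 either way.

```python
# Sketch verifying the example above.
# The lottery: 50% chance of world (2, 0), 50% chance of world (0, 2).
lottery = [(0.5, (2, 0)), (0.5, (0, 2))]
sure_world = (1, 1)

# Each individual's expected utility under the lottery.
individual_expectations = [sum(p * outcome[i] for p, outcome in lottery) for i in range(2)]
print(individual_expectations)  # [1.0, 1.0] -- same as the sure world (1, 1)

# The group's value under total utilitarianism (sum of individual utilities).
lottery_total = sum(p * sum(outcome) for p, outcome in lottery)
print(lottery_total, sum(sure_world))  # 2.0 2 -- the group is indifferent
```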

Conclusion

Total utilitarianism is a fairly controversial position. The above example, where (1, 1) is exactly as good as (2, 0), can be extended to show that utilitarianism is extremely demanding, potentially requiring extreme sacrifices and inequality.

It is therefore interesting that it is the only decision procedure which does not violate one of these seemingly reasonable assumptions.

While not conclusive, this theorem provides a compelling argument for total utilitarianism.

Appendix on Equality

Harsanyi’s original theorem allowed for weighted total utilitarianism. (I.e. everyone gets a vote, but some people’s votes count more than others.)

It’s easy enough to add an assumption like “also everyone is equal” to force true total utilitarianism, but interestingly Harsanyi didn’t think that was necessary:

This implies, however, without any additional ethical postulates that an individual’s impersonal preferences, if they are rational, must satisfy Marschak’s axioms [equivalent to VNM-rationality] and consequently must define a cardinal social welfare function equal to the arithmetical mean of the utilities of all individuals in the society (the arithmetical mean of all individual utilities gives the actuarial value of his uncertain prospects, defined by an equal probability of being put in the place of any individual in the situation chosen). [Emphasis added]

In other words, he thinks it would be irrational to weight people unevenly, because equal weighting is the expected utility-maximizing choice if you don’t know which person in society you will become.

This idea of making decisions behind a veil of ignorance, where you don’t know which person in society you will become, was later popularized by John Rawls, who used it to argue for his maximin decision rule.

It is, in my humble opinion, unfortunate that the veil of ignorance has become associated with Rawls, when Harsanyi’s utilitarian formulation has a much more rigorous mathematical grounding. (And was also published earlier.)

Credits

I would like to thank Aaron Gertler, Sam Deere, Caitlin Elizondo and the CEA UK office staff for comments on drafts of this post and discussions about related ideas.


  1. Harsanyi used Marschak’s axioms, which are mathematically equivalent to the VNM ones, but less popular. I'm using VNM here just because they seem better known. ↩︎

  2. "Rational" is a somewhat unfortunate term, but I'm sticking with it because it's standard. These axioms are intended to prevent things like "Ben likes apples more than bananas but also likes bananas more than apples." It's not intended to prevent "irrational" value judgments like enjoying Nickelback's music. A better term might be something like "consistent". ↩︎

  3. It’s a well-known consequence of this assumption that the group must be “utilitarian” in the sense that it has a utility function. The surprising part of Harsanyi’s theorem is not that there is a utility function but rather that the utility function must be a linear addition of its constituents’ utility functions (as opposed to, say, their average or the sum of their logarithms or something completely disconnected from its constituents' utility). ↩︎

  4. An example of what it means for a group decision to be VNM-rational: if the group somehow aggregates its preferences (through voting or reading entrails or whatever) and decides that Chinese is preferable to Italian, and also that Italian is preferable to Japanese, then the group must also conclude that Chinese is preferable to Japanese. We don’t care how it’s aggregating its preferences, but it must do so in a “rational” way. ↩︎

  5. Note that this isn't clearly implied by the assumptions – see the appendix on equality. Harsanyi's original proof does not require any assumptions about equality, but this sort of assumption makes the proof much simpler and seems unlikely to be a point of controversy, so I'm including it. ↩︎

  6. More precisely: because the group is an expected utility maximizer, the lottery has value 0.5 · U(2, 0) + 0.5 · U(0, 2) for some group utility function U. Because we are indifferent between who has the 2 and who has the 0, U(2, 0) = U(0, 2). (Normalizing U(2, 0) = 1.) Therefore, the lottery has value 0.5 · 1 + 0.5 · 1 = 1 = U(2, 0). I’m still skipping some steps; people interested in a more rigorous proof should see his original paper. ↩︎

  7. More precisely: each individual is indifferent between a lottery where they are guaranteed 1 utility versus having a 50% chance of 2, 50% chance of 0. Since each individual is indifferent between these, the group is also indifferent. ↩︎

Comments

Thanks for writing this up!

For those interested in more info:

VNM utility is the thing that people actually pursue and care about. If wellbeing is distinct from that, then wellbeing is the wrong thing for society to be optimizing. I think this actually is the case. Harsanyi and I are preference utilitarians. Singer and Parfit seem to be something else. I believe they were wrong about something quite foundational. Writing about this properly is extremely difficult; I can understand why no one has done it, and I don't know when I'll ever get around to it.

I think this last point essentially denies the third axiom above, which is what connects individual vNM utility and social/ethical preferences. (The original statement of the second axiom is just vNM rationality for social/ethical preferences, and has no relationship with the individuals' preferences.)

Thanks for writing this!

I don't think the theorem provides support for total utilitarianism, specifically, unless you add extra assumptions about how to deal with populations of different sizes or different populations generally. Average utilitarianism is still consistent with it, for example. Furthermore, if you don't count the interests of people who exist until after they exist or unless they come to exist, it probably won't look like total utilitarianism, although it gets more complicated.

You might be interested in Teruji Thomas' paper "The Asymmetry, Uncertainty, and the Long Term" (EA Forum post here), which proves a similar result from slightly different premises, but is compatible with 1) ex post prioritarianism, 2) mere addition, 3) the procreation asymmetry, 4) avoiding the repugnant conclusion and 5) avoiding antinatalism, all five at the same time, because it sacrifices the independence of irrelevant alternatives (the claim that how you rank choices should not depend on what choices are available to you, not the vNM axiom). Thomas proposes beatpath voting to choose actions. Christopher Meacham's "Person-affecting views and saturating counterpart relations" also provides an additive calculus which "solves the Non-Identity Problem, avoids the Repugnant and Absurd Conclusions, and solves the Mere-Addition Paradox" and satisfies the asymmetry, also by giving up the independence of irrelevant alternatives, but hasn't, as far as I know, been extended to deal with uncertainty.

I've also written about ex ante prioritarianism in the comments on the EA Forum post about Thomas' paper, and in my own post here (with useful feedback in the comments).

Thanks!

I don't think the theorem provides support for total utilitarianism, specifically, unless you add extra assumptions about how to deal with populations of different sizes or different populations generally. Average utilitarianism is still consistent with it, for example.

Well, average utilitarianism is consistent with the result because it gives the same answer as total utilitarianism (for a fixed population size). The vast majority of utility functions one can imagine (including ones also based on the original position like maximin) are ruled out by the result. I agree that the technical result is "anything isomorphic to total utilitarianism" though.

You might be interested in Teruji Thomas' paper

I had not seen that, thanks!

I just want to second the point that some others have made that it seems more accurate to say only that Harsanyi's result supports utilitarianism (rather than total utilitarianism). Adding the word "total" suggests that the result rules out other versions of utilitarianism (e.g. average, critical-level and critical-range utilitarianism), which as you point out is not correct. More generally, I think "utilitarianism" (without the "total") nicely signals that Harsanyi's result concerns fixed-population settings.

It is also worth noting that Harsanyi himself accepted average utilitarianism rather than total utilitarianism in variable-population settings (see the letter exchange between him and Yew-Kwang Ng reported in the appendix of Ng, Y. K. (1983). Some broader issues of social choice. In Contributions to Economic Analysis  (Vol. 145, pp. 151-173). Elsevier.).

Anyway, thanks for this post!

[Edited comment to remove grammatical error]

Thanks for writing this up! I agree that this result is interesting, but I find it unpersuasive as a normative argument. Why should morality be based on group decision-making principles? Why should I care about VNM rationality of the group?

Also, you suggest that this result lends support to common EA beliefs. I'm not so sure about that. First, it leads to preference utilitarianism, not hedonic utilitarianism. Second, EAs tend to value animals and future people, but they would arguably not count as part of the "group" in this framework(?). Third, I'm not sure what this tells you about the creation or non-creation of possible beings (cf. the asymmetry in population ethics).

Finally, it's worth pointing out that you could also start with different assumptions and get very different results. For instance, rather than demanding that the group is VNM rational, one could consider rational individuals in a group who bargain over what to do, and then look at bargaining solutions. And it turns out that the utilitarian approach of adding up utilities is *not* a bargaining solution, because it violates Pareto-optimality in some cases. Does that "disprove" total utilitarianism?

(Using e.g. the Nash bargaining solution with many participants probably leads to some form of prioritarianism or egalitarianism, because you'd have to ensure that everyone benefits.)

+1

I have a strongly negative bias against any attempt to ground normative theories in abstract mathematical frameworks, such as game theory and decision theory. The way I see it, the two central claims of utilitarianism are the axiological claim (well-being is what matters) and the maximizing claim (we should maximize what matters, i.e. well-being). This argument provides no reason to ground our axiology in well-being, and also provides no reason that we should be maximizers.

In general, there is a significant difference between normative claims, like total utilitarianism, and factual claims, like "As a group, VNM rational agents will do X."

And it turns out that the utilitarian approach of adding up utilities is *not* a bargaining solution, because it violates Pareto-optimality in some cases. Does that "disprove" total utilitarianism?

I'm not sure this is right. As soon as you maximize a weighted sum with non-negative coefficients your solution will be weakly Pareto optimal. As soon as all coefficients are strictly positive, it will be strongly Pareto optimal. The axioms mentioned above don't imply non-negative coefficients, so theoretically they are also satisfied by "anti-utilitarianism" which counts everyone's utility negatively. But one can add stronger Pareto axioms to force all coefficients to be strictly positive.

The problem with the utilitarian bargaining solution is that it is not independent of affine transformations of utility functions. Just summing up utility functions is underspecified; one also needs to choose a scaling for the utility functions. A second criterion that might not be satisfied by the utilitarian solution (depending on the scaling chosen) is individual rationality, which means that everyone will be better off given the bargaining solution than under some disagreement outcome.

You're right; I meant to refer to the violation of individual rationality. Thanks!

Thanks for the comment!

Also, you suggest that this result lends support to common EA beliefs.

Hmm, I wasn't trying to suggest that, but I might have accidentally implied something. I would be curious what you are pointing to?

First, it leads to preference utilitarianism, not hedonic utilitarianism

I used preferences about restaurants as an example because that seemed like something people can relate to easily, but that's just an example. The theorem is compatible with hedonic utilitarianism. (In that case, the theorem would just prove that the group's utility function is the sum of each individual's happiness.)

Second, EAs tend to value animals and future people, but they would arguably not count as part of the "group" in this framework(?).

I don't think that this theorem says much about who you aggregate. It's just simply stating that if you aggregate some group of persons in a certain way, then that aggregation must take the form of addition.

Third, I'm not sure what this tells you about the creation or non-creation of possible beings (cf. the asymmetry in population ethics).

I agree it doesn't say much, see e.g. Michael's comment.

I agree it doesn't say much, see e.g. Michael's comment.

In that case, it would IMO be better to change "total utilitarianism" to "utilitarianism" in the article. Utilitarianism is different from other forms of consequentialism in that it uses thoroughgoing aggregation. Isn't that what Harsanyi's theorem mainly shows? It doesn't really add any intuitions about population ethics. Mentioning the repugnant conclusion in this context feels premature.

In that case, it would IMO be better to change "total utilitarianism" to "utilitarianism" in the article. Utilitarianism is different from other forms of consequentialism in that it uses thoroughgoing aggregation. Isn't that what Harsanyi's theorem mainly shows?

Hmm, it does show that it's a linear addition of utilities (as opposed to, say, the sum of their logarithms). So I think it's stronger than saying just "thoroughgoing aggregation".

I'm not very familiar with the terminology here, but I remember that in this paper, Alastair Norcross used the term "thoroughgoing aggregation" for what seems to be linear addition of utilities in particular. That's what I had in mind anyway, so I'm not sure I believe anything different from you. The reason I commented above was because I don't understand the choice of "total utilitarianism" instead of just "utilitarianism." Doesn't every form of utilitarianism use linear addition of utilities in a case where population size remains fixed? But only total utilitarianism implies the repugnant conclusion. Your conclusion section IMO suggests that Harsanyi's theorem (which takes a case where population size is indeed fixed) does something to help motivate total utilitarianism over other forms of utilitarianism, such as prior-existence utilitarianism, negative utilitarianism or average utilitarianism. You already acknowledged in your reply further above that it doesn't do much of that. That's why I suggested rephrasing your conclusion section. Alternatively, you could also explain in what ways you might think the utilitarian alternatives to total utilitarianism are contrived somehow or not in line with Harsanyi's assumptions. And probably I'm missing something about how you think about all of this, because the rest of the article seemed really excellent and clear to me. I just find the conclusion section really jarring.

Alastair Norcross used the term "thoroughgoing aggregation" for what seems to be linear addition of utilities in particular

Ah, my mistake – I had heard this definition before, which seems slightly different.

I just find the conclusion section really jarring.

Thanks for the suggestion – always tricky to figure out what a "straightforward" consequence is in philosophy.

I changed it to this – curious if you still find it jarring?

Total utilitarianism is a fairly controversial position. The above example, where (1, 1) is exactly as good as (2, 0), can be extended to show that utilitarianism is extremely demanding, potentially requiring extreme sacrifices and inequality. It is therefore interesting that it is the only decision procedure which does not violate one of these seemingly reasonable assumptions.

Ah, my mistake – I had heard this definition before, which seems slightly different.

Probably I was wrong here. After reading this abstract, I realize that the way Norcross wrote about it is compatible with claims weaker than linear aggregation of utility, too. I think I just assumed that he must mean linear aggregation of utility, because everything else would seem weirdly arbitrary. :)

I changed it to this – curious if you still find it jarring?

Less so! The "total" still indicates the same conclusion I thought would be jumping the gun a bit, but if that's your takeaway it's certainly fine to leave it. Personally I would just write "utilitarianism" instead of "total utilitarianism."

I used preferences about restaurants as an example because that seemed like something people can relate to easily, but that's just an example. The theorem is compatible with hedonic utilitarianism. (In that case, the theorem would just prove that the group's utility function is the sum of each individual's happiness.)

In this case, I think it's harder to argue that we should care about ex ante expected individual hedonistic utility and for the 1st and 3rd axioms, because we had rationality based on preferences and something like Pareto to support these axioms before, but we could now just be concerned with the distribution of hedonistic utility in the universe, which leaves room for prioritarianism and egalitarianism. I think the only "non-paternalistic" and possibly objective way to aggregate hedonistic utility within an individual (over their life and/or over uncertainty) would be to start from individual preferences/attitudes/desires but just ignore concerns not about hedonism and non-hedonistic preferences, i.e. an externalist account of hedonism. Roger Crisp defends internalism in "Hedonism Reconsidered", and defines the two terms this way:

Two types of theory of enjoyment are outlined – internalism, according to which enjoyment has some special 'feeling tone', and externalism, according to which enjoyment is any kind of experience to which we take some special attitude, such as that of desire.

Otherwise, I don't think there's any reason to believe there's an objective common cardinal scale for suffering and pleasure, even if there were a scale for suffering and a separate scale for pleasure. Suffering and pleasure don't use exactly the same parts of the brain, and suffering isn't just an "opposite" pattern to pleasure. Relying on mixed states, observing judgements when both suffering and pleasure are happening at the same time might seem promising, but these judgements happen at a higher level and probably wouldn't be consistent between people, e.g. you could have two people with exactly the same suffering and pleasure subsystems, but with different aggregating systems.

I'm personally more sympathetic to externalism. With antifrustrationism (there are actually arguments for antifrustrationism; see also my comment here), externalism leads to a negative hedonistic view (which I discuss further here).

Why should morality be based on group decision-making principles? Why should I care about VNM rationality of the group?

It doesn't have to be the group, it can be an impartial observer with their own social welfare function, as long as it is increasing with individual expected utility, i.e. satisfies ex ante Pareto. Actually, that's how it was originally stated.

EDIT: woops, condition 2 is weaker than ex ante Pareto; it's just vNM rationality with respect to outcomes for social/ethical preferences/views. It's condition 3 that connects individual vNM utility and social/ethical vNM utility.

[This comment is no longer endorsed by its author]
Why should morality be based on group decision-making principles? Why should I care about VNM rationality of the group?

I've retracted my previous reply. The original 2nd condition is different from ex ante Pareto; it's just vNM rationality with respect to outcomes for social/ethical preferences/views and it says nothing about the relationship between individual preferences and social/ethical ones. It's condition 3 that connects individual vNM utility and social/ethical vNM utility.

I am not an expert in this topic, but I believe this recent paper is relevant and may derive a result that is more general than Harsanyi-style utilitarianism: https://www.sciencedirect.com/science/article/pii/S0304406820300045

Because we are indifferent between who has the 2 and who has the 0

Perhaps I'm missing something, but where does this claim come from? It doesn't seem to follow from the three starting assumptions.

Yeah, it doesn't (obviously) follow. See the appendix on equality. It made the proof simpler and I thought most readers would not find it objectionable, but if you have a suggestion for an alternate simple proof I would love to hear it!

As stated in another comment, you have proved the result for any ethical theory that is identical to total utilitarianism at fixed population sizes (e.g. average utilitarianism).

But, you can use separability to rule out non-total versions of utilitarianism.

Separability is roughly the principle that, in comparing the value of two outcomes, one can ignore any people whose existence and welfare are unaffected.

Non-total versions of utilitarianism violate separability because they imply that the value of creating someone depends on the population or wellbeing of unaffected beings.
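
A small numerical sketch of that point (the welfare numbers are invented): under total utilitarianism, the value of adding a new person with welfare 2 does not depend on an unaffected bystander, but under average utilitarianism it does.

```python
# Sketch: average utilitarianism violates separability; total utilitarianism does not.
# Welfare numbers are invented for illustration.
def total(pop):
    return sum(pop)

def average(pop):
    return sum(pop) / len(pop)

existing = [1]      # one existing person with welfare 1
newcomer = 2        # candidate addition with welfare 2
bystander = 10      # an unaffected person

for rule in (total, average):
    without_bystander = rule(existing + [newcomer]) - rule(existing)
    with_bystander = rule(existing + [bystander, newcomer]) - rule(existing + [bystander])
    print(rule.__name__, round(without_bystander, 2), round(with_bystander, 2))

# total:   2 and 2       -- the bystander is irrelevant (separability holds)
# average: 0.5 and -1.17 -- adding the same person flips from good to bad
```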

FWIW, if you extend the rationality axioms to prospects (probability distributions) with infinitely many possible outcomes in their natural ways, Harsanyi's theorem + separability leads to contradiction. In general, unbounded utility functions violate extensions of standard rationality axioms to prospects with infinitely many possible outcomes, and these extensions can be motivated pretty much the same ways as the versions here, in the vNM utility theorem and Savage's theorem. See my post here.

I think this math is interesting, and I appreciate the good pedagogy here. But I don't think this type of reasoning is relevant to my effective altruism (defined as "figuring out how to do the most good"). In particular, I disagree that this is an "argument for utilitarianism" in the sense that it has the potential to convince me to donate to cause A instead of donating to cause B.

(I really do mean "me" and "my" in that sentence; other people may find that this argument can indeed convince them of this, and that's a fact about them I have no quarrel with. I'm posting this because I just want to put a signpost saying "some people in EA believe this," in case others feel the same way.)

Following Richard Ngo's post https://forum.effectivealtruism.org/posts/TqCDCkp2ZosCiS3FB/arguments-for-moral-indefinability, I don't think that human moral preferences can be made free of contradiction. Although I don't like contradictions and I don't want to have them, I also don't like things like the repugnant conclusion, and I'm not sure why the distaste towards contradictions should be the one that always triumphs.

Since VNM-rationality is based on transitive preferences, and I disagree that human preferences can or "should" be transitive, I interpret things like this as without normative weight.

I think this is an important point. People might want to start with additional or just different axioms, including, as you say, avoiding the repugnant conclusion, and if they can't all together be consistent, then this theorem may unjustifiably privilege a specific subset of those axioms.

I do think this is an argument for utilitarianism, but more like in the sense of "This is a reason to be a utilitarian, but other reasons might outweigh it." I think it does have some normative weight in this way.

Also, independence of irrelevant alternatives is safer to give up than transitivity, and might accomplish most of what you want. See my other comment.

Thanks for the pointer to "independence of irrelevant alternatives."

I'm curious to know how you think about "some normative weight." I think of these arguments as being about mathematical systems that do not describe humans, hence no normative weight. Do you think of them as being about mathematical systems that *somewhat* describe humans, hence *some* normative weight?

I think if you believe the conditions of the theorem are all plausible or desirable and so give them some weight, then you should give the conclusion some weight, too.

For example, it's unlikely to be the case that anyone's ethical rankings actually satisfy the vNM rationality conditions in practice, but if you give any weight to the claims that we should have ethical rankings that are complete, continuous with respect to probabilities (which are assumed to work in the standard way), satisfy the independence of irrelevant alternatives and avoid all theoretical (weak) Dutch books, and also give weight to the combination of these conditions at once*, then the Dutch book results give you reason to believe you should satisfy the vNM rationality axioms, since if you don't, you can get (weakly) Dutch booked in theory. I think you should be at least as sympathetic to the conclusion of a theorem as you are to the combination of all of its assumptions, if you accept the kind of deductive logic used in the proofs.

*I might be missing more important conditions.

I think if you believe the conditions of the theorem are all plausible or desirable and so give them some weight, then you should give the conclusion some weight, too.

This makes sense, but the type of things that tend to convince me to believe in an ethical theory generally depends a lot on how much I resonate with the main claims of the theory. When I look at the premises in this theorem, none of them seem to be the type of things that I care about.

On the other hand, pointing out that utilitarians care about people and animals, and want them to be as happy as possible (and free, or with agency, desire satisfaction), makes me happy to endorse the theory. When I think about all people and animals being happy and free from pain in a utilitarian world, I get a positive feeling. When I think about "Total utilitarians are the only ones that satisfy these three assumptions" I don't get the same positive feeling.

When it comes to ethics, it's the emotional arguments that really win me over.

This makes sense, but the type of things that tend to convince me to believe in an ethical theory generally depends a lot on how much I resonate with the main claims of the theory. When I look at the premises in this theorem, none of them seem to be the type of things that I care about.

If you want to deal with moral uncertainty with credences, you could assign each of the 3 major assumptions an independent credence of 50%, so this argument would tell you that you should be utilitarian with credence at least 0.5^3 = 12.5%. (Assigning independent credences might not actually make sense, in case you have to deal with contradictions with other assumptions.)

On the other hand, pointing out that utilitarians care about people and animals, and want them to be as happy as possible (and free, or with agency, desire satisfaction), makes me happy to endorse the theory. When I think about all people and animals being happy and free from pain in a utilitarian world, I get a positive feeling.

Makes sense. For what it's worth, this seems basically compatible with any theory which satisfies the Pareto principle, and I'd imagine you'd also want it to be impartial (symmetry). If you also assume real-valued utilities, transitivity, independence of irrelevant alternatives, continuity and independence of unconcerned agents, you get something like utilitarianism again. In my view, independence of unconcerned agents is doing most of the work here, though.

Like other links between VNM and utilitarianism, this seems to sweep intersubjective utility comparison under the rug. The agents are likely using very different methods to convert their preferences to the given numbers, rendering the aggregate of them non-rigorous and subject to instability in iterated games.

I can't tell whether you are denying assumption 1 or 2.

I don't think Romeo even has to deny any of the assumptions. Harsanyi's result, derived from the three assumptions, is not enough to determine how to do intersubjective utility comparisons. It merely states that social welfare will be some linear combination of individual utilities. While this already greatly restricts the way in which utilities are aggregated, it does not specify which weights to use for this sum.

Moreover, arguing that weights should be equal based on the veil of ignorance, as I believe Harsanyi does, is not sufficient, since utility functions are only determined up to affine transformations, which includes rescalings. (This point has been made in the literature as a criticism of preference utilitarianism, I believe.) So there seems to be no way to determine what equal weights should look like, without settling on a way to normalize utility functions, e.g., by range normalization or variance normalization. I think the debate about intersubjective utility comparisons comes in at the point where you ask how to normalize utility functions.
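
To make the normalization point concrete, here is a sketch with invented utility numbers: which option the equal-weight sum selects depends on whether the raw numbers are used or the utility functions are first range- or variance-normalized, and different normalizations need not agree in general.

```python
# Sketch: the "equal weights" sum depends on how utility functions are normalized.
# Utility numbers are invented for illustration; person 1's scale is 100x person 2's.
import statistics

options = ["A", "B", "C"]
utilities = {
    "p1": {"A": 0.0, "B": 100.0, "C": 60.0},
    "p2": {"A": 1.0, "B": 0.0, "C": 0.9},
}

def range_normalize(u):
    lo, hi = min(u.values()), max(u.values())
    return {k: (v - lo) / (hi - lo) for k, v in u.items()}

def variance_normalize(u):
    mean, sd = statistics.mean(u.values()), statistics.pstdev(u.values())
    return {k: (v - mean) / sd for k, v in u.items()}

for name, norm in [("raw", lambda u: dict(u)),
                   ("range", range_normalize),
                   ("variance", variance_normalize)]:
    normalized = {p: norm(u) for p, u in utilities.items()}
    totals = {o: sum(normalized[p][o] for p in normalized) for o in options}
    print(name, max(totals, key=totals.get))

# raw      -> B (person 1's larger scale dominates)
# range    -> C
# variance -> C
```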

Of course, if you are not using a kind of preference utilitarianism but instead just aggregate some quantities you believe to have an absolute scale—such as happiness and suffering—then you could argue that utility functions should just correspond to this one absolute scale, with the same scaling for everyone. Though I think this is also not a trivial argument—there are potentially different ways to get from this absolute scale or axiology to behavior towards risky gambles, which in turn determine the utility functions.

> there seems to be no way to determine what equal weights should look like, without settling on a way to normalize utility functions, e.g., by range normalization or variance normalization. I think the debate about intersubjective utility comparisons comes in at the point where you ask how to normalize utility functions.

yup, thanks. Also across time as well as across agents at a particular moment.

As a fan of Nickelback, I really appreciate fn2.

I want to point out that both assumption 2 on its own, and assumptions 1 and 3 together, have been objected to by academic philosophers.

Assumption 2 is ex post consequentialism: maximize the expected value of a social welfare function. Ex ante prioritarianism/egalitarianism means rejecting 2: we should be fair to individuals with respect to their expected utilities, even if this means overall worse expected outcomes. This is, of course, vNM irrational, but Diamond defended it (and see my other comment here). Essentially, even if two outcomes are equally valuable, a probabilistic mixture of them can be more valuable because it gives people fairer chances; this is equality of opportunity. This contradicts the independence axiom specifically for vNM rationality (and so does the Allais paradox).
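
Here is a hedged sketch of that kind of violation (the inequality-penalty form below is an invented illustration, not Diamond's own formulation): an ex ante egalitarian evaluation can strictly prefer the 50/50 lottery over (2, 0) and (0, 2) to either sure outcome, even though it ranks the two sure outcomes equally, which is exactly what the vNM independence axiom rules out.

```python
# Sketch: an ex ante egalitarian rule prefers the fair lottery to either sure outcome.
# The "sum minus inequality penalty" form is an invented illustration.
def ex_ante_value(expected_utils, inequality_weight=0.5):
    # Sum of expected utilities, minus a penalty for how unequal the expectations are.
    return sum(expected_utils) - inequality_weight * (max(expected_utils) - min(expected_utils))

sure_a = (2, 0)        # person 1 gets 2 for sure
sure_b = (0, 2)        # person 2 gets 2 for sure
fair_lottery = (1, 1)  # each person's *expected* utility under the 50/50 coin flip

print(ex_ante_value(sure_a), ex_ante_value(sure_b))  # 1.0 1.0 -- ranked equally
print(ex_ante_value(fair_lottery))                   # 2.0 -- the mixture beats both ingredients
```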

Assumptions 1 and 3 together are basically a weaker version of ex ante Pareto, according to which it's (also) better to increase the expected utility of any individual(s) if it comes at no expected cost to any other individuals. Ex post prioritarianism/egalitarianism means rejecting the conjunction of 1 and 3, and ex ante Pareto: we should be more fair to individuals ex post (we want more fair actual outcomes after they're determined), even if this means worse individual expected outcomes.

There was a whole issue of Utilitas devoted to prioritarianism and egalitarianism in 2012, and, notably, Parfit defended prioritarianism in it, arguing against ex ante Pareto (and hence the conjunction of 1 and 3):

When Rawls and Harsanyi appeal to their versions of Veil of Ignorance Contractualism, they claim that the Equal Chance Formula supports the Utilitarian Average Principle, which requires us to act in ways that would maximize average utility, by producing the greatest sum of expectable benefits per person. This is the principle whose choice would be rational, in self-interested terms, for people who have equal chances of being in anyone’s position.
We can plausibly reject this argument, because we can reject this version of contractualism. As Rawls points out, Utilitarianism is, roughly, self-interested rationality plus impartiality. If we appeal to the choices that would be rational, in self-interested terms, if we were behind some veil of ignorance that made us impartial, we would expect to reach conclusions that are, or are close to being, Utilitarian. But this argument cannot do much to support Utilitarianism, because this argument’s premises are too close to these conclusions. Suppose that I act in a way that imposes some great burden on you, because this act would give small benefits to many other people who are much better off than you. If you object to my act, I might appeal to the Equal Chance Formula. I might claim that, if you had equal chances of being in anyone’s position, you could have rationally chosen that everyone follows the Utilitarian Principle, because this choice would have maximized your expectable benefits. As Scanlon and others argue, this would not be a good enough reply.9 You could object that, when we ask whether some act would be wrong, we are not asking a question about rational self-interested choice behind a veil of ignorance. Acts can be wrong in other ways, and for other reasons.

He claimed that we can reject ex ante Pareto ("Probabilistic Principle of Personal Good"), in favour of ex post prioritarianism/egalitarianism:

Even if one of two possible acts would be expectably worse for people, this act may actually be better for these people. We may also know that this act would be better for these people if they are worse off. This fact may be enough to make this act what we ought to do.

Here, by "worse off" in the second sentence, he meant in a prioritarian/egalitarian way. The act is actually better for them, because the worse off people under this act are better off than the worse off people under the other act. He continued:

We can now add that, like the Equal Chance Version of Veil of Ignorance Contractualism, this Probabilistic Principle has a built-in bias towards Utilitarian conclusions, and can therefore be rejected in similar ways. According to Prioritarians, we have reasons to benefit people which are stronger the worse off these people are. According to Egalitarians, we have reasons to reduce rather than increase inequality between people. The Probabilistic Principle assumes that we have no such reasons. If we appeal to what would be expectably better for people, that is like appealing to the choices that it would be rational for people to make, for self-interested reasons, if they had equal chances of being in anyone’s position. Since this principle appeals only to self-interested or prudential reasons, it ignores the possibility that we may have impartial reasons, such as reasons to reduce inequality, or reasons to benefit people which are stronger the worse off these people are. We can object that we do have such reasons.
When Rabinowicz pointed out that, in cases like Four, Prioritarians must reject the Probabilistic Principle of Personal Good, he did not regard this fact as counting against the Priority View. That, I believe, was the right response. Rabinowicz could have added that similar claims apply to Egalitarians, and to cases like Two and Three.

Thanks for this.

Even if this argument is successful, there are debates over decision theory (evidential, causal, functional). Does an ideally rational agent intervene at the level of states, actions, or decision procedures?

If it's decision procedures, or something similar, functional decision theory can get you views that look quite close to Kantianism.

Just as a side note, Harsanyi's result is not directly applicable to a formal setup involving subjective uncertainty, such as Savage's or the Jeffrey-Bolker framework underlying evidential and causal decision theory. Though there are results for the Savage setup too, e.g., https://www.jstor.org/stable/10.1086/421173, and Caspar Oesterheld and I are working on a similar result for the Jeffrey-Bolker framework. In this setup, to get useful results, the indifference axiom can only be applied to a restricted class of propositions where everyone agrees on beliefs.

Some discussion here, too.

I would actually say that the 50/50 lottery over (2, 0) and (0, 2) being equivalent to (2, 0) and to (0, 2) is in contradiction with equality of opportunity. In the first case, both individuals have an equal chance of being well-off (getting 2), but in the second and third, only one has any chance of being well-off, so the opportunities to be well-off are only equal in the first case (essentially the same objection to essentially the same case is made in "Cardinal Welfare, Individualistic Ethics, and Interpersonal Comparison of Utility: Comment", in which Peter Diamond writes "it seems reasonable for the individual to be concerned solely with final states while society is also interested in the process of choice"). This is what ex ante prioritarianism/egalitarianism is for, but it can lead to counterintuitive results. See the comments on that post, and "Decide As You Would With Full Information! An Argument Against Ex Ante Pareto" by Marc Fleurbaey & Alex Voorhoeve.

For literature on equality of outcomes and uncertainty, the terms to look for are "ex post egalitarianism" and "ex post prioritarianism" (or with the hyphen as "ex-post", but I think Google isn't sensitive to this).

Yeah, my point was that ex-ante utility was valued equally, but I think that was confusing. I'm just going to remove that section. Thanks!

Concretely, assume:
1. Each individual in the group is rational (for a commonly used but technical definition of “rational”, hereafter referred to as “VNM-rational”)[1][2]
2. The group as a whole is VNM-rational[3][4]
3. If every individual in the group is indifferent between two options, then the group as a whole is indifferent between those two options

One way of motivating 3 is by claiming (in the idealistic case where everyone's subjective probabilities match, including the probabilities that go with the ethical ranking):

a. Individual vNM utilities track welfare and what's better for individuals, and not having it do so is paternalistic. We should trust people's preferences when they're rational since they know what's best for themselves.

b. When everyone's preferences align, we should trust their preferences, and again, not doing so is paternalistic, since it would (in principle) lead to choices that are dispreferred by everyone, and so worse for everyone, according to a.*

As cole_haus mentioned, a could actually be false, and a motivates b, so we'd have no reason to believe b either if a were false. However, if we use some other real-valued conception of welfare and claim what's good for individuals is maximizing its expectation, then we could make an argument similar to b (replacing "dispreferred by everyone" with "worse in expectation for each individual") to defend the following condition, which recovers the theorem:

3'. If, for two options A and B, each individual's expected welfare is the same in the two options, then we should be ethically indifferent between the options.

*As alluded to here, if your ethical ranking of choices broke one of these ties so that A > B, it would do so with a real number-valued difference, and by the continuity axiom, you could probabilistically mix the choice you broke the tie in favour of, A, with any choice C that's worse for everyone than the other choice B, and this could be made better than B according to your ethical ranking, i.e. pA + (1−p)C > B for any p close enough to 1, while everyone has the opposite preference over these two choices.
