How to Measure Capacity for Welfare and Moral Status

Jason Schukraft

zdgroff

Great post, and I'm excited to see RP work on this. I have great confidence in your carefulness about this.

A concern I have with pretty much every approach to weighting welfare across species is that it seems like the correct weights may depend on the type of experience. For example, I could imagine the intensity of physical pain being very similar across species but the severity of depression from not being able to move to vary greatly.

Is there a way to allow for this within the approach you lay out here?

Jason Schukraft

Hi Zach,

Thanks for your comment. Measuring and comparing welfare across species is a tremendous theoretical and practical challenge. For measuring capacity for welfare, we would want to get a rough sense of the range of physical pain and pleasure an animal can experience as well as the range of emotional pain and pleasure an animal can experience. We would also want to know the degree to which physical and emotional pain/pleasure contribute to overall welfare, and this may differ by species. (We will need to account for combination effects: among other things, "stacking" one unit of physical pain on top of one unit of emotional pain may create more or less than two units of overall suffering.) All else being equal, if two animals have the same range of possible physical pains and pleasures, but animal A has a greater range of possible emotional pains and pleasures than animal B, we would expect animal A to have a greater capacity for welfare than animal B.

One thing to keep in mind is that what ultimately matters morally is realized welfare, not capacity for welfare. In many instances, judging the effectiveness of an intervention will require looking at species-specific differences in the way welfare is realized. Two animals may have the same overall capacity for welfare, and they may be subject to the same conditions (solitary confinement, say), but species-specific differences (one is a social animal and the other is not, say) may indicate that one animal suffers much more than the other in those conditions.

Nonetheless, I do believe thinking about capacity for welfare will help increase the efficiency with which our resources are allocated across interventions, especially when applied to big-picture questions, like "What percentage of our resources should ideally go to fish or crustaceans or insects?"

Anthony DiGiovanni 🔸

We could also ask how many days of one’s human life one would be willing to forgo to experience some duration of time as another species. This approach would allow us to assign cardinal numbers to the value of animal lives.

I hope I’m not being too obvious here, but I’ve seen people frequently speak of animals “mattering” X times as much as a human, say, without drawing this distinction: we’d need to be very careful to distinguish what we mean by value of life. For prioritizing which lives to save, this quote perhaps makes sense. But not if “value of animal lives” is meant to correspond to how much we should prioritize alleviating different animals’ suffering. I wouldn’t trade days of my life to experience days of a very poor person’s life, but that doesn’t mean my life is more valuable in the sense that helping me is more important. Quite the opposite: the less value there is in a human’s/animal’s life, the more imperative it is to help them (in non-life-saving ways), for reasons of diminishing returns at least.

I would strongly encourage surveys about intuitions of this sort to precisely ask about tradeoffs of experiences, rather than “value of life” (as in the Norwood and Lusk survey that you cite).

Jason Schukraft

Yeah, I agree that estimating welfare (either average realized welfare or capacity for welfare) this way is a bad strategy for a number of reasons. There are going to be many confounders and the framing of the thought experiment obscures rather than clarifies the issue.

Michael St Jules 🔸

Combination effects seem challenging as you point out. I think it's often taken for granted that weighting things should be done linearly, but there really isn't any reason to believe this would approximate the moral truth or what we'd want to care about upon reflection in this domain, although it's useful for its simplicity, interpretability and transparency.

Another specific challenge is whether we should apply a given (usually monotonic) transformation to a feature that comes in degrees first. For example, if the degree of $X$ matters, say neuron count or neuron count in a particular part of the brain, should we use $X$, $X^{2}$, $\sqrt{X}$, $\log X$, $2^{X}$, or something else? There are infinitely many degrees of freedom here.
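To make the point about transformations concrete, here is a minimal sketch of how much the choice matters. The neuron counts are hypothetical round numbers chosen only for illustration, and `relative_weights` is a helper name introduced here, not something from the post.

```python
import math

# Hypothetical neuron counts (illustrative round numbers, not measurements).
neuron_counts = {"species_a": 1e6, "species_b": 1e8, "reference": 1e10}

# Candidate monotonic transformations of the feature X.
# 2**X is left out: for counts this large it overflows floating point.
transforms = {
    "X": lambda x: x,
    "X^2": lambda x: x ** 2,
    "sqrt(X)": math.sqrt,
    "log X": math.log,
}


def relative_weights(counts, transform, reference="reference"):
    """Return each species' transformed value divided by the reference's."""
    ref_value = transform(counts[reference])
    return {name: transform(x) / ref_value for name, x in counts.items()}


for label, transform in transforms.items():
    weights = relative_weights(neuron_counts, transform)
    print(label, {name: round(w, 8) for name, w in weights.items()})
```

With these made-up inputs, the weight of `species_a` relative to the reference ranges from about one in a hundred million (under $X^{2}$) to more than one half (under $\log X$), so the transformation can dominate the result.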

Jacob_Peacock

Hi Jason, thank you for writing this. I appreciate the refreshing reiteration that we do and must make trade-offs between the interests of different species, as well as your careful philosophical treatment. A few thoughts:

An animal’s capacity for welfare is how good or bad its life can go. An animal’s moral status is the degree to which an animal’s experiences or interests matter morally.

While capacity and moral weight are important parameters, I think there also remains significant empirical uncertainty about actual experience as well. Without eliminating this uncertainty, estimates of the two former values may not be especially useful.

(1) a holistic approach, in which relevant experts employ their normative and biological expertise to make all-things-considered estimates of the appropriate tradeoffs between different lives, experiences, or interests, and (2) an atomistic approach, in which we identify empirical proxies for morally salient features, then let our best scientific understanding of the degree to which different animals possess those features guide our estimates of comparative moral value. The two approaches are not in principle mutually exclusive.

As you indicate, these are, of course, not mutually exclusive. However, I suspect they overlap so much that they are not worth distinguishing, since any reasonable application would use both approaches. As you suggest, the weightings of the atomistic features would rely on expert judgement, as would estimates of combination effects, which could occur at the species (or even individual) level. For example, Bracke 2019 is the best study I've seen comparing a wide array of chicken housing conditions. In the study, a panel of chicken welfare experts was provided a set of "atomistic" attributes (e.g., stocking density, temperature, light exposure) about different housing conditions to inform holistic judgments of the relative welfare of each system. While this is not exactly the same task as assessing capacity for welfare and moral status, it seems analogous and illustrative of the need for a hybrid approach.

So I think there is good reason in general to worry that unwanted considerations unduly sway one’s intuitions about the value of nonhuman animals.

I agree, but this might be mitigated by including these as explanatory variables. For example, the impact of speciesism could at least be examined and potentially controlled for by inclusion of the above-cited speciesism scale or the impact of diet patterns by inclusion of a diet screener.

Personally, I think order is probably the right rank at which to investigate the subject.

This seems very unlikely to be the correct taxa in my opinion. First, taxa above genus or family are generally arbitrary in scope. Second, relevant traits would likely be heterogeneous within such a broad group. For example, within the order of bivalves, there are sessile and motile species, and species with a dozen plus compound eyes or "eyes" that detect only light and dark.

Jason Schukraft

Hi Jacob,

Thanks for your comment! I’m happy to chat in more detail if you’d like to set up a call.

While capacity and moral weight are important parameters, I think there also remains significant empirical uncertainty about actual experience as well.

I agree, and I fully support more research aimed at figuring out how to measure realized welfare. For many comparisons of specific interventions, learning more about the realized welfare of a given group of animals (and how a change in conditions would affect realized welfare) is going to be much more action-relevant than information about capacity for welfare. Considerations pertaining to capacity for welfare are most pertinent to big-picture questions about how we should allocate resources across fairly distinct types of animals (e.g., chickens vs. fish vs. crustaceans vs. insects). I think some uncertainties surrounding capacity for welfare can be resolved without fully solving the problem of how to measure realized welfare in every case. Of course, measuring realized welfare and measuring capacity for welfare share many of the same conceptual and practical hurdles, so we may be able to make progress on the two in tandem.

While this is not exactly the same task as assessing capacity for welfare and moral status, it seems analogous and illustrative of the need for a hybrid approach.

Not sure how much we disagree here. I certainly think all-things-considered expert judgments have an important role to play in assessing capacity for welfare. The post emphasizes the atomistic approach because it’s a lot more complicated (and thus warrants deeper explanation) and also because it’s much more likely to uncover action-relevant information that our untutored all-things-considered judgments may miss. (I liken the project to RP’s previous work on invertebrate sentience, which required many subjective judgment calls but ultimately whose main contribution was a compilation of hard data on 53 empirically measurable features that are relevant to assessing whether or not an animal is sentient.)

This seems very unlikely to be the correct taxa in my opinion. First, taxa above genus or family are generally arbitrary in scope. Second, relevant traits would likely be heterogeneous within such a broad group.

Yeah, I could be convinced that order is the wrong taxonomic rank. My main concern is tractability. The scale of the potential project is already so enormous, and moving from order to family could easily add another 500-1000 hours of work. My hope was that we would be able to discern some broad trends at the level of order (which could be refined in the future). But if neither time nor money were a particular concern, then, for the reasons you outline, I think family would be a much better rank at which to investigate these questions.

Again, happy to talk more if you’re interested!

Jacob_Peacock

Thanks for the helpful clarifications and responses, Jason. I don't have anything to add at this point, but look forward to reading more of your work!

MichaelPlant

Thanks for writing this up. It seems what you've done with the atomistic approach is state what, in principle, one would need to do, but not really wrestle with the difficulties and details of doing it. By analogy, it's a bit like you've said "if we want to get to space, we need to build a spaceship" but not said how to build a spaceship ("well, it would need to get into space, and carry people, ...").

I think it would help to spell out a particular issue. Suppose we think happiness, the intrinsic pleasurableness/displeasurableness of experiences, is one of the things that constitutes welfare. Okay, what proxy do we use for that? Happiness is a subjective experience, so no objective measure is possible. Of course, we have intuitions about relative magnitudes of happiness in different animals, but what makes us think we're right, even approximately?

(I note I raised effectively the same concern on your previous post and you haven't (yet) replied to my latest comment. You linked me to this paper, but it doesn't address my concern: the author surveys different "suffering calculators" but doesn't provide an account of how we would test whether some are more valid than others.)

Jason Schukraft

Hi Michael,

Thanks for your comment.

Happiness is a subjective experience, so no objective measure is possible. Of course, we have intuitions about relative magnitudes of happiness in different animals, but what makes us think we're right, even approximately?

This is an important concern, but I think we disagree about what it would take to satisfy this concern. It’s true that we don’t and can’t have direct access to the subjective experience of nonhuman animals. But of course we also don’t and can’t have direct access to the subjective experience of other humans. Subjective experience is, well, subjective. So whenever we conclude that a fellow human is happy or sad, we’re doing so on the basis of indirect evidence.

Now, most humans can give verbal reports of their subjective states, which is about as good a kind of indirect evidence as we could hope for. But not all humans can do that. I take it as a datum that we can know a great deal about the subjective states of babies. Maybe you deny that. If so, that’s an interesting crux.

If you agree that we can know about the subjective states of babies, then that establishes that it is in principle possible to know about the subjective experience of non-verbal animals in the absence of direct evidence. Admittedly, this type of inference gets harder as we move to nonhuman animals, and harder still as we move farther out in phylogenetic distance. But we should clearly distinguish practical difficulties from conceptual difficulties. There’s nothing particularly conceptually dubious about abductive reasoning; inference to the best explanation is used in many areas of both philosophy and science.

Have you read Michael Tye’s Tense Bees and Shell-Shocked Crabs? He discusses these questions in a bit more detail. You could also take a look at our introduction to the invertebrate sentience project, especially the project rationale section. I’d be happy to schedule a meeting to talk in more detail if you want.

MichaelPlant

Thanks for your response, but I don't think you're grasping the nettle of my objection. I agree with you that you and I both think we know something about the mental states of other adult humans and, further, human babies. I also think such assumptions are reasonable, if empirically unprovable. But that's not my point.

In short, my challenge is: articulate and defend the method you will use to determine how much more or less happy humans are than non-human animals in particular contexts, say the average human vs. the average factory-farmed chicken.

Here's what I think we can do with humans. We assume you and I have the same capacity for happiness. We assume we are able to learn about the experiences of others and communicate them via language, e.g. we've both stubbed our toes, but I haven't broken my leg, and when you say "breaking my leg is 10x worse" I can conclude that would be true for me too. Hence, when you say "I feel 2/10" or "I feel terrible" I might feel confident you mean the same things by those as I do.

What can we do with chickens? We really have no idea what chickens' capacities for happiness are: is it 1/10th, 1/100th, etc.? It doesn't seem at all reasonable to assume they are roughly the same as ours. The chicken cannot tell us how happy it is relative to its maximum, our maximum, or, indeed, tell us anything at all. Of course, we may have intuitions, what we might pejoratively call "tummy feelings", about these things. Fine. But what method do we use to assess if those intuitions are correct? The application of further intuitive reflection? Surely not. I cannot think of a justifiable empirical method to inform our priors. If you can explain why this project is not doomed, I would love to know why! But I fear it is.

Jason Schukraft

Hi Michael,

Thanks for your comment. This is a complicated topic, so it’s easy for well-meaning folks to talk past one another. For that reason, I’ll encourage you again to reach out to schedule a call to discuss in further detail.

Since this area is so under-explored, I think there is a large range of reasonable expectations about the outcome of the sort of project I outline in the post. I can try to give you some insight into why I’m more optimistic than you are, but that’s not to say that your pessimism is outside the range of reasonable attitudes one could take to the project.

One reason I’m optimistic is because in my own limited experience exploring questions of comparative moral value, the returns have thus far been quite high. Let me give just one example.

The subjective experience of time is plausibly an important determinant of realized welfare and capacity for welfare. There are plausible empirical proxies we can use to approximate differences in the subjective experience of time. Critical flicker-fusion frequency (CFF) is an especially well-studied measure, so I’ll use it in this example, but I think there are probably better metrics. (I’m currently writing a report on this subject; stay tuned for details.) If CFF tracks the subjective experience of time, then higher values represent more subjective moments per objective unit of time. The typical human has a max CFF threshold of around 60 Hz. Chickens have a max CFF threshold around 87 Hz. Honey bees have a max CFF threshold of around 200 Hz. So that's an example of a way we might directly compare three important animals on a metric that might track an important welfare determinant.

Now I’m not saying CFF is a perfect measure of the subjective experience of time. It’s not. In fact, my best guess is that there’s only a ~30% chance it tracks the subjective experience of time under the best conditions. (Again, see my forthcoming report for extensive discussion.) But the illustrative point here is that there may exist empirically measurable proxies for features we care about that allow us to compare capacity for welfare across species. If we don’t at least try to locate such proxies, we’ll never know if they exist. Given the stakes, it seems reasonable to me to devote a small fraction of our collective resources to think more carefully about these very difficult issues.
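As a rough sketch of the arithmetic behind the CFF example, the snippet below computes each animal's CFF relative to humans and then discounts it by the ~30% credence mentioned above. The fallback used when CFF does not track subjective time (a multiplier of 1, i.e. no difference from humans) is an assumption introduced here for illustration; it is not something the comment claims, and the function names are hypothetical.

```python
# Maximum CFF thresholds quoted in the comment above, in Hz.
cff_hz = {"human": 60.0, "chicken": 87.0, "honey bee": 200.0}

# The ~30% credence that CFF tracks the subjective experience of time.
P_CFF_TRACKS_TIME = 0.30


def ratio_to_human(species):
    """CFF of a species divided by the human CFF."""
    return cff_hz[species] / cff_hz["human"]


def expected_multiplier(species, p=P_CFF_TRACKS_TIME):
    """Credence-weighted multiplier on subjective moments per second.

    Assumption (illustrative only): if CFF does not track subjective time,
    the multiplier is 1.0, meaning no difference from humans.
    """
    return p * ratio_to_human(species) + (1.0 - p) * 1.0


for species in cff_hz:
    print(species, round(ratio_to_human(species), 3),
          round(expected_multiplier(species), 3))
```

Under these assumptions the raw ratios are 1.45 for chickens and about 3.33 for honey bees, while the credence-weighted multipliers shrink to roughly 1.14 and 1.7. A different fallback assumption would give different numbers.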

Michael St Jules 🔸

I share your overall pessimism of arriving at an answer that will actually be satisfying philosophically, but I do think research in this area is still important and useful. Our ultimately subjective judgements can be better informed.

We assume you and I have the same capacity for happiness.

I think the same problem applies here too, because of the uniqueness of humans (our nervous systems, the density of nerve endings, the thickness of our skin, etc.), although it's much more reasonable to generalize from one human to another than between species, because of similarity. Still, I don't think it's actually reasonable, using the same standard; I might as well be a talking alien. And we have no way of objectively quantifying how reasonable this approximation is or whether one human's welfare capacity is greater or lower than another's.

That being said, I don't think you always need this assumption for humans anyway, e.g. if you're randomly sampling humans to survey from the same distribution that you're generalizing to (or sampling humans to generalize to), since the estimator can be chosen to be statistically unbiased, regardless of how well it measures what we actually care about. (However, in practice, the distributions often aren't the same, and we know of generalizability issues due to that, e.g. WEIRD. You can adjust/match/control for certain characteristics, but you can never really eliminate all bias. And for something subjective like welfare, we can't bound the bias from the underlying concept we care about, either, even if it were possible to bound the statistical bias, for the same reason we can't bound how different my experience of a toe stub is from yours.)

On the other hand, we can't do this with nonhuman animals, since we're sampling from humans and generalizing beyond humans. The distributions are definitely not the same.

MichaelPlant

Right. My thought is that we assume humans have the same capacity on average, because while there might be differences, we don't know which way they'll go so they should 'wash out' as statistical noise. Pertinently, this same response doesn't work for animals because we really don't know what their relative max capacities are.

FWIW, the analogue to my response here would be to say we can expect all chickens to have approximately the same capacity as each other, even if individual chickens differ. The claim isn't about humans per se, but about similarities borne out of genetics.

Michael St Jules 🔸

My thought is that we assume humans have the same capacity on average, because while there might be differences, we don't know which way they'll go so they should 'wash out' as statistical noise.

In another comment, I mentioned that I think this is actually only fair to assume while we don't know much about the individual humans. We could break this symmetry pretty easily.

FWIW, the analogue to my response here would be to say we can expect all chickens to have approximately the same capacity as each other, even if individual chickens differ. The claim isn't about humans per se, but about similarities borne out of genetics.

Since humans also differ from each other genetically, isn't the distinction here just a matter of degree?

Michael St Jules 🔸

You might also think you can generalize between you and me using a symmetry argument, but this is only by willful ignorance. We could learn more about each other in a way that would suggest one of us experiences certain things more intensely than the other (e.g. based on the sizes of the parts of our brains used for processing emotion, our personalities or experiences), and ignoring these differences would be the same philosophically as ignoring the differences between humans and chickens. We might learn differences that go in each direction for you and me, resulting in a moral complex cluelessness, but the same can actually happen with nonhuman animals, too: there are reasons to believe some nonhuman animals could typically experience some things more intensely than us, e.g. our better awareness of the context around an experience can reduce its intensity, and some animals have faster processing times. It's plausible enough to me that dogs have higher highs in practice than me (although maybe I'm capable of higher highs; they just don't happen).

Michael St Jules 🔸

Have you considered a (semi-)blind approach? Collect data on each of the species/taxa of interest into a table, but hide the species (except possibly human, as the reference?) and make moral weight judgements based on that (and the judges can do this without any formal or precise weighting of features if they prefer). You could also have separate people do the research and prepare the table from those who make the judgements, to reduce the identifiability of the species/taxa from the data, although this risk won't really go away.
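A minimal sketch of what the blinding step might look like, assuming the data is a simple mapping from species name to feature values. The function name `blind_table`, the label format, and the single-feature table are all hypothetical; the only values used are the CFF thresholds quoted elsewhere in this thread.

```python
import random


def blind_table(rows, reference="human", seed=None):
    """Replace species names with opaque labels, keeping the reference visible.

    Returns the blinded table and the key that maps real names to labels.
    The key would be held by the people who prepared the table, not the judges.
    """
    rng = random.Random(seed)
    hidden = [name for name in rows if name != reference]
    rng.shuffle(hidden)  # so label order does not leak the original row order
    key = {name: f"Species {i + 1}" for i, name in enumerate(hidden)}

    blinded = {}
    if reference in rows:
        blinded[reference] = dict(rows[reference])
    for name in hidden:
        blinded[key[name]] = dict(rows[name])
    return blinded, key


# Example table with one feature, using the CFF values quoted in this thread.
table = {
    "human": {"max_cff_hz": 60.0},
    "chicken": {"max_cff_hz": 87.0},
    "honey bee": {"max_cff_hz": 200.0},
}

blinded, key = blind_table(table, seed=1)
print(blinded)
```

As the comment notes, this only hides the labels. A judge who knows the literature may still recognize a species from its feature values, so the sketch reduces identifiability rather than removing it.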

Jason Schukraft

Yeah, that's an interesting idea. Sounds pretty good in principle, though I imagine fairly hard to implement in practice. AI Impacts did something similar last year when they investigated the relationship between neuron count and general intelligence. They prepared anonymized descriptions of the behavior of four species (two birds and two primates). Survey participants were asked to judge which animals were more intelligent on the basis of the anonymized descriptions. (The birds scored about the same as the primates.)

RomeoStevens

Appreciate the care taken, especially in the atomistic section. One thing is that it seems to assume that the best we can do with such a research agenda is analyze correlates, whereas what we really want is a causal model.

This is very much not an exhaustive list. ↩︎
See this spreadsheet for details. By my count, every order in the spreadsheet is exploited in numbers greater than ~50 million individuals per year. ↩︎
Humans also indirectly affect many wild animals, and many wild animals suffer independent of any human interference. In this series I focus primarily on animals that humans exploit directly, most of which are farmed. Because the goal of the project is to improve the way resources are allocated across interventions, it makes sense at this time to focus on animals that are directly exploited. As the effective animal advocacy movement identifies more interventions to aid wild animals, we will want to include those animals in our measures of comparative moral value. ↩︎
See Budolfson & Spears 2019 for more on the measurement problem. ↩︎
A method that was more limited (to say farmed land vertebrates) could still be useful even if less than ideal. ↩︎
This list is adapted from Browning 2020. Her focus is on measuring realized welfare, but I think the desiderata apply equally well to measuring capacity for welfare and moral status. ↩︎
I’m not claiming here that philosophical disputes are in principle irresolvable, just that they are usually much less tractable than empirical questions. ↩︎
For instance, we may want to adopt a precautionary principle (Birch 2017) in the face of large uncertainty. ↩︎
For a recent example to the contrary, see Founders Pledge’s report comparing the value of donations to The Humane League to the value of donations to the Against Malaria Foundation. For another potential example to the contrary, see Charity Entrepreneurship’s weighted welfare index. Note that although the CE index is meant to improve the way resources are allocated across species, it does not explicitly address moral status or capacity for welfare. ↩︎
He adds, “Some comparisons may be too difficult. We may have to say that we have not the slightest idea whether it would be better to be a fish or a snake; but then, we do not very often find ourselves forced to choose between killing a fish or a snake. Other comparisons might not be so difficult. In general, it does seem that the more highly developed the mental life of the being, the greater the degree of self-awareness and rationality and the broader the range of possible experiences, the more one would prefer that kind of life, if one were choosing between it and a being at a lower level of awareness” (Singer 2011: 92). ↩︎
Note that Kagan’s presentation is a bit misleading here. Comparing the welfare of a human’s lifetime with the welfare of a fly’s lifetime is a comparison in diachronic welfare, that is, welfare over time. But humans live much longer than flies; thus they have much longer to amass welfare. So it’s not really a fair comparison. Fruit flies only live about 30 days. If one refused to forgo a single day of human life for an extra lifetime (30 days) as a fly, we should infer that a day in the life of a typical fly contains less than one thirtieth the welfare of a day in the life of a typical human. ↩︎
See Wuensch, Jenkins, & Poteat 2002. ↩︎
See Herzog, Grayson, & McCord 2015 for a shorter version of the scale. ↩︎
Mice, rats, rabbits, pigs, monkeys, octopuses, chickens, badgers, zebrafish, tree shrews, dogs, dolphins, parrots, chimpanzees, badgers, and pigeons. ↩︎
The specific uses are medical research, basic science research, food production, pest control, and “other.” Note that specifying specific uses probably introduces many confounding influences to the responses. ↩︎
The survey was conducted for the American Farm Bureau Federation and is unfortunately no longer available online. Norwood and Lusk discuss the survey in their 2011 book Compassion, by the Pound: The Economics of Farm Animal Welfare (pp. 171-172). A popularization of the survey appeared in Reason magazine under the title “You=11,500 Sheep.” ↩︎
The survey utilized a multiple-choice format, so respondents were not able to input any number they wanted for X. For all questions, the first program affected one thousand animals and the possible values for X for the second program were 1, 500, 1001, 2000, 5000, 10000, 100000, and 1000000. ↩︎
Also of note: “Nearly one-third (30%) of respondents reported that they believed animal suffering should be taken into account to a degree equal to or above human suffering” (1). ↩︎
As just one example of the extent of this ignorance, I hypothesize that few members of the general public would guess that snails are more closely related to squid than earthworms are to silkworms. ↩︎
It may still be useful to survey the lay public. Such surveys may help us identify biases that influence ours and others’ judgments. It’s also possible that a wide enough survey may reveal a latent ‘wisdom of the crowd,’ which would allow us to extract a useful signal from the random noise of our unreliable, arbitrary intuitions. ↩︎
See Cuddington et al. 2013 for more on the tradeoff between the speed and opacity of expert judgment: “While the development of a rule may take some time, expert opinion can be accessed rapidly in most cases, and in some cases is the only information available (O'Neill et al. 2008). However, the role of theory and the assumptions behind expert opinion and rules of thumb are rarely transparent, so there may be little potential for evaluating the assumptions that support models of this sort. Expert opinions inevitably are divergent (e.g., Czembor et al. 2011), although there may be techniques for building consensus among a group of experts (e.g., Delphi technique, Rowe and Wright 1999). It is also possible that the rules of thumb or expert opinion do not include adequate concepts of scale and uncertainty (e.g., Burgman 2005) that are a requirement for appropriate management under global change. However, when the main requirement is that a decision be made extremely quickly with very limited data, expert opinion or rule‐based models have a clear time advantage over other types of models” (3). ↩︎
For an overview of the biases and heuristics literature, see Kahneman 2011. For philosophical examples, see, among others, Machery et al. 2004; Swain, Alexander, & Weinberg 2008; Buckwalter & Stich 2014; and Costa et al. 2014. See Schwitzgebel & Cushman 2012 for a series of experiments in which the moral judgments of professional philosophers were as sensitive to order effects as the judgments of non-philosophers. See De Cruz 2015 §6 for a general discussion about whether these findings undermine the view that professional philosophers are ‘expert intuiters.’ ↩︎
See Serpell 2004, Wynne 2007, and Herzog 2010 for overviews. ↩︎
Alternatively, the exercise might tell us something about average realized welfare rather than capacity for welfare. A species might have a huge capacity for welfare, but if in fact the members of that species tend to lead net-negative lives, we would never want to be a member of that species. ↩︎
Similar concerns apply to measuring the relative value of human health outcomes, which is crucial for calculating QALYs. ↩︎
Of course, which facts are available in the scientific literature might be influenced by bias. There is probably comparatively more information on traits that humans find interesting. ↩︎
In many respects, the first post in this series already plays this role. However, that post does not explicitly weight the features relevant for moral status or capacity for welfare. ↩︎
This stage might benefit from a survey of relevant philosophical experts. ↩︎
The cells are empty not only because the relevant scientific literature has not been surveyed but also because it is as yet unclear what sort of response the cells merit. Ideally, we want the cells to be comparable, even when the cells are reporting very different metrics, so it might make the most sense to score each cell on some arbitrary scale (e.g., from 1 to 10). But standardizing the process looks very difficult. See the objections section for more discussion. ↩︎
Ignoring superfamilies, infraorders, and suborders, which not all orders have. ↩︎
See this spreadsheet for an overview. ↩︎
One might think humans are unique and thus that this is a special case that says little about the general point. So here is a non-human example: among species in the Carnivora order, neuron count differs by a factor of 24 and brain mass differs by a factor of at least 58 (Jardim-Messeder et al. 2017). ↩︎
Kagan believes there are only around six tiers of moral status (2019: 293). ↩︎
See the first post in the series for details. ↩︎
In practice, measuring neuron counts is anything but straightforward. See the work of Suzana Herculano-Houzel for many discussions of the various complications. ↩︎
We can imagine a lump of billions of neurons swirling around a laboratory jar with no capacity for welfare or moral standing. Conversely, we can imagine an alien or a computer program with zero neurons which nonetheless has a high moral status and capacity for welfare. ↩︎
This does not necessarily give them greater precision in movement; insects on average have similar numbers of distinct muscles in total. ↩︎
On the other hand, there appears to be a longstanding speciesist prejudice against attributing complex mental states to nonhuman animals. Given such a prejudice, scientists and the lay public may have been systematically underestimating the cognitive abilities of nonhuman animals for a long time. Today’s “surprising” results might just be the product of science finally beginning to overcome deep-rooted prejudice. And insofar as the prejudice persists, the competing forces of positive publication bias and speciesist prejudice might even approximately cancel each other out, leaving us with a literature that is largely reliable (though this is an unlikely outcome). In any event, the results captured in the comparative moral value database ought to be checked and updated on a continual basis. If there is a particular study that carries outsized weight in the final analysis, it may be worthwhile to fund a lab to replicate the study. ↩︎
See Dicke & Roth 2016 for more on the importance of cortical neurons, neuron packing density, interneuronal distance and axonal conduction velocity. ↩︎
I tentatively estimate we’ll end up with about 30 features on the list. ↩︎
Examples of agential features include self-awareness, self-control, number of behavior types, executive functions, long-term planning, and capacity for moral responsibility. ↩︎
Examples of physiological features include neuron count, presence of nociceptors, connection of nociceptors to central nervous system, and presence of endogenous opioids. ↩︎
In a recent talk at Notre Dame, Eric Schwitzgebel offers the example of “a superpleasure machine but one with little or no capacity for rational thought. It’s like one giant, irrational orgasm all day long. Would it be great to make such things and terrible to destroy them, or is such irrational pleasure not really something worth much in the moral calculus?” Schwitzgebel is here wondering whether degree of rationality affects the moral value of capacity for pleasure, which would be another example of a combination effect. ↩︎
It would also be helpful to assign probability distributions to the question ‘What weight would you assign to the feature after one hundred more hours of research?’ See this comment from NunoSempere about doing so in the context of investigating sentience. ↩︎
Obviously, when comparing lives saved across species, differences in lifespan will need to be accounted for. ↩︎
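Two of the notes above describe procedures that a short sketch may make more concrete: scoring each cell on a common arbitrary scale (e.g., from 1 to 10), and assigning a probability distribution, rather than a point value, to each feature's weight. The Python below is only an illustration under stated assumptions, not a method the post endorses. The linear rescaling, the uniform weight distributions, the feature names, and every number are hypothetical placeholders. The simple weighted mean also ignores the combination effects discussed in the post.

```python
import random


def rescale(value, lo, hi, new_lo=1.0, new_hi=10.0):
    """Linearly map a raw measurement from [lo, hi] onto [new_lo, new_hi].

    Linear mapping is an assumption; a log scale may suit some metrics better.
    """
    if hi == lo:
        raise ValueError("lo and hi must differ")
    fraction = (value - lo) / (hi - lo)
    return new_lo + fraction * (new_hi - new_lo)


def sample_weighted_score(scores, weight_ranges, rng):
    """Draw one weight per feature and return the weighted mean of the scores.

    Each weight is drawn uniformly from its (low, high) range, which must be
    strictly positive. Uniform draws are an assumption made for simplicity.
    """
    weights = {name: rng.uniform(*weight_ranges[name]) for name in scores}
    total = sum(weights.values())
    return sum(weights[name] * scores[name] for name in scores) / total


def simulate(scores, weight_ranges, n=10_000, seed=0):
    """Return rough 5th, 50th, and 95th percentiles of the overall score."""
    rng = random.Random(seed)
    draws = sorted(
        sample_weighted_score(scores, weight_ranges, rng) for _ in range(n)
    )
    return draws[int(0.05 * n)], draws[n // 2], draws[int(0.95 * n)]


# Hypothetical cell scores for one taxon, already on the 1-to-10 scale.
example_scores = {
    "feature_a": 3.0,
    "feature_b": 7.0,
    "feature_c": rescale(50, 0, 100),  # raw value 50 on a 0-100 metric -> 5.5
}

# Hypothetical uncertainty about how much each feature should count.
example_weight_ranges = {
    "feature_a": (0.1, 0.5),
    "feature_b": (0.2, 1.0),
    "feature_c": (0.1, 0.3),
}

low, median, high = simulate(example_scores, example_weight_ranges)
```

Reporting an interval rather than a single number keeps the uncertainty about the weights visible in the final comparison. Narrowing a weight range after further research would show up directly as a narrower interval.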
