This is great, thanks for writing this. It gave me some clarity about something I’ve been confused about for a long time.
(Context: I’m not an expert in animal welfare. My aim is to sketch a potentially neglected perspective on prioritization, not to give highly reliable object-level advice.)
We seem to be clueless about our long-term impact. We might therefore consider it more robust to focus on neartermist causes, in particular animal welfare.[1] But if we also take seriously our deep uncertainty about our impact on animals, what implications does this have for animal welfare prioritization?
This post will explain:
Practical takeaways for cost-effectiveness analyses:
See footnote for some caveats and context.[2]
Compared to having a robustly positive long-term impact, reducing near-term animal suffering seems quite tractable. We can become relatively confident in (some) interventions’ effects via feedback loops, the track records of similar interventions, and intuitions calibrated on problems of similar complexity.
Still, even just considering our effects on animals in the near term, we face epistemic challenges similar to those of longtermism, though much more manageable. Let’s look at two general categories of these challenges.
First, a classic critique of long-term impact estimates is that they depend on made-up numbers (Greaves; Violet Hour; Masrani), based on intuitions that haven’t been honed by relevant feedback. But suppose we try to weigh the upsides of broad animal welfare movement-building, such as increasing funding for welfare reforms, against the downsides from increasing wild animal suffering (Tomasik). Where do these weights come from? We might go with a rough Fermi estimate, if we had to commit to a precise guess. Yet there are alternatives to committing to a precise guess, which seem more appealing when our decision depends on which guess we pick, as I’ll unpack below (see also this post).
Or, let’s say we’re confident that some charity A wouldn’t increase expected wild animal suffering. But there’s a prospective donor to A whose second favorite funding opportunity is another charity B, which we worry would increase expected wild animal suffering. Then we need to assess the probability of funging, i.e., the possibility that if we fill charity A’s funding gap, we’ll make this other donor more likely to fund B (where they otherwise would’ve funded A anyway). (More on this later.) Perhaps we could try guesswork like GiveWell’s below, but what’s the justification for their precise estimate?:
Our best guess is that the average counterfactual use of domestic government spending that could be leveraged by our top charities is ~75% as cost-effective as GiveDirectly. We think using this figure is a useful heuristic, which roughly accords with our intuitions (and ensures we’re being consistent between charities), but we don’t feel confident that we have a good sense of what governments would counterfactually spend their funds on, or how valuable those activities might be. (Snowden)
Second, take crucial considerations: We’ve discovered insights that flipped the sign of various interventions’ near-term effects, not just long-term. E.g., reducing cow farming plausibly increases suffering by increasing wild animal populations (Shulman; Tomasik), or shifting consumption to smaller animals (Charity Entrepreneurship). Since we’ve been surprised by crucial considerations before, it seems we should expect there are others we’re currently unaware of, that is, unknown unknowns. Even lots of research won’t necessarily turn up all the sign-flipping considerations, given the complexity of the system of different animals and human societies we’re intervening on. How should we adjust our EV estimates to factor in unknown unknowns?
It’s tempting to reply, “Unknown unknowns aren’t action-relevant. Either we have no idea what they imply anyway, or they shrink the EV of everything by the same factor.” However (as elaborated below),[3] it seems too strong to assume unknown unknowns conveniently cancel out “in expectation”. For some interventions, then, it will be so ambiguous how to adjust for unknown unknowns that we end up clueless about them. Yet, when we look closely at other interventions’ mechanisms of impact on animals, we may still have reasons to consider these interventions positive.
Effective animal suffering reducers therefore face at least a weak form of cluelessness. What kinds of interventions are better than inaction despite these challenges, that is, cluelessness-robust? I propose that we should only consider an intervention cluelessness-robust if:
(Clearly, we care about cost-effectiveness, not just doing better than inaction. But the worry is that even if an intervention looks very cost-effective according to our precise best guess, it might be no better than inaction on closer inspection.)
Throughout, the examples are not confident claims. They’re mostly meant to be illustrative.
Informally, an effect of an intervention is “robust” if we have reason to factor it into our decision-making.[4] So, robust backfire effects are those that we have at least as much reason to factor into our decisions as the robust upsides.
You’re unsure whether to donate to a charity that tries to make fish slaughter more humane via electrical stunning. Here are some backfire effects you might consider, and intuitively appropriate responses to them (I’m not yet claiming there’s a rigorous way of justifying these intuitive responses):
Non-robust backfire effect: Maybe promoting humane slaughter makes society more complacent about exploiting nonhumans, eventually leading to the mistreatment of astronomically many digital beings in the far future. On the other hand, we could also imagine that by expanding society’s moral circle, promoting humane slaughter prevents the mistreatment of astronomically many future beings. We have “complex cluelessness” about how to weigh the first (very bad) effect against the second (very good) effect, that is, they’re not precisely equally likely and severe. Nevertheless, it doesn’t seem like our cluelessness about these effects should override the benefits to the fish, which we’re not clueless about. Overall, then, the “increasing complacency in the far future” backfire effect doesn’t seem robust.
While these examples give the intuition, the hard part is making precise which effects we have “reason” to factor into our decisions (see here for one attempt). And many net-positive interventions will have some backfire effects. But it’s worth deliberately keeping robust backfire effects in mind, because:
Some key potential backfire risks, which may or may not be robust depending on the specific intervention:
Reducing pasture, as opposed to cropland, is the most robustly bad instance of this effect: Although replacing cropland with natural habitats increases invertebrate populations, it could also prevent more painful deaths from pesticides.[5]
For example, if we push for a subsidy for some more humane farming method to make farmers strictly prefer that method, they might increase production.[6]
You’re deciding whether to fund a corporate campaign for a new chicken welfare reform. This reform appears less effective than the Better Chicken Commitment reforms (assume BCC reforms have hit diminishing returns), but still promising. Except, you wonder about the substitution effect: Maybe when the reform increases the price of chicken and eggs, many consumers will respond by eating a much larger number of fish and shrimp per calorie (Buhler).
Intuitively, you’re not too worried about this downside, since you guess that a large fraction of substitution will be for much larger animals (cows and pigs). But you don’t have very precise, conclusive evidence about the magnitude of the effect, even after consulting studies on price elasticities.[7] You try to roughly guess the expected increase in suffering-years of farmed fish and shrimp, for each suffering-year of chicken affected by the reform — you come up with numbers from 0.5% to 25%. (The larger figure is partly due to the large number of days in captivity per farmed fish, compared to chickens (Welfare Footprint Institute).[8]) And, you also feel pretty confused about how much suffering the reform would reduce per chicken. You suppose it might be as effective as reducing 30% of each chicken’s suffering in expectation, but feel you could just as well say it’s only 5%.
Given the 0.5% figure for increased farmed fish and shrimp suffering-years per chicken suffering-year, and the 30% figure for decreased chicken suffering, you’d net-reduce suffering-years by 29.5%. So the campaign would be good. But given the figures of 25% and 5%, respectively, you’d net-increase suffering-years by 20%, so the campaign would be bad. Then it looks like an arbitrary call overall whether you should fund the campaign.
Once we’ve accounted for an intervention’s robust effects on welfare, we of course need to weigh up the positives vs. negatives — both the amount of welfare affected, and how likely the effect is. In cases like the example above, though, lots of different precise weights seem defensible. Committing to any one set of weights thus looks arbitrary. How should we respond in these cases? I think that if the net sign of the intervention’s robust effects varies based on an arbitrary choice of weights, we can’t call it cluelessness-robust.
Sub-principles:
You’re deliberating between two interventions for some species of farmed animal. You could fund either a welfare reform campaign, or a campaign to block the development of several new factory farms. Assume (generously!) that you have a robust estimate of the cost per individual animal affected, and that the only robust effects of either intervention are on this species.
On one hand, the welfare campaign is somewhat cheaper. But, in the past you’ve come across ways that welfare reforms might not decrease as much suffering as initially expected, or even increase it — e.g., effects of cage-free reforms on chicken injuries and mortality (Rufener and Makagon; Cotra), botched stunning, and potential increases in wild fish catch in response to decreased demand (St. Jules). You worry the same could apply to this reform, even if you can’t think of a specific backfire mechanism. If the animals aren’t born into factory farms at all, though, they definitely won’t experience net-negative lives. Accounting for this, overall, you favor the development-blocking campaign.[9]
Suppose we’ve tried weighing up an intervention’s robust effects, based on all the considerations we’re aware of. And the intervention looks good under every reasonable weighing. Are we in the clear?
Not quite. We’re likely unaware of some considerations about the robust effects. And we should expect that if we knew those considerations, we’d consider some effects more or less likely. Though it’s unclear precisely how to adjust our current estimates to reflect that expectation, we should still do so. (Even when we try to make “conservative” adjustments, our intuitions can fail us pretty badly (Wildeford).) Therefore, if the net sign of the intervention’s robust effects varies across different reasonable ways to adjust for unknown unknowns, it’s not cluelessness-robust.
It might seem impossible to say anything principled or action-relevant about unknown unknowns. But the potential for unknown unknowns seems greater in some domains than others. Most notably, when an intervention’s pathway to its intended benefits is more complex or involves more unfamiliar variables, we should expect there to be more unknown unknowns relevant to this pathway (more below). Arguably, then, we should prioritize interventions whose paths to impact we deeply, mechanistically understand.[10]
Sub-principles:
An important implication: If we expect the intervention’s upsides to occur systematically later than the downsides, then it’s especially plausible that the downsides are greater. For example, it’s been suggested[11] that even if a farmed animal intervention greatly increases wild animal suffering in the near term, this downside will eventually be offset by a more animal-friendly society helping wild animals. This argument seems weaker when we note that the longer-term upsides are less robust to unknown unknowns.
EAs, including those working in animal welfare specifically, have often remarked on the importance of robustness in some sense. Given this, I want to clarify two points. (This section isn’t essential if you’re short on time, but probably will give important context.)
First, cluelessness-robustness aligns in some important ways with existing standards of robustness. So at least to some degree, animal advocates with diverse epistemic perspectives could coordinate on cluelessness-robust interventions. In particular, here’s how some principles in the list above relate to properties people typically consider indicative of robustness:
Second, however, there are also crucial differences. Hence, my framework of cluelessness-robustness is action-relevant for effective animal advocates. Namely:
Cluster thinking and Tomasik’s notion of robustness emphasize the importance of common sense. But evaluating interventions for cluelessness-robustness often requires taking counterintuitive arguments and beyond-first-order effects seriously. This is because we shouldn’t expect our intuitions to precisely price in unknown unknowns, or accurately weigh up the interests of different species, etc.[12]
Cluelessness-robustness doesn’t assume risk aversion (though they’re compatible).[13] For example, even if we think it might be reasonable to assign very low probability to springtails being capable of intense suffering, an intervention focused on these animals could be cluelessness-robust. (At least, as long as we have a non-arbitrary basis for thinking we’re systematically helping springtails rather than harming them.)
Many of us prioritize near-term animal welfare interventions, despite caring about the far future, because our epistemic standards rule out longtermist interventions. This post recommends that we consistently apply those epistemic standards to animal welfare interventions themselves. The high-level starting point is to check for backfire risks, and make sure that when we say they’re outweighed by the upsides, this verdict is robust to different reasonable judgment calls.
What implications do these standards have for our bottom-line prioritization? I’m not sure yet, and I’m keen to flesh this out more. Tentatively: Humane slaughter for invertebrates (or at least research into this) seems promising, if one can avoid funging with interventions that could be highly negative for some off-target animals. This is because:
Invertebrate farming interventions are relatively robust to effects on wild animals and substitution effects. Invertebrate farming uses much less land,[14] and kills many more individuals per calorie, than farming of larger animals. If the intervention had a substitution effect via increased prices, consumers would switch to such larger animals (this includes replacement of feed insects with fishmeal). (However, it’s possible that even substitution toward larger animals could make us clueless, if the relative moral weights of invertebrates vs. vertebrates are too ambiguous.)
(I think it’s likely I’ll conclude that other interventions are cluelessness-robust, too. It’s just that humane invertebrate slaughter looks relatively good.)
The tractability of further desk research on cluelessness-robustness is unclear, but some potential next steps include:
Very few object-level considerations in this post are new (many of them I learned from discussions with Michael St. Jules). And I’m largely indebted to thinking by Jesse Clifton and Anni Leskelä for the high-level ideas here. Thanks to Michael St. Jules, Jim Buhler, Jesse Clifton, Clare Harris, Euan McLean, Joseph Ancion, and Mal Graham for helpful feedback and suggestions. This does not imply any of these people’s endorsement of all my claims.
Some EA thinkers have argued that cluelessness in fact cuts against neartermist work (see, e.g., Greaves and Holness-Tofts). This post won’t address such arguments. Instead, I’ll assume a perspective on which: (i) Neither neartermist nor longtermist interventions are justified based on their effects on expected total welfare (as I’ve argued here); but (ii) interventions might be justified based on their most “robust” effects on welfare, in a sense I unpack below.
Notes:
See the literature on “unawareness” for more. I give a precise explanation of why the “canceling out” assumption is implausible here. The gist is: Once you discover a new consideration, it seems you should update your beliefs on this fact. And this update will break the symmetry between optimistic and pessimistic unknown unknowns.
This isn’t an ideal definition. I want to say that there’s some property of effects that we call “robustness” — which we can try to make more precise, as discussed later in this section — and it’s because these effects have this property that we have reason to factor them into our decision-making.
H/t Samy Sekar and Michael St. Jules.
H/t Michael St. Jules.
I’m definitely not confident that we can’t, in fact, get robust evidence about the magnitude of this substitution effect.
H/t Michael St. Jules.
(More in-the-weeds note:) If the welfare reform isn’t better than inaction, while the development-blocking campaign is, this doesn’t immediately imply that you should prefer to fund development-blocking over the reform. As some toy math: Taking the “EV” restricted to just the robust effects, we might have EV(reform) = [-1, 1], EV(inaction) = 0, and EV(development-blocking) = 0.5. So development-blocking > inaction, but we can’t compare the reform with either development-blocking or inaction! (See here for some technical discussion.) But I find it intuitive that, in general, we should prefer an option in proportion to how much better its robust effects are than each alternative. So, arguably we should prefer development-blocking because it’s at least better than something else, even if the comparison to “inaction” per se isn’t special. See St. Jules for a similar proposal in the context of difference-making risk aversion, though note that the view I’m suggesting here isn’t inherently about difference-making risk aversion.
Note also that in principle, adjusting for unknown unknowns can flip the net sign of an intervention’s robust effects! For example, suppose at first we expect the intervention to have:
So we consider the intervention to be negative. Once we adjust for unknown unknowns, the negative effect could end up not seeming robust anymore because the pathway to this effect is so unfamiliar. We’d then consider the intervention positive.
See, e.g., Meyer Shorb, and John and Sebo. (I’m not claiming that these authors confidently endorse the argument above.) See also an analogous argument by Rouk for putting less weight on the small animal replacement problem.
In particular, I don’t see the justification for Karnofsky’s claims, “Correcting for missed parameters and overestimated probabilities will be more likely to cause “regression to normality” (and to the predictions of other “outside views”) than the reverse,” or, “I believe that the sort of outside views that tend to get more weight in cluster thinking are often good predictors of “unknown unknowns.”” More on this here.
How does this square with the claim that cluelessness-robust interventions aren’t prone to Pascal’s mugging? The idea is: In cases I tend to find the most repugnantly “Pascalian” (your mileage may vary), we’re usually making (implicit) comparisons of very low probabilities of upsides vs. downsides. The case for reducing the suffering of tiny animals doesn’t seem to have this structure, since we won’t make these animals worse off if it turns out they’re not sentient.
See, e.g., the data for prawns here.
When discussing considerations around backfire risks and near-term uncertainty, it is common to hear that this is all excessive nitpicking, and that such discussion lacks action guidance, making it self-defeating. And it's true that raising salience of these issues isn't always productive because it doesn't offer clear alternatives to going with our best guess, deferring to current evaluators that take backfire risks less seriously, or simply not seeking out interventions to make the world a bit better.
Thus, because this article centers the discussion on the search for positive interventions through a reasonably action list of criteria, it has been one of my most valuable reads of the year.
I think the more time we spend exploring the consequences of our interventions, the more we realize that doing good is hard. But it's plausibly not insurmountable, and there may be tentative, helpful answers to the big question of effective altruism down the line. I hope that this document will inspire stronger consideration for uncertainty. Because the individuals impacted by near-term second-order effects of an action are not rhetorical points or numbers on a spreadsheet: they're as real and sentient as the target beneficiaries, and we shouldn't give up on the challenge of limiting negative outcomes for them.