Philosophy, global priorities and animal welfare research. My current specific interests include: philosophy of mind, moral weights, person-affecting views, preference-based views and subjectivism, moral uncertainty, decision theory, deep uncertainty/cluelessness and backfire risks, s-risks, and indirect effects on wild animals.
I've also done economic modelling for some animal welfare issues.
Want to leave anonymous feedback for me, positive, constructive or negative? https://www.admonymous.co/michael-st-jules
I think you've simplified the problem too much. There can be special cases where we can use symmetry and just take simple averages, but many practical cases are not like that. Indeed, that's the point of the distinction between complex and simple cluelessness in the first place.
I think, ideally, we should look for and exploit as much evidential symmetry as possible, but I don’t think we'll always find enough of it to land on a unique precise distribution, I'd guess in principle impossible in many cases (probably almost all cases of intervention and cause area research) without further evidence.
Â
It's true that direct impressions (e.g. internal states about the plausibility of the probabilities) could be considered evidence, but to the extent that for the same objective external evidence, these direct impressions can vary between people or depending on how or when you present the evidence, they seem arbitrary.
Would you take the fact that a direct impression came from your brain — from an inscrutable process, prone to cognitive biases of various kinds, and whose reliability you can at best verify by track records in limited domains where feedback is practical, and where track records may not generalize across tasks and domains well — is better evidence than a direct impression from another person's brain (with similar problems), with access to the same objective external evidence?
Or, what if there are multiple people with different distributions and different track records in relevant domains? How do you weigh them? How much should track record be worth? EDIT: What if their track records are measured in different ways, e.g. you have forecasters with Brier scores, investors or betters with measures of their gains and losses, researchers and grantmakers of various seniorities at different organizations?
And what's the range of direct impressions humans or other semi-rational agents could have, and how would you weigh them all?
Â
I'd also be keen to get your response to this (and also this, if you have the time.)
Do you think it's reasonable for two people with all of the same evidence to disagree on precise probabilities and expected values? If so, how would you justify picking your own precise probabilities over someone else's, if you think theirs are just as defensible?Â
Or would you just average yours and theirs in some way to get a new distribution? How?
Â
And how far would you go, if you consider all the defensible precise probability distributions anyone could assign (whether or not anyone actually does so)? How do you weigh them all if there are infinitely many of them and no uniform distribution over them?
Â
How would you choose the distributions for the model weights in a way that's not itself arbitrary? E.g. how do you choose their forms and parameters in a way that's not arbitrary?
I do think imprecise credences have a similar problem of deciding which distributions to include in their representor. I think ultimately we need to make some arbitrary choices and should accept some, but we can be more or less arbitrary, or stop when it's no longer decision-relevant. Maybe sometimes we can hit a fixed point or see some kind of convergence in the extra steps we're taking.
Â
On there potentially being no fact of the matter, this may be helpful. It goes further than the issue of imprecise credences/EVs.
On the nematode example, it could go further than that: we might assign an imprecise credence between X and 100% to a set of standards for sentience that nematodes don't meet (see my other post on gradations of moral weight). So, the ratio could be anywhere between 0 and 1 (assuming we're taking the absolute value, or only consider same-sign valence).
If the ratio is anywhere between 0 and 1, then whenever we're looking at affecting nematode-seconds relative to their welfare ranges more than human-seconds relative to our welfare ranges, it would be indeterminate which is affected more. I think that would be every time in practice.
If we don't need to deal with gradations/vagueness like this, then I would probably assign expected welfare ranges (conditional on sentience) between constant and roughly proportional to the number of neurons, and this could give many more practically useful comparisons. EDIT: although conscious subsystems makes me more inclined towards approximately proportional, if we’re entertaining nematode sentience.
Hi Vasco, thanks for the comment and sorry for not seeing this and responding earlier.
I agree that the weights/coefficients in the model could end up quite arbitrary, and I would expect them to if someone tried to set them precisely. My sense is that:
Mood et al., 2023 had an earlier estimate:
The FAO (2022c) describes a system in which 4,500 hatchling carp are concurrently stocked to feed each mandarin fish. If this is typical then mandarin farming overall consumes an estimated 3,000 billion feed fishes, i.e. 3.0 × 1012, based on an estimated 674 million mandarin fish (Table 3).
Â
From the FAO source (archived):
According to field practice, live foods should be at a density of 3 800–4 500 fish per m2. Moreover, the size of the live food should be well controlled to keep pace with the growth rate of the cultured fish. In order to provide an appropriate quantity of live food, provision procedures are suggested as: 1 to 4 days after pond fertilization, silver carp and bighead carp hatchlings are stocked as live foods at 1 500 per m2; one week later, the same quantity is again stocked; and again an additional week later, stocking of the same number is performed. The third stocking should be accompanied by the stocking of mandarin fish of size 3–4 cm at an average rate of 1 fish per m2.
So 1 mandarin fish per m2 vs 3 800–4 500 feed fish per m2?
I had Perplexity do a review and compare PETA's report with Welfare Footprint's research here, and also critique and review the report Perplexity generated here. Summary from the second link:
Â
| Finding | Â | Â | Â | Evidence |
|---|---|---|---|---|
| The 80 m² EFSA error identification is correct | ✓ | ✓ | ✓ | EFSA presentation slide confirms "Minimum area: For group >30 birds: 80 m²" alongside "Max stocking density: 4 laying hens/m²" — a total enclosure area, not per-bird |
| PETA's own HPAI data does contradict its thesis | ✓ | ✓ | ✓ | PETA white paper footnote 84 states 60% caged / 40% cage-free culls with ~45% cage-free flock share, confirming disproportionate caged impact​ |
| KBF cherry-picking critique is well-supported | ✓ | ✓ | ✓ | Danish study confirms 86% overall KBF prevalence across all systems; 50–98% in enriched cages |
| WFI's Open Philanthropy funding concern is valid and accurately stated | ✓ | ✓ | ✓ | EA Forum confirms $980K+ as of July 2022; additional $1.25M contract in 2023 |
| WFI mortality meta-analysis publication in Nature is verified | ✓ | ✓ | ✓ | Published in Scientific Reports (Nature) covering 6,040 flocks across 16 countries​ |
| The report's overall assessment — PETA's paper is advocacy, WFI's is substantially more rigorous — is well-supported | ✓ | ✓ | ✓ | PETA's paper is not peer-reviewed, contains verified factual error, and uses advocacy framing; WFI publishes parameters and invites sensitivity testing |
Â
I think there's a lot that could change if you very seriously weighed others' actual or possible direct impressions/intuitions without heavily privileging your own, before we even get into the question of precise vs imprecise credences. Epistemic modesty is going to do a lot of work first.