MSJ

Michael St Jules 🔸

Animal welfare grantmaking and advising
12980 karmaJoined Working (6-15 years)Vancouver, BC, Canada

Bio

Philosophy, global priorities and animal welfare research. My current specific interests include: philosophy of mind, moral weights, person-affecting views, preference-based views and subjectivism, moral uncertainty, decision theory, deep uncertainty/cluelessness and backfire risks, s-risks, and indirect effects on wild animals.

I've also done economic modelling for some animal welfare issues.

Want to leave anonymous feedback for me, positive, constructive or negative? https://www.admonymous.co/michael-st-jules

Sequences
3

Radical empathy
Human impacts on animals
Welfare and moral weights

Comments
2668

Topic contributions
15

This also applies to many comments written by people? In general, whenever something is shared, one could invest more time to make it more accurate. However, to me it seems hard to come up with general rules about how much time to invest vetting claims.

 

If someone had a pattern of fabrication and very poor understanding (and apparent confidence) like LLMs often do if used uncritically, I would be annoyed with them and possibly do any of the following:

  1. Tell them to read and review more carefully, look for opposing arguments, etc..
  2. Downvote such comments (and I very very rarely downvote).
  3. Stop engaging with this person, because it wastes my time and may encourage them to waste others' time.

 

I think sharing long texts produced by LLMs is often fine even if they were not fully read by the person prompting, basically for the same reasons that it is fine to share long text produced by people even if the person sharing them did not fully read them.

I would request you check it yourself or at least run LLM critique passes, because of high rates of hallucination and other errors by LLMs.

Similarly, if there was an author who was just as consistently bad as LLMs are, and you shared their work uncritically like this, I'd recommend the same.

 

I have the impression many people, including academics, use "consciousness" and "phenomenal consciousness" interchangeably, and without wanting to take a stance on illusionism or realism.

My impression is that many people, including academics, use "consciousness" and "phenomenal consciousness" interchangeably, but they do so implicitly rejecting strong illusionism, would reject strong illusionism if asked directly, and typically don't understand strong illusionism. Maybe many are open, though, I'm not sure.

Re AI, it's a special case of Brandolini's law:

The amount of energy needed to refute bullshit is an order of magnitude bigger than that needed to produce it.

LLMs make it easy to produce a lot of plausible-looking but wrong or misleading claims quickly, and sharing them without checking yourself (or at least running another LLM to do that) puts the burden on others to correct.

I think you should make sure the claims are checked before sharing.

You could do this yourself, or run another critical LLM pass on the answer, e.g. a prompt like "Exhaustively verify and ruthlessly critique the above answer."

The critique pass itself may turn up false positive critiques, and you might want to run a third pass to verify the critiques and write a new balanced answer.

(I agree that it's also important to tag that it’s LLM output, and at least read it over yourself, which you did.)

 

However, illusionists acknowledge that phenomenal consciousness exists. They just argue it is not real in the sense it does not have the classic qualia properties. I seem to agree with this.

This is true of weak illusionists, but not strong illusionists. Strong illusionists reject phenomenal consciousness. Specifically they reject phenomenal properties; they accept that consciousness exists.

 

Would you do anything differently if you endorsed weak illusionism/realism as defined in Frankish (2016) instead of (strong) illusionism?

Other than argue about it, probably not much, assuming functionalism and materialism/physicalism of some kind that's compatible with artificial sentience.

FWIW, my impression is that there are ways to interpret ‘what-it’s-like’ or 'subjective' consistently with strong illusionism, as, e.g. the specific patterns of discriminations a brain makes or what information is accessible to it (or its attention or its global workspace). I think Frankish himself has done this later. 

But 'phenomenal' and 'qualitative' seem harder to pin down precisely in purely physical, discriminatory and informational terms in a way that I think would be broadly acceptable after critique and reflection to realists (including weak illusionists).

First, before responding to the specific claims in the Gemini output:

  1. The Gemini output seems like AI slop, and I would recommend against passing along LLM outputs uncritically like this. The answer seems quite bad, most of what it said seems wrong or misguided. You could run another pass of an LLM on the answer to critique it to help a bit. I hope this doesn't accurately represent Lau's views. Illusionism is often misunderstood and strawmanned, including by academics, so there's a lot of garbage out there for LLMs to have been trained on. Some of the problem with misunderstanding and strawmanning is probably due to its unfortunate name. (I'll flag what looks bad below.)
  2. I don't see a plausible path for Lau's view to solve the hard problem or meta problem of consciousness. Does Lau explain why people so often regard consciousness as non-physical, ethereal, having classic qualia properties, or some qualitative/subjective character beyond just the physical? Or does he reject that people often do regard consciousness this way?
    1. Specifically, I don't see how reality monitoring helps. Internally generated mental imagery is still experienced. It's just experienced differently.
    2. I'm not familiar with Lau's theory; there's problably more. But I've looked at so many theories, and my prior is to be extremely skeptical.

 

1. Macro-Level Summaries are Not "Illusions"

Is this section trying to say the non-physical "magic" is real? If not, how does it contradict illusionism?

 

Lau counters this with a functional analogy: macroscopic metrics are real, even if they ignore microscopic details. When a digital thermometer gives you a reading of 20°C, it is not giving you a detailed map of the kinetic energy of billions of individual air molecules. It is giving you a simplified, macroscopic summary. But that doesn't mean "temperature" is an illusion or a false belief.

This looks a lot like what Graziano says about his own illusionist theory here, so doesn't seem like a point against illusionism:

These models are not empty illusions: they are caricatures. They represent something physically real, but they are not accurate. Models never are. The brain’s models are useful, adaptive, simplified, and never fully accurate, yet they form the basis of our beliefs, thoughts, and claims.

That being said, I'd say the appearance of phenomenal properties is an illusion, and possibly an active misrepresentation, not merely a simplification. See Kammerer, 2019.

Similarly, Lau's Perceptual Reality Monitoring (PRM) argues that the prefrontal cortex generates a macroscopic summary of sensory reliability. It evaluates a signal and outputs a tag: This is real and present. Just because that tag lacks the metadata of neurotransmitter release doesn't make the resulting subjective experience an illusion. It is a highly accurate, biologically real computational output.

I don't see how this solves the hard problem or meta problem.

 

2. The Fallacy of the "Perfect" Introspection

Illusionism relies on the premise that for consciousness to be real, it must perfectly match our intuitive, pre-scientific introspective beliefs about it (i.e., that it is magical, ethereal, or non-physical).

This is false. Illusionsim does not rely on such a premise.

 

Lau points out that this is an unfair standard. Science routinely redefines phenomena without eliminating them. When we discovered that water is H₂O rather than an indivisible classical element, we didn't declare that water was an illusion. Lau argues we should treat consciousness the same way: it isn't the magical inner theater we intuitively thought it was, but rather a physical, metacognitive reality-monitoring system. Redefining it doesn't erase its existence.

Illusionism is (re)defining consciousness without eliminating it. What it eliminates is the "phenomenal" properties of consciousness, and classic qualia are roughly the only formalization/precisification of phenomenality so far that doesn't end up strong illusionist. See Frankish's Quining Diet Qualia. "What it feels like", "subjective", etc. are not formal/precise and don't rule out strong illusionism.

And again, I don't see how reality monitoring helps. 

 

3. The Evolutionary Reality of "Presence"

Seems independent of illusionism vs realism. Mental imagery is still experienced, just differently. Is Lau (or just Gemini?) saying the feeling of presence is a phenomenal property, and mental imagery lacks all phenomenal properties? Would you agree that dreams, thoughts and visualizations aren't phenomenally conscious experiences, but your externally generated experiences are phenomenally conscious? I don't think this is compatible with what most would intend by "phenomenal consciousness", and if this is what Lau (or Gemini?) wants to do, he's working on a totally different problem from pretty much everyone else.

 

Summary

Lau would say that strong illusionists correctly identify that the brain uses simplified representations, but they make a fatal philosophical error in concluding that simplified representations are false.

Seems like a strawman. Strong illusionists don't conclude that simplified representations are false. They conclude that there are no phenomenal properties, and any representations as if there are phenomenal properties are therefore misleading (illusions).

Yes, we could just remove it if we thought it was net negative overall.

Cage-free could turn out to be one of the best things we can do to reduce disabling pain, but slightly bad for excruciating pain, so that it's just unclear whether it's net good or bad under wide uncertainty about pain intensity tradeoffs. If it looks very good on views where disabling-excruciating pain tradeoffs are more modest and only somewhat bad and in the end outweighed on views with much more weight to excruciating pain, removing it from the portfolio could be a mistake.

See also my related post Hedging against deep and moral uncertainty.

Suppose cage-free increases excruciating pain compared to caged.

Is the total increase in excruciating pain across all (the sum total of) cage-free work from our community smaller than the total reduction in excruciating pain from all of our other work, e.g. CO2 stunning for chickens, other humane slaughter work, other welfare work, across species?

If yes, and if there are no other increases in excruciating pain (or they're small enough), then we could still say our community is preventing more excruciating pain than it's causing, overall. Our portfolio could still be robustly positive across different views on pain intensity tradeoffs. I think I'd be pretty satisfied with that, even if it is causing some excruciating pain (that is outweighed by reductions).

 

(Alternatively, we could try to compensate the specific animals we expect to cause more excruciating pain to, but that seems much harder on worldviews according to which excruciating pain matters way more than disabling pain.)

Thanks for writing this! I'm generally supportive of more critical review of evidence and analyses like this and would like to see more of it done, as well as more/better data collection, as you point to. I'm also glad you're working on new shrimp stunners and better validating them.

 

Re cage-free vs caged systems

My sense is that it's reasonable to infer that in the long-run, we don't know whether mortality rates are higher in caged or cage-free systems, and that the differences in average mortality rates are "small", based just on the graphs in this post and WFI's meta-analysis to which this post refers.

I agree that it's "an overstatement to say that there is “no differences in mortality” given the actual data" (in recent years, reading the graphs here).

I think it's still reasonable to treat them as if there's no expected difference in mortality rates in the long-run, but I also think it's reasonable to disagree with that, and this is a potentially important and fairly subjective judgement call, so should be flagged pretty explicitly. It could also mask differences in causes of mortality and associated harms by system, like the kind you point out, which could be important.

Cynthia Schuck-Paim from Welfare Footprint cited more US and EU data here, which show a very small difference in average mortality rates (3.75 in cage-free vs 3.64 in caged), much smaller than the standard deviation in mortality rates (2.62 and 2.34). I don't know what the "Standard" column in the table means, though. (Thanks to @mvolz for flagging WFI's responses.)

 

On the other hand, this could still allow that mortality rates increase substantially for several years as farms transition, and that might be quite bad, both in potentially extremely painful deaths and in indicating higher prevalences of other causes of suffering. I think it's worth being upfront about that, because there could be reasonable disagreement about whether it's worth it for the benefits of cage-free.

 

I think one trouble here is that WFI has to make a large number of judgement calls, given the available evidence. I think they are pretty transparent about them, but there can be a lot to check.

 

 

Some questions I have are:

  1. If mortality rates between the systems do roughly converge on average in the long-run, but start higher early in the transition to cage-free, how long does the ~convergence take in practice across geographies, including in the Global South, especially for our marginal cage-free work now?
  2. WFI's currently published analysis estimates generally less expected suffering in cage-free systems across pain intensity categories. At what point in the transition period to cage-free are these estimates based on, what mortality rates do they reflect, and how representative would they be of transitions across geographies? @cynthiaschuck @Wladimir J. Alonso
  3. How much do increased egg costs and the resulting reduced egg demand mitigate the potential harms of cage-free transitions? Could the reduction in number outweigh any potential increases in harms on average per hen?
  4. (May be answered by 2 or 3.) Are we, on net, increasing or decreasing excruciating pain with cage-free transitions in expectation?
    1. Are we just counterfactually moving forward these potentially painful cage-free transitions, if and because industry would (be forced to shift) eventually anyway, and so reaping the benefits earlier? Would egg replacements and other alt proteins or a global catastrophe (e.g. from TAI) mean we don't get to reap the benefits after mortality rates would have converged?
    2. Does our other work focused on excruciating pain (e.g. humane chicken slaughter) make up for potentially more excruciating pain in cage-free systems?
    3. How are we making tradeoffs between excruciating and disabling pain? How should we make them? (Personally, I'm averse to increasing excruciating pain overall in my portfolio, with high uncertainty about how much worse it is than disabling pain, and would probably have a part of my portfolio dedicated to preventing excruciating pain.)
  5. Does nest deprivation really cause (so much) disabling pain? The supporting evidence I've seen seems somewhat confounded, and I would assign a lower probability to disabling pain than in WFI's published analysis, but would guess there's still a decent chance of it. (I expect WFI to have updated estimates in their upcoming book.)

Ah, influencing certifiers sounds interesting, would love to take a look at the paper! :)

Also looking forward to your survey when it's ready!

I've been curious lately about Muslims' attitudes towards stunning and slaughter practices, and especially what kinds of stunning would be acceptable, in case we wanted to promote more stunning or even do more R&D for halal stunning.

Any updates on this, or promising interventions?

In your model and your answers here, just replace inaction with campaign and campaign with inaction.

Do you think this is AI safety-specific, because LessWrong pulls the people who have engaged most deeply with AI safety, and so the EA Forum is left primarily with people who aren't as into or understand arguments around AI safety?

Or, do you think this is generally bad across causes, and the EA Forum is net negative for other causes, too?

Load more