Probabilities, Prioritization, and 'Bayesian Mindset'

Violet Hour

Probabilities, Prioritization, and 'Bayesian Mindset'

Violet Hour

29 min readApr 4, 2023

Comments 6

Sorted by

New & upvoted

titotal

Fantastic post, puts into words a lot of my misgivings with how bayesianism is currently practised here.

I am unconvinced of the most important idea of the longtermist "bayesian mindset": that good predictions in evidence rich, well established fields are transferable to predictions in highly speculative, uncertain fields. I am skeptical that being able to answer questions like "will the EU pass an AI act in the next year" is particularly highly correlated with correctly answering questions like "will a hypothetical future AGI defeat humanity in a hypothetical war".

It seems to me like people can do well on the evidence questions because they are not super hard, and they can receive fairly quick feedback on errors. (For example, calibration is very hard to do on probabilities that are significantly less than 1% or more than 99%, so these type of questions are rarely tested). But when the plausibly correct answer for something varies by many order of magnitude, it seems fairly easy for groupthink and bias to seep in, because there are is not enough evidence to correct them.

This is not to say I don't count good prediction as a point in someones favour, just that there are other factors I value more highly, like domain-relevant specific accomplishments and skills.

Violet Hour

Thanks :)

I’m sympathetic to the view that calibration on questions with larger bodies of obviously relevant evidence aren’t transferable to predictions on more speculative questions. Ultimately I believe that the amount of skill transfer is an open empirical question, though I think the absence of strong theorizing about the relevant mechanisms involved heavily counts against deferring to (e.g.) Metaculus predictions about AI timelines.

A potential note of disagreement on your final sentence. While I think focusing on calibration can Goodhart us away from some of the most important sources of epistemic insight, there are “predictions” (broadly construed) that I think we ought to weigh more highly than “domain-relevant specific accomplishments and skills”.

E.g., if you’re sympathetic to EA’s current focus on AI, then I think it’s sensible to think “oh, maybe Yudkowsky was onto something”, and upweight the degree to which you should engage in detail with his worldview, and potentially defer to the extent that you don’t possess a theory which jointly explains both his foresight and the errors you currently think he’s making.
My objection to ‘Bayesian Mindset’ and the use of subjective probabilities to communicate uncertainty is (in part) due to the picture imposed by the probabilistic mode of thinking, which is something like “you have a clear set of well-identified hypotheses, and the primary epistemic task is to become calibrated on such questions.” This leads me to suspect that EAs are undervaluing the ‘novel hypotheses generation’ component of predictions, though there is still a lot of value to be had from (novel) predictions.

NunoSempere

I thought this was thought-provoking, and I'm sharing it with my forecaster friends. Thanks for writing it!

Stian

This was a great post, and I appreciated the comments towards the end about the train to Crazy Town, like "Stops along the train to Crazy Town simply represent the place where we see the practical limits to a certain kind of quasi-formal reasoning." In my own (but noticeably Yudkowsky-derived) words I was thinking that this applies in areas where "the probabilities are there to make clear our beliefs and values. If, on reflection, you find that the probabilities advocate for something you do not believe or value, then the model you built your probabilities on don't capture your beliefs/values well enough."

This is noticeably narrower than the AI x-risk prediction case, where I see the beliefs about possible/relevant models to be less clustered than beliefs about the set of human beliefs. And now I'm noticing that even here I might be trapped inside the Bayesian Mindset, as the previous sentence is basically a statement of credences over the spread of those sets of beliefs.

Have you had a chance to read Vanessa Kosoy and Diffractor's work on Infra-Bayesianism? From what I can tell (I haven't spent much time engaging with it myself yet, but), it's very relevant here, as it wants to be a theory of learning that applies "when the hypothesis is not included in the hypothesis space". Among other things, they talk about infradistributions: a set of probability distributions which would act like a prior over environments.

Violet Hour

I haven’t read Kosoy & Diffractor’s stuff, but I will now!

FWIW I’m pretty skeptical that their framework will be helpful for making progress in practical epistemology (which I gather is not their main focus anyway?). That said, I’d be very happy to learn that I'm wrong here, so I’ll put some time into understanding what their approach is.

Richard Nerland

I think this is obfuscating the good points, I appreciate many of the points but they seem to be ticked off rather than front and center.

I am afraid the frame of "When to" is promoting a binary mindset which is fundamentally opposed to proper decision making.

I am reading it as attempting to have decision points for when to collapse distributions to point estimates. "Use of explicit probabilities"

You always have the explicit distribution. You always have a policy (why didn't it say policy in the alliterative p title) You always break apart the timeline and draw the causal dag.

This is offensive to reasonable planning: "Some creatures would be better served by mapping out the dynamic dependencies of the world" Always draw the dependencies!

The question is when to communicate the point estimate versus distribution. When to communicate the dependencies or just the final distribution.

People allege the crazy train when you are imagining a point estimate represents the best information that is used for making a decision. That is the implicit suggestion when you discuss point estimates.

Quick suggestions, communicating a point estimate is poor:

When the outcomes have unequal weightings across decision makers. So each decision maker needs to attach their weights to get the weighted EV
When decisions are sensitive to reasonable perturbation of the point estimate. Ie when two good models disagree to the point that it implies different decisions.
When the probability is endogenous to the decisions being made.

Poker is unnecessary for the analogy, just probability of a draw from an urn.

We are speculating on how many balls are in the urn when a much better question would be Given we get the urn will we know how many balls are in it? How much does that cost? Can we do things before opening the urn that change the contents? How much does that cost?

Can we not sign for the urn when the Amazon delivery guy arrives? How much does that cost?

Ok that is a joke, but the idea is that we don't know what recourse we have and those actually are important and affect the point estimate.

The probability is downstream from certain decisions, we need to identify those decisions that affect the probability.

Does that mean the point estimate is useless, well maybe because those decisions might reduce the probability by some relative amount, ie if we get congress to pass the bill the odds are half no matter what they were before.

If you go, yeah but I say it is 27.45% and 13.725% is too high. They a decision maker goes "Sure, but I still want to halve it, give me something else that halves it stop telling me a number with no use"

You mention relative likelihood, but it is buried in a sentence of jargon I just had to search for it to remember if you said it.

Finally, frame the analysis relative to a better considered approach to Robust Decision Making, a la Decision Making Under Deep Uncertainty, not relative to Scott or Holden's world view which are just poor starting points.

Comments

More from the author

133

Effective Altruism's Implicit Epistemology

Violet Hour·3y ago·34m read

Davidson's Model of Takeoff Speeds: A Critical Take

Violet Hour·1y ago·23m read

FTX, 'EA Principles', and 'The (Longtermist) EA Community'

Violet Hour·3y ago·17m read

Curated and popular this week

Hard-to-reverse decisions destroy option value

Stefan_Schubert·9y ago·Curated 1d ago·14m read

This post is co-authored with Ben Garfinkel. It is cross-posted from the CEA blog. A PDF version can be found here. Summary: Some strategic decisions available to the effective altruism m...

Introducing Impact List: a ranking of philanthropists by expected lives saved

Elliot Olds·2d ago·6m read

TL;DR: I'm releasing a website that ranks philanthropists according to EA principles and research, and allows users to re-rank the list using their own assumptions. I'd like feedback and help making it better. I'd especially like ideas for how to make the results more trustworthy. Funding may be available. I recently built Impact List (impactlist.xyz), a site which ranks people by their positive impact via donations. The goal is t...

If you're agentic, work in biosecurity

sharmaayushmaan🔸·6d ago·7m read

Disclaimer: Although I work on the Groups Team at CEA, I’m writing this in a personal capacity, and this post does not constitute an endorsement by CEA. Agency - the realisation that you really can just do things. TL;DR Biosecurity needs people (of any background) who are agentic and have a high execution velocity and track record....

Recent opportunities to take action

Marginal Victories: career advising and opportunities for U.S. democracy preservation & political work

Annika Burman 🔸·4d ago·2m read

I'm stepping down as Hive's Executive Director, and we're hiring my successor

SofiaBalderson, Hive·4d ago·3m read

Starting an EA group @ SUNY Binghamton

micahzarin·3d ago·1m read

Richard Nerland

I think this is obfuscating the good points, I appreciate many of the points but they seem to be ticked off rather than front and center.

I am afraid the frame of "When to" is promoting a binary mindset which is fundamentally opposed to proper decision making.

I am reading it as attempting to have decision points for when to collapse distributions to point estimates. "Use of explicit probabilities"

You always have the explicit distribution. You always have a policy (why didn't it say policy in the alliterative p title) You always break apart the timeline and draw the causal dag.

This is offensive to reasonable planning: "Some creatures would be better served by mapping out the dynamic dependencies of the world" Always draw the dependencies!

The question is when to communicate the point estimate versus distribution. When to communicate the dependencies or just the final distribution.

Quick suggestions, communicating a point estimate is poor:

When the outcomes have unequal weightings across decision makers. So each decision maker needs to attach their weights to get the weighted EV
When decisions are sensitive to reasonable perturbation of the point estimate. Ie when two good models disagree to the point that it implies different decisions.
When the probability is endogenous to the decisions being made.

Poker is unnecessary for the analogy, just probability of a draw from an urn.

Can we not sign for the urn when the Amazon delivery guy arrives? How much does that cost?

Ok that is a joke, but the idea is that we don't know what recourse we have and those actually are important and affect the point estimate.

The probability is downstream from certain decisions, we need to identify those decisions that affect the probability.

You mention relative likelihood, but it is buried in a sentence of jargon I just had to search for it to remember if you said it.

^{^}

Many people (including Dustin) justify focus on areas like biorisk and AI in virtue of the risks posed to the present generation. However, I stick with the terminology of ‘longtermist’ grantmaking, because: (i) my discussion focuses on areas that (philosophical) longtermists tend to prioritize, and (ii) I’m focused on sociologically unusual applications of Bayesian Mindset; people who prioritize biorisk and AI risk based primarily on short-term considerations do so on the basis of an unusual set of cognitive tools (like treating speculative probability estimates seriously), which share much in common with Holden’s account of Bayesian Mindset.

^{^}

Carlsmith instead defers to timelines estimates from prior reports, of which Biological Anchors is one.

^{^}

One further example not included in the main text:

Davidson's Framework for Takeoff Speeds

Davidson uses a semi-endogenous growth model to forecast how R&D investments affect technological progress, given estimates for (among many other parameters) AGI training requirements, and the ‘effective FLOP gap’ between the ‘most demanding’ and ‘80th percentile demanding’ tasks. In short, I’m not convinced by the track record of semi-endogenous growth models within economics, and don’t see much in the way of principled reasons for trusting the forecasts of such models for takeoff speeds.

Davidson thinks his modeling strategy is the “~best you can do” when making technological predictions from R&D investment, though he also believes that simply saying "I just don't trust any method that tries to predict the rate of technological progress from the amount of R&D investment" is a “valid perspective”.

^{^}

Indeed, there are various foundational criticisms in this genre that I think do meet this bar — see Linn, Soares, nostalgebraist, and Yudkowsky. The 2021 MIRI Conversations are perhaps the best example of the ‘worldview explication’-type projects I’m most enthused by.

^{^}

I like some of Sam Clarke’s suggestions for communicating deference over AI timelines.

^{^}

Vasco Grilo’s more recent post also provides a nice summary of some relevant evidence.

^{^}

In one EAG interview, Nick Beckstead claimed that there were “certain vibes of careful and precise reasoning” he believed to be societally neglected. I think that research on forecasting generalizability, in alternative words, is research on the degree to (and domains under) which LEA’s implicit conception of “careful reasoning vibes” are vibes which actually help us navigate the world more successfully.

^{^}

Since drafting, a new dispute has emerged between Scott and Tyler on AI risk. I’m not a huge fan of either take, but I think Tyler’s response is illustrative: “[Scott’s] restatement of my argument is simply not what I wrote. Sorry Scott! There are plenty of arguments you just can’t put into the categories outlined in LessWrong posts.”

On my preferred reading, Tyler is criticizing Scott for an overly colonizing use of Bayesian Mindset. Tyler is suggesting that something is awry with the background picture in which probabilistic estimates of questions relevant to AI risk are formed, and with the attempt to interpret disagreements as disagreements about probabilistic estimates within some shared picture of the world, rather than a more foundational disagreement about the benefits of a particular way of approaching practical epistemology.

Probabilities, Prioritization, and 'Bayesian Mindset'

Probabilities, Prioritization, and 'Bayesian Mindset'

1. Philosophy and Practice

2. Okay, But Shouldn’t We Try to Approximate the Bayesian Ideal?

3. Mechanisms, Metaculus, and World-Models

4. Epistemic Gamification

5. Conclusion