C

cb

239 karmaJoined

Bio

Hello! I work on AI grantmaking at Open Philanthropy.

All posts in a personal capacity and do not reflect the views of my employer, unless otherwise stated.

Comments
18

cb
3
2
0
1

Whether longtermism is a crux will depend on what we mean by 'long'

Yep, I was being imprecise. I think the most plausible (and actually believed-in) alternative to longtermism isn't "no care at all for future people", but "some >0 discount rate", and I think xrisk reduction will tend to look good under small >0 discount rates.

I do also agree that there are some combinations of social discount rate and cost-effectiveness of longtermism, such that xrisk reduction isn't competitive with other ways of saving lives. I don't yet think this is clearly the case, even given the numbers in your paper — afaik the amount of existential risk reduction you predicted was pretty vibes-based, so I don't really take the cost-effectiveness calculation it produces seriously. (And I  haven't done the math myself on discount rates and cost-effectiveness.) 

Even if xrisk reduction doesn't look competitive with e.g. donating to AMF, I think it would be pretty reasonable for some people to spend more time thinking about it to figure out if they could identify more cost-effective interventions. (And especially if they seemed like poor fits for E2G or direct work.)

cb
9
1
0

Glad you shared this!

Expanding a bit on a comment I left on the google doc version of this: I broadly agree with your conclusion (longtermist ideas are harder to find now than in ~2017), but I don't think this essay collection was a significant update towards that conclusion. As you mention as a hypothesis, my guess is that these essay collections mostly exist to legitimise discussing longtermism as part of serious academic research, rather than to disseminate important, plausible, and novel arguments. Coming up with an important, plausible, and novel argument which also meets the standards of academic publishing seems much harder than just making some publishable argument, so I didn't really change my views on whether longtermist ideas are getting harder to find because of this collection's relative lack of them. (With all the caveats you mentioned above, plus: I enjoyed many of the reprints, and think lots of incrementalist research can be very valuable —it's just not the topic you're discussing.)  

I'm not sure how much we disagree, but I wanted to comment anyway, in case other people disagree with me and change my mind!

Relatedly, I think what I'll call the "fundamental ideas" — of longtermism, AI existential risk, etc — are mildly overrated relative to further arguments about the state of the world right now, which make these action-guiding. For example, I think longtermism is a useful label to attach to a moral view, but you need further claims about reasons not to worry about cluelessness in at least some cases, and also potentially some claims about hinginess, for it to be very action-relevant. A second example: the "second species" worry about AIXR is very obvious, and only relevant given that we're in a world where we're plausibly close to developing TAI soon and, imo, because current AI development is weird and poorly understood; evidence from the real world is a potential defeater for this analogy.

I think you're using "longtermist ideas" to also point at this category of work (fleshing out/adding the additional necessary arguments to big abstract ideas), but I do think there's a common interpretation where "we need more longtermist ideas" translates to "we need more philosophy types to sit around and think at very high levels of abstraction". Relative to this, I'm more into work that gets into the weeds a bit more.

cb
11
3
0
1

(Meta: I really liked this more personal, idiosyncratic kind of "apply here" post!)

Could you say a bit more about the "set of worldviews & perspectives" represented on Forethought, and in which ways you'd like it to be broader?

cb
5
0
0

Cool, thanks for sharing!
 

I currently use Timing.app, and have been recommending it to people. Is donethat different in any ways? (TBC, "it has all the same features but also supports an E2G effort" would be sufficient reason for me to consider switching).

cb
3
0
0

Mostly the latter two, yeah

cb
5
0
0

Yes, at least initially. (Though fwiw my takeaway from that was more like, "it's interesting that these people wanted to direct their energy towards AI safety community building and not EA CB; also, yay for EA for spreading lots of good ideas and promoting useful ways of looking at problems". This was in 2022, where I think almost everyone who thought about AI safety heard about it via EA/rationalism.)

cb
22
2
3
1

Interesting post, thanks for sharing. Some rambly thoughts:[1]

  • I'm sympathetic to the claim that work on digital minds, AI character, macrostrategy, etc is of similar importance to AI safety/AI governance work. However, I think they seem much harder to work on — the fields are so nascent and the feedback loops and mentorship even scarcer than in AIG/AIS, that it seems much easier to have zero or negative impact by shaping their early direction poorly.
  • I wouldn't want marginal talent working on these areas for this reason. It's plausible that people who are unusually suited to this kind of abstract low-feedback high-confusion work, and generally sharp and wise, should consider it. But those people are also well-suited to high leverage AIG/AIS work, and I'm uncertain whether I'd trade a wise, thoughtful person working on AIS/AIG for one thinking about e.g. AI character.
  • (We might have a similar bottom line: I think the approach of "bear this in mind as an area you could pivot to in ~3-4y, if better opportunities emerge" seems reasonable.)
  • Relatedly, I think EAs tend to overrate interesting speculative philosophy-flavoured thinking, because it's very fun to the kind of person who tends to get into EA. (I'm this kind of person too :) ). When I try to consciously correct for this, I'm less sure that the neglected cause areas you mention seem as important.
  • I'm worried about motivated reasoning when EAs think about the role of EA going forwards. (And I don't think we should care about EA qua EA, just EA insofar as it's one of the best ways to make good happen.) So reason #2 you mention, which felt more like going "hmm EA is in trouble, what can we do?" rather than reasoning from "how do we make good happen?" wasn't super compelling to me.
  • That being said, If it's cheap to do so, more EA-flavoured writing on the Forum seems great! The EAF has been pretty stale. I was brainstorming about this earlier —initially I was worried about the chilling effect of writing so much in public (commenting on the EAF is way higher effort for me than on google docs, for example), but I think some cool new ideas can and probably should be shared more. I like Redwood's blog a lot partly for this reason.
  • In my experience at university, in my final 2 years the AI safety group was just way more exciting and serious and intellectually alive than the EA group — this is caricatured, but one way of describing it would be that (at extremes) the AI safety group selected for actually taking ideas seriously and wanting to do things, and the EA group correspondingly selected for wanting to pontificate about ideas and not get your hands dirty. I think EA groups engaging with more AGI preparedness-type topics could help make them exciting and alive again, but it would be important imo to avoid reinforcing the idea that EA groups are for sitting round and talking about ideas, not for taking them seriously. (I'm finding this hard to verbalise precisely —I think the rough gloss is "I'm worried about these topics having more of a vibe of 'interesting intellectual pastime', and if EA groups tend towards that vibe anyway, making discussing them feel ambitious and engaging and 'doing stuff about ideas'-y sounds hard". 
  1. ^

    I would have liked to make this more coherent and focused, but that was enough time/effort that realistically I just wouldn't have done it, and I figured a rambly comment was better than no comment.

See also https://sb53.info/ for a FAQ and annotated copy of the bill

cb
*13
2
0
1

Could you say a bit more about the power law point?

A related thing I've been thinking about is that some kinds of deep democracy and some kinds of better futures-style reasoning (for sufficiently risk-neutral, utilitarian, confident in their moral views, etc etc etc kinds of agents, assume all the necessary qualifiers here) will end up being in tension — after all, why compromise between lots of moral views when this means you miss out on a bunch of feasible moral value? (More precisely, why choose the compromise it's-just-ok future when you could optimise really hard according to the moral view you favour and have some small chance of getting almost all feasible value?)

I think that some versions of the power law point might make moral compromise look more appealing, which is why I'm interested. (I'm personally on team compromise!)

I am too young and stupid to be giving career advice, but in the spirit of career conversations week, I figured I'd pass on advice I've received which I ignored at the time, and now think was good advice: you might be underrating the value of good management!

I think lots of young EAish people underrate the importance of good management/learning opportunities, and overrate direct impact. In fact, I claim that if you're looking for your first/second job, you should consider optimising for having a great manager, rather than for direct impact.

Why?

  • Having a great manager dramatically increases your rate of learning, assuming you're in a job with scope for taking on new responsibilities or picking up new skills (which covers most jobs).
  • It also makes working much more fun!
  • Mostly, you just don't know what you don't know. It's been very revealing to me how much I've learnt in the last year, I think it's increased my expected impact, and I wouldn't have predicted this beforehand.
    • In particular, if you're just leaving university, you probably haven't really had a manager-type person before, and you've only experienced a narrow slice of all possible work tasks. So you're probably underrating both how useful a very good manager can be, and how much you could learn.

How can you tell if someone will be a great manager?

  • This part seems harder. I've thought about it a bit, but hopefully other people have better ideas.
  • Ask the org who would manage you and request a conversation with them. Ask about their management style: how do they approach management? How often will you meet, and for how long? Do they plan to give minimal oversight and just check you're on track, or will they be more actively involved? (For new grads, active management is usually better.) You might also want to ask for examples of people they've managed and how those people grew.
  • Once you're partway through the application process or have an offer, reach out to current employees for casual conversations about their experiences with management at the org.
  • You could ask how the organization handles performance reviews and promotions. This is probably an okay-not-great proxy, since smaller, fast-growing orgs might have informal processes but still excellent management, but I thin k it would give you some signal on how much they think about management/personal development.
  • (This maybe only really works if you are socially very confident or know lots of EA-ish people, sorry about that) You could consider asking a bunch of your friends and acquaintances about managers they've had that they thought were very good, and then trying to work with those people.
  • Some random heuristics: All else equal, high turnover rate without seemingly big jumps in career progression seems bad. Orgs that regularly hire and retain/promote early career people are probably pretty good at management; same for orgs whose alumni go on to do cool stuff. 

(My manager did not make me post this)

Load more