Contradict my take on OpenPhil's past AI beliefs

EliezerYudkowsky

Re "Oxford EAs" - Toby Ord is presumably a paradigm of that. In the Great AI Timelines Scare of 2017, I spent some time looking into timelines. His median, then, was 15 years, which has held up pretty well. (And his x-risk probability, as stated in The Precipice, was 10%.)

I think I was wrong in my views on timelines then. But people shouldn't assume I'm a stand-in for the views of "Oxford EAs".

Toby_Ord

7mo

I ran a timelines exercise in 2017 with many well-known FHI staff (though not including Nick) where the point was to elicit one's current beliefs for AGI by plotting CDFs. Looking at them now, I can tell you our median dates were: 2024, 2032, 2034, 2034, 2034, 2035, 2054, and 2079. So the median of our medians was (robustly) 2034 (i.e. 17 more years' time). I was one of the people who had that date, though people didn't see each other's CDFs during the exercise.
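For anyone who wants to check the arithmetic, here is a minimal Python sketch using only the eight dates listed above. The variable names are illustrative, and the leave-one-out check is just one assumed reading of "robustly", not necessarily the sense intended.

```python
from statistics import median

# Median AGI dates from the 2017 exercise, as listed in the comment.
medians = [2024, 2032, 2034, 2034, 2034, 2035, 2054, 2079]
EXERCISE_YEAR = 2017

# Median of the eight individual medians, and how far out that was in 2017.
median_of_medians = median(medians)
years_out = median_of_medians - EXERCISE_YEAR

# Assumed reading of "robustly": dropping any single respondent
# should leave the median of medians unchanged.
leave_one_out = {
    median(medians[:i] + medians[i + 1:]) for i in range(len(medians))
}

print(median_of_medians, years_out, leave_one_out)
```

If the leave-one-out set contains a single year, no individual respondent is driving the result.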

I think these have held up well.

So I don't think Eliezer's "Oxford EAs" point is correct.

Eric Neyman

7mo

What's the Great AI Timelines Scare of 2017?

William_MacAskill

7mo

In my memory, the main impetus was a couple of leading AI safety ML researchers started making the case for 5-year timelines. They were broadly qualitatively correct and remarkably insightful (promoting the scaling-first worldview), but obviously quantitatively too aggressive. And AlphaGo and AlphaZero had freaked people out, too.

A lot of other people at the time (including close advisers to OP folks) had 10-20yr timelines. My subjective impression was that people in the OP orbit generally had more aggressive timelines than Ajeya's report did.

Denkenberger🔸

7mo

Wow - @Toby_Ord then why did you have such a high existential risk for climate? Did you have large likelihoods that AGI would take 100 or 200 years despite a median date of 2032?

SLermen

7mo

Toby Ord had an x-risk probability of 10% from AI and about 7% from other causes back then, for a total of about 1/6.

On first reading, I thought Toby Ord had a total all-cause x-risk probability of 10% back then, so I checked. I thought this might be helpful since Eliezer specifically mentioned <10% x-risk from AI as very unreasonable.

Owen Cotton-Barratt

7mo*

My belief is that the Open Philanthropy Project, EA generally, and Oxford EA particularly, had bad AI timelines and bad ASI ruin conditional probabilities; and that these invalidly arrived-at beliefs were in control of funding, and were explicitly publicly promoted at the expense of saner beliefs.

There is a surprising amount of normative judgment in here for a fact check. Are you looking just for disagreements that people held roughly the beliefs you later outline (I think you overstate things but are directionally correct in describing how beliefs differed from yours), or also disagreements about whether they were bad beliefs?

For flavour: as I ask that question, I'm particularly (but not only) thinking of the reports you cite, where you seem to be casting them as "OP really throwing its weight behind these beliefs", and I perceived them more as "earnest attempts by people at OP to figure out what was legit, and put their reasoning in public to let others engage". I certainly didn't just agree with them at the time, but I thought it was a good step forwards for collective epistemics to be able to have conversations at that level of granularity. Was it confounding that they were working at a big funder? Yeah, kinda -- but that seemed second order compared to it just being great that anyone at all was pushing the conversation forwards in this way, even if there were a bunch of aspects of them I wasn't on board with. I'm not sure if this is the kind of disagreement you're looking for. (Maybe it's just that I was on board with more of them than you were, and so I saw them as flawed-but-helpful rather than unhelpful? Then we get to the general question of what standards bad should be judged by given our lack of access to ground truth.)

EliezerYudkowsky

7mo*

My view of the tragedy of OpenPhil is indeed that they were very earnest people trying to figure out what was legit, but ended up believing stuff like "biologically anchored estimates of AI timelines" that were facially absurd and wrong and ultimately self-serving, because the problem "end up with beliefs about AI timelines that aren't influenced by what plays well with our funders and friends" was hard and frankly out of their league and OpenPhil did not know that it was a hard problem or treat it with what I would consider seriousness.

If you'd like to view them as blameless on account of being earnest about it, that's between you and your own moral judgments. I don't particularly think we end up living through this if only we go around morally judging people enough, even correctly. But people ask me for my takes and I am giving a take that makes OpenPhil look bad and my rules do say that I ought to not just do all that behind their backs.

I suppose if you thought that nobody could possibly look bad if my account of them includes, "They were being very earnest in their error", then I wouldn't be obliged to give them a chance to respond to what I was saying about them. But I should prefer to have the chance to respond if somebody was saying that about me. Of course I am earnest, and when I err, it comes from a place of my having tried to be virtuous rather than viceful as best I understood virtue. What of it? There are higher things to aspire to in life besides earnest error.

titotal

7mo

Before I can or should try to write up that take, I need to fact-check one of my take-central beliefs about how the last couple of decades have gone down. My belief is that the Open Philanthropy Project, EA generally, and Oxford EA particularly, had bad AI timelines and bad ASI ruin conditional probabilities; and that these invalidly arrived-at beliefs were in control of funding, and were explicitly publicly promoted at the expense of saner beliefs.

We don't know if AGI timelines or ASI ruin conditional probabilities are "bad", because neither event has happened yet. If you want to ask what OpenPhil's probabilities are and if they disagree with your own, you should just ask that directly. My impression is that there is a wide range of views on both questions among EA org leadership.

Matthew_Barnett

6mo*

I'd like to point out that Ajeya Cotra's report was about "transformative AI", which had a specific definition:

I define “transformative artificial intelligence” (transformative AI or TAI) as “software” (i.e. a computer program or collection of computer programs) that has at least as profound an impact on the world’s trajectory as the Industrial Revolution did. This is adapted from a definition introduced by CEO Holden Karnofsky in a 2016 blog post.
How large is an impact “as profound as the Industrial Revolution”? Roughly speaking, over the course of the Industrial Revolution, the rate of growth in gross world product (GWP) went from about ~0.1% per year before 1700 to ~1% per year after 1850, a tenfold acceleration. By analogy, I think of “transformative AI” as software which causes a tenfold acceleration in the rate of growth of the world economy (assuming that it is used everywhere that it would be economically profitable to use it).
Currently, the world economy is growing at ~2-3% per year, so TAI must bring the growth rate to 20%-30% per year if used everywhere it would be profitable to use. This means that if TAI is developed in year Y, the entire world economy would more than double by year Y + 4. This is a very extreme standard -- even 6% annual growth in GWP is outside the bounds of what most economists consider plausible in this century.
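The doubling claim in the quoted definition can be checked with simple compounding. This is a rough sketch that assumes a constant annual growth rate; the function names are hypothetical and are not taken from the report.

```python
import math


def growth_factor(rate: float, years: int) -> float:
    """Total growth multiple after `years` at a constant annual `rate`."""
    return (1 + rate) ** years


def doubling_time(rate: float) -> float:
    """Years needed to double at a constant annual `rate`."""
    return math.log(2) / math.log(1 + rate)


# Current growth (~2-3%) versus the quoted TAI threshold (20-30%).
for rate in (0.02, 0.03, 0.20, 0.30):
    print(
        f"{rate:.0%}/yr: x{growth_factor(rate, 4):.2f} after 4 years, "
        f"doubles in {doubling_time(rate):.1f} years"
    )
```

At 20% per year the economy grows by slightly more than a factor of two in four years, which matches the "more than double by year Y + 4" statement, whereas at 2% doubling takes roughly 35 years.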

My personal belief is that a median timeline of ~2050 for this specific development is still reasonable, and I don't think the timelines in the Bio Anchors report have been falsified. In fact, my current median timeline for TAI, by this definition, is around 2045.

Ben_West🔸

7mo

What time frame are you interested in? E.g. if someone says that they have <30y timelines today, would that meet your criteria?

EliezerYudkowsky

7mo

It would meet my criteria for being smarter than a potted plant, I suppose.

The harm's already been done.

Ben_West🔸

7mo

I feel confused about this response. You're asking for people to give you examples of a thing occurring; I'm asking what date range you wish to see examples from.

EliezerYudkowsky

7mo*

Okay; I guess I was confused by your question because I thought I'd said that in the main doc.

To repeat and with added explanation: Only opinions from before ChatGPT count.

This is because ChatGPT moved the Overton window and changed which sorts of opinions would earn you the horror of contemptuous looks and lowered status, and my negative model of OpenPhil is that they miraculously arrived at a set of opinions which would balance which sort of looks they got from a weighted set of people they cared about. So whatever happened after the ChatGPT Moment is no longer reflective of what I guess to be the organizational and cognitive processes underlying their earlier failure; and it's fair to ask about this because the earlier stuff was consequential. (Though it didn't move the needle as such; in retrospect and with benefit of hindsight, the needle started at "Dead" and stayed at "Dead" through everything MIRI or OpenPhil tried or failed at.)

While it is now possible to lose a lot of credit for having >30yr median timelines, it is no longer possible to earn significant credit for putting your timelines under 2055 because that is already what "the weighted average of facial expressions on people you care about" is telling you to believe and there are no big social penalties for believing it.

Thanks!

Don't underestimate potted plants!
