[EDIT: Thanks for the questions everyone! Just noting that I'm mostly done answering questions, and there were a few that came in Tuesday night or later that I probably won't get to.]
Hi everyone! I’m Ajeya, and I’ll be doing an Ask Me Anything here. I’ll plan to start answering questions Monday Feb 1 at 10 AM Pacific. I will be blocking off much of Monday and Tuesday for question-answering, and may continue to answer a few more questions through the week if there are ones left, though I might not get to everything.
About me: I’m a Senior Research Analyst at Open Philanthropy, where I focus on cause prioritization and AI. 80,000 Hours released a podcast episode with me last week discussing some of my work, and last September I put out a draft report on AI timelines which is discussed in the podcast. Currently, I’m trying to think about AI threat models and how much x-risk reduction we could expect the “last long-termist dollar” to buy. I joined Open Phil in the summer of 2016, and before that I was a student at UC Berkeley, where I studied computer science, co-ran the Effective Altruists of Berkeley student group, and taught a student-run course on EA.
I’m most excited about answering questions related to AI timelines, AI risk more broadly, and cause prioritization, but feel free to ask me anything!
[I'm not sure if you've thought about the following sort of question much. Also, I haven't properly read your report - let me know if this is covered in there.]
I'm interested in a question along the lines of "Do you think some work done before TAI is developed matters in a predictable way - i.e., better than 0 value in expectation - for its effects on the post-TAI world, in ways that don't just flow through how the work affects the pre-TAI world or how the TAI transition itself plays out? If so, to what extent? And what sort of work?"
An example to illustrate: "Let's say TAI is developed in 2050, and the 'TAI transition' is basically 'done' by 2060. Could some work to improve institutional decision-making be useful in terms of how it affects what happens from 2060 onwards, and not just via reducing x-risk (or reducing suffering etc.) before 2060 and improving how the TAI transition goes?"
But I'm not sure it's obvious what I mean by the above, so here's my attempt to explain:
The question of when TAI will be developed[1] is clearly very important to a whole bunch of prioritisation questions. One reason is that TAI - and probably the systems leading up to it - will very substantially change many aspects of how society works. Specifically, Open Phil has defined TAI as "AI that precipitates a transition comparable to (or more significant than) the agricultural or industrial revolution" (and Muehlhauser has provided some more detail on what is meant by that).
But I think some EAs implicitly assume something stronger, along the lines of: the TAI transition will be such an abrupt break that pre-TAI actions can't predictably influence the post-TAI world except via how the transition itself plays out.
But I don't think that necessarily follows from how TAI is defined. E.g., various countries, religions, ideologies, political systems, technologies, etc., existed both before the Industrial Revolution and for decades/centuries afterwards. And it seems like some pre-Industrial-Revolution actions - e.g. pushing for democracy or the abolition of slavery - had effects on the post-Industrial-Revolution world that were probably predictably positive in advance and that weren't just about affecting how the Industrial Revolution itself occurred.
(Though it may still have been extremely useful for people taking those actions to know that the IR would occur, and when, where, and how, e.g. because then they could push for democracy and abolition in the countries that were about to become much more influential and powerful.)
So I'm tentatively inclined to think that some EAs are assuming that short timelines push against certain types of work more than they really do, and that certain (often "broad") interventions could be useful in expectation for influencing the post-TAI world in a relatively "continuous" way. In other words, I'm inclined to think there might be less of an extremely abrupt "break" than some people seem to expect, even if TAI occurs. (Though it'd still be quite extreme by many standards, just as the Industrial Revolution was.)
[1] Here I'm assuming TAI will be developed, which is questionable, though it seems to me pretty much guaranteed unless some existential catastrophe occurs beforehand.
[Kind-of thinking aloud; bit of a tangent from your AMA]
Yeah, that basically matches my views.
I guess what I have in mind is that some people seem to:
- round up "most compelling reason" to "only reason"
- not consider the idea of trying to influence lock-in events that occur after a TAI transition, in ways other than influencing how the TAI transition itself occurs
  - Such ways could include things like influencing political systems in long-lasting ways
- round up "substantial chance that technology that enables lock-in will be built very close in time aft
...