I've had the opportunity to take a zoom-out look at AI safety & longtermism as a field the past couple weeks. As I've thought and read, I realize that there's a lot of "white space": good ideas that haven't yet been tried or implemented.
I make three major assumptions going into this piece:
This piece has the following sections:
| Cause Area | Sub-field | Impact of TAI |
| Humans | Dictatorship / War | Utopia: TAI peacefully creates a global democracy that satisfies all human preferences Status quo: Dictators continue reigning over oppressed people, and the free world doesn't do much to help Dystopia: TAI empowers dictators to take over the free world, or TAI itself becomes a dictator |
| Disease | Utopia: TAI solves all diseases, including those that only exist in the Global South Status quo: TAI solves diseases relevant to humans in rich countries Dystopia: TAI creates bioweapons that make it impossible for all humans to live / interact with others | |
| Inequality | Utopia: TAI improves technology, and the rising tide lifts all boats Status quo: TAI enriches the top 1%, and further entrenches wealth inequality Dystopia: TAI robs everyone of wealth except those who help it | |
| Non-Human Animals | Farmed Animals | Utopia: TAI makes it ridiculously cheap to produce alternative protein, thereby making animal-based protein obsolete. TAI creates adequate, healthy places for former-farmed animals to live peacefully, and provides for all their needs. Status quo: TAI helps with making factory farming more efficient, but also helps alternative proteins develop. Dystopia: TAI leads to value lock-in, and factory farming is spread throughout the galaxy. |
| Wild Animals | Utopia: TAI is, well, transformative, and brings about a Pearcean "welfare state" for all wild animals, including fish and insects, either by recreating ecosystems so suffering is needless or by creating gene drives so suffering no longer exists neurologically Status quo: TAI helps us terraform earth so wild animal habitats gradually diminish Dystopia: TAI actively tries to torture all wild animals or painfully erode their habits (unclear how or why...) | |
| Digital Sentience | Interaction with Humans | Utopia: Digital sentience has a consensual, collaborative, positive relationship with all humans Status quo: Digital sentience remains subservient to humans, and no political rights exist for those beings Dystopia: Digital sentience is as meaningful and worthy of moral weight as human/animal sentience, but digital beings are tortured and feel enormous pain as they do human bidding, and there are no rights or protections for digital beings. |
| Interaction with Other Digital Sentience | Utopia: Digital sentience plays well with other digital sentience, and there are no abusive/coercive power dynamics Status quo: Digital beings interact with each other on Moltbook. We don't know if they're sentient, but suspect that they're not. Dystopia: Digital sentience dominates and tortures other digital sentience for fun / for economic value, just like humans do to animals or humans do to other humans. |
Most people are already invested in a cause area, but if you're not, here are some observations:
So now we know what is at stake. Given the Causes of Suffering (and joy!), and the potential impact of TAI, one might wonder -- what should we do next?
Luckily, there are ideas out there. These bullets are primarily drawn from William MacAskill's Effective Altruism in the Age of AGI (reason #1). I have bucketed them in a sensible manner.
Direct Work
Many people think that TAI might lead to a "lock-in" of present values / institutions. If this is the case, doing direct work now could help improve outcomes in the long-run:
Avoiding Bad Futures & Creating Good Futures
We can also preempt bad things from happening, and set up the foundations for good futures. For example, if your threat model is a risk of bio-terrorism from an AI-empowered bad actor, then you could create a stockpile of masks (see ProEquip).
Improving Meta Functioning of Humanity
This bucket assumes that we don't have all the answers right now, but we should prepare ourselves (individuals, organizations and large institutions) to make the right decisions when we do know what will matter.
Anything that impacts the longterm future starts as an idea and goes along the following supply chain to become a concrete intervention:
Additionally, you have the following which are not strictly part of the supply chain but do strengthen it:
In an ideal world, each of the causes mentioned in Section 1.2 would have a flourishing ecosystem of organizations developing ideas, proposals, and interventions for a better future. Sadly, this seems to not (yet) be the case!
If you're interested in a cause area, I'd recommend using an LLM to do a deep research on the intersection of a particular cause and supply chain. For example, "find all the organizations doing theoretical / legal research for digital sentience." In many cases, you'll find that there are only a handful of orgs, and that their efforts are spread very think.
Some claims about ecosystems:
Without having done extensive research on every cause, I do have some rough takes on where "the ecosystem" stands on various issues. I say this having done some amount of reading on this forum and Forethought's writings, gone to SFS and EAG for two years, and having worked at Constellation for about a year and chatted with many researchers. But again, I encourage you to do your own reading on what exists out there.
For developed cause areas:
For nascent cause areas, I must admit I don't know the frontier of each cause area here. I generally feel that these fields are neglected, and at most have 1-2 EA-aligned causes working on them. Many of these seem to be bottle-necked on ideas / proposals.
Having done some research and thinking, I believe that the world is dramatically under-prepared for the challenges of transformative artificial intelligence. This will probably affect the long-term future of humanity, and potentially trillions of beings who will live throughout the galaxy. But, I do think that it's worth trying to do something about it.
Writing this piece was helpful for me to orient in my next career steps, and to think from an impartial viewpoint about the work that needs to be done. When you have a good idea of the "white space", you can then start to think about your personal fit, or perhaps what areas need more funding.
I'd like to close with two points:
Good luck!
More reading:
William MacAskill, Effective Altruism in the Age of AGI
Forethought, How to Make the Future Better
Holden Karnofsky, Rowing, Steering, Anchoring, Equity, Mutiny
80,000 Hours, Problem Profiles
Forethought, Preparing for the Intelligence Explosion
David Pearce, A Welfare State for Elephants?
Niki Dupuis, How I'm Thinking About the Next 3 Years
I like the table describing Causes of Suffering. It's useful to imagine what the utopian and dystopian states look like for each cause.
Also, if TAI is just 2-3 years away as you predict, I agree that we should be focused more on policy interventions, less on research. That made me think of Bernie Sanders recent proposal to instate a moratorium on AI data centers. Perhaps more EAs should be pressing for policies of the sort?