Hide table of contents

tldr: I'm looking for undergraduate research assistants / collaborators to work on research questions at the intersection of social science and long-term risks from AI. I've collected some research questions here. If you’re interested in working on these or related questions, and would like advice or mentorship, please contact Vael Gates at vlgates@stanford.edu!


Broader Vision

I'm a social scientist, and I want to contribute to reducing long-term risks from AI. I'm excited about growing the community of fellow researchers (at all levels) who are interested in the intersection of AI existential risk and social science. 

To that end, I'm hoping to:

  1. collect and grow a list of research questions that would be interesting for social scientists of various subfields and valuable to AI safety
  2. work with undergraduate students / collaborators on these questions

Below are some research questions I think are interesting, largely drawn from conversations with others and extant question lists. The questions are biased towards psychology at the moment; I'm looking to expand them. 

If any of these look interesting to you, or if you're generally enthusiastic about the idea of this research intersection, please get in contact (vlgates@stanford.edu)! As a postdoctoral scholar at Stanford, I can offer undergraduate RAs the usual academic array of research experience, mentoring, recommendation letters, and potentially co-authorship on a conference publication.


Research Questions

(AI × social science) Research Questions

Please feel encouraged to add additional research questions to the bottom of the Google doc. 

Current topics:


Previous Work

There have already been several requests for social scientists to do work in longtermist AI safety (as opposed to current-day and near-term AI safety). I am excited about these opportunities, yet note that since the field is still new, many of these opportunities require specialized skillsets and interests, substantial research independence, and have limited options for mentorship. What follows is my impression of the established opportunities: 

  • We need technical people who know how to run experiments, to help implement technical AI safety work that calls for interfacing with humans. (Ought and OpenAI "AI needs social scientists" have historically posted jobs of this nature, but are no longer doing this work to my knowledge)
  • We need people who are skilled at surveys, to study how the AI landscape is progressing, with various populations of people. (FHI and AI Impacts run surveys, e.g. Grace et al. (2018), Zhang and Dafoe (2019), Zhang et al. (2021), others!)
  • We need people who will work on forecasting for long-term AI (this isn't social-scientist specific, but some social scientists could do this work).
  • We need researchers whose technical work spans both AI and social science (see the research agenda from FHI and DeepMind, Open Problems in Cooperative AI)
  • We need social science researchers to advance AI governance.
  • We need expert social scientists who are highly technically-oriented and aware of modern AI and AI trends (e.g. sociologists, historians, psychologists, economists, etc.) to discover a useful role for themselves in shaping the AI landscape positively.

I aim to provide some mentorship for social science-oriented people interested in getting involved, and navigating the above tree. (Perhaps drawing from the growing Effective Altruism behavioral scientist community, and interested EA students, e.g. Stanford Existential Risk Initiative’s “Social Sciences and Existential Risks” reading group in winter 2021?) I hope that as a community we can / continue to generate research questions that have strong ties to impact, and coordinate to involve more interested researchers in answering these questions!

Thanks to Abby Novick Hoskin, Michael Aird, Lucius Caviola, Ari Kagan, Bill Zito, Tobias Gerstenberg, and Michael Keenan for providing commentary on earlier versions of this post. The more public-facing version of this post is cross-posted on my website: Seeking social science students interested in long-term risks from AI.

Sorted by Click to highlight new comments since:

Planned summary for the Alignment Newsletter:

This post presents a list of research questions around existential risk from AI that can be tackled by social scientists. The author is looking for collaborators to expand the list and tackle some of the questions on it, and is aiming to provide some mentorship for people getting involved.

I'm the author of the cited AI safety needs social scientists article (along with Amanda Askell), previously at OpenAI and now at DeepMind.  I currently work with social scientists in several different areas (governance, ethics, psychology, ...), and would be happy to answer questions (though expect delays in replies).

Thanks so much; I'd be excited to talk! Emailed.

Update: I've been running a two-month "program" with eight of the students who reached out to me! We've come up with research questions from my original list, and the expectation is that individuals work 9h/week as volunteer RAs. I've been meeting with each person / group for 30min per week to discuss progress. We're halfway through this experiment, with a variety of projects and progress states-- hopefully you'll see at least one EA Forum post up from those students! 


I was quite surprised by the interest that this post generated; ~30 people reached out to me, and a large number were willing to do a volunteer research for no credit / pay. I ended up working with eight students, mostly based on their willingness to work with me on some of my short-listed projects. I was willing to have their projects drift significantly from my original list if the students were enthusiastic and the project felt decently aligned with risks from long-term AI, and that did occur. My goal here was to get some experience training students who had limited research experience, and I've been enjoying working with them. 

I'm not sure about how likely it is I'll continue working with students past this 2-month program, because it does take up a chunk of time (that's made worse by trying to wrangle schedules), but I'm considering what to do for the future. If anyone's interested in also mentoring students with an interest in longterm risks from AI, please let me know, since I think there's interest! It's a decently low time commitment (30m/student or group of students) once you've got everything sorted. However, I am doing it for the benefit of the students, rather than with the expectation of getting help on my work, so it's more of a volunteer role. 

I am of the belief that counterfactuals are socially constructed to an extent and so it might be useful for someone from a social science background to investigate this - at least if you think there's value in MIRI's research agenda.

The comment about counterfactuals makes me think about computational cognitive scientist Tobias Gerstenberg's research (https://cicl.stanford.edu), where his research focuses a lot on counterfactual reasoning in the physical domain, but he also has work in the social domain. 

I confess to only a surface-level understanding of MIRI's research agenda, so I'm not quite able to connect my understanding of counterfactual reasoning in the social domain to a concrete research question within MIRI's agenda. I'd be happy to hear more though if you had more detail! 

I've written a post on this topic here - https://www.lesswrong.com/posts/9rtWTHsPAf2mLKizi/counterfactuals-as-a-matter-of-social-convention.

BTW, I should be clear that my opinions on this topic aren't necessarily a mainstream position.

Curated and popular this week
Relevant opportunities