Jack -- good question.
IMHO as a psych professor (somewhat biased in favor of psychology!), the most relevant behavioral sciences fields for working on 'AI alignment with human values' would be the key branches of psychology that have actually studied human values, such as moral psychology, political psychology, psychology of religion, social psychology, and evolutionary psychology.
Then there are the behavioral sciences fields that study the diversity of human values across individuals (e.g. personality psychology, clinical psychology, intelligence research, behavior genetics), across cultures (e.g. cross-cultural psychology, anthropology, political science, sociology), and across history (e.g. intellectual, political, religious, social, sexual, and family history).
Also, I think fields such as behavioral game theory, evolutionary game theory, microeconomics, and decision theory are very useful for AI alignment work.
There's a bit of neuroscience that studies human values, preferences, and decision-making (e.g. affective neuroscience, cognitive neuroscience) that might be relevant to AI research. But, in my opinion, neuroscience hasn't discovered much that's relevant to AI research that wasn't already discovered by behavioral sciences. Neuroscience has mostly identified where certain kinds of processing happens in the brain, without really adding much to our understanding of what kinds of processing are happening, and why, and what the implications are for AI alignment. (Epistemic status of this claim: somewhat weak; I studied neuroscience fairly deeply decades ago in grad school, and have kept up to some degree with recent brain imaging work, but I'm not a neuroscience expert.)
The big advantage of neuroscience is that it has high status, cachet, and fundability, and sounds like 'hard science'. So, people who don't understand the behavioral sciences, and who might consider a field like moral psychology to be 'soft science' or 'pseudo-science', might take a neuroscience degree more seriously. Honestly though, if you study neuroscience at PhD level, I would bet that the stuff that proves most useful to AI alignment will be the psychology theories, methods, and findings rather than the neuroscience research.
So, you'd have to decide whether the status benefits of a neuroscience PhD (vs. a PhD in moral psychology, or in behavioral game theory, for example) out-weigh the time costs of having to learn an awful lot about the details of brain anatomy and physiology, brain imaging methods, and voxel-based imaging analysis -- most of which simply won't be very relevant to AI alignment.
I think it's a bad idea for most people to do Neuroscience PhDs. PhDs in general are not optimised for truth seeking, working on high impact projects, or maximising your personal wellbeing. In fact, rates of anxiety and depression are higher amongst graduate students than the population of people with college degrees of similar age. You also get paid extremely badly, which is a problem for people with families or other financial commitments. For any specific question you want to ask, it seems worth investigating if you can do the same work in industry or at a non-profit, just to see if you would be able to study the same questions in a more focused way outside of academia.
So I don’t think doing a Neuro PhD is the most effective route to working on AI Safety. That said, there seem to be some useful research directions if you want to pursue a Neuro PhD program anyway. Some examples include: interpretability work that can be translated from natural to artificial neural networks; specifically studying neural learning algorithms; or doing completely computational research, aka a backdoor CS PhD while fitting your models to neural data collected by other people. (CS PhD programs are insanely competitive right now, and Neuroscience professors are desperate for lab members who know how to code, so this is one way into a computational academic program at a top university if you’re ok working on Neuroscience relevant research questions.)
Vael Gates (who did a Computational/Cognitive Neuroscience PhD with Tom Griffiths, one of the leaders of this field), has some further thoughts that they’ve written up in this EA Forum post. I completely agree with their assessment of neuroscience research from the perspective of AI Safety research here:
Going slightly off tangent: your original question specifically mentions moral uncertainty. I share Geoffrey Miller’s views in his comment on this thread, that Psychology is a more useful discipline to study moral uncertainty compared to Neuroscience.
On the flip side, I think psychologists have done very interesting/useful research on human values (see this paper on how normal people think about population ethics, also eloquently written up as a shorter/more readable EA Forum post here). In this vein, I’ve also been very impressed by work produced by psychologists working with empirical philosophers, for example this paper on the Psychology of Existential Risk.
If you want to focus on moral uncertainty, you can collect way more information from a much more diverse set of individuals if you focus on behaviour instead of neural activity. As Geoffrey mentions, it is *much* easier/cheaper to study people’s opinions or behaviour than it is to study their neural activity. For example, it costs ~$5 to pay somebody to take a quick survey on moral decisions, vs. about $500 an hour to run an fMRI scanner for one subject to collect a super messy dataset that’s incredibly different to interpret. People do take research more seriously if you slap a photo of a brain on it, but that doesn’t mean the brain data adds anything more than aesthetic value.
It might make sense for you to check out what EA Psychologists are actually doing to see if their research seems more up your alley compared to the neuroscience questions you’re interested in. A good place to start is here: https://www.eapsychology.org/
Abby -- excellent advice. This is consistent with what I've seen in neuroscience, psychology, and PhD programs in general.
Thanks! I agreed/appreciated your thoughts on how Psych can actually be relevant to human value alignment as well, especially compared to Neuro!