I'm prepping a new upper-level undergraduate/graduate seminar on 'AI and Psychology', which I'm aiming to start teaching in Jan 2025. I'd appreciate any suggestions that people might have for readings and videos that address the overlap of current AI research (both capabilities and safety) and psychology (e.g. cognitive science, moral psychology, public opinion). The course will have a heavy emphasis on the psychology, politics, and policy issues around AI safety, and will focus more on AGI and ASI than on narrow AI systems. Content that focuses on the challenges of aligning AI systems with diverse human values, goals, ideologies, and cultures would be especially valuable. Ideal readings/videos would be short, clear, relatively non-technical, recent, and aligned with an EA perspective. Thanks in advance!
This course sounds cool! Unfortunately there doesn't seem to be too much relevant material out there.
This is a stretch, but I think there's probably some cool computational modeling to be done with human value datasets (e.g., responses from 70,000 participants to variations on the trolley problem). What kinds of universal human values can we uncover? https://www.pnas.org/doi/10.1073/pnas.1911517117
For digestible content on technical AI safety, Robert Miles makes good videos. https://www.youtube.com/c/robertmilesai
Abby - good suggestions, thank you. I think I will assign some Robert Miles videos! And I'll think about the human value datasets.