I'm a college student taking a graduate-level statistics class with a focus on data science. In this class, students have to complete three assignments using a dataset they can freely choose. The assignments consist of studying the dataset using statistics. For example, in the first one we have to pose a question to answer using the dataset, study the variables related to the question, and analyse the results obtained using inferential statistics (hypothesis tests, confidence intervals, etc.).
Since the assignment gives us the freedom to choose the dataset and the questions to work on, I thought it could be an opportunity to study a dataset related to our priority causes. Some ideas I have are datasets on the spread of a deadly disease, the impact of public policy on the regulation of a dangerous new technology, the effectiveness of advanced AI models in relation to factors like computing power, the scale of armed conflicts in relation to political or economic data, life expectancy in relation to factors like healthy habits or access to healthcare, the effectiveness of different interventions on any particular important issue, etc.
Any ideas? I would be glad to have any input on this.