Long list of AI questions

NunoSempere; technicalities; David Mathers🔸; Misha_Yagudin

Comments 16

Good job on putting this together.

If I could make one suggestion, I think the questions about how a catastrophe would occur (i.e. nanotech, viruses, etc.) deserve their own section, rather than being lumped in under "miscellaneous". This is a key part of the argument for AI being an x-risk, and imo one of the most underdeveloped parts.

Nick K.

I agree that this would be interesting to explore, but heavily disagree that having a detailed answer to that influences the prediction of X risk substantially.

David Mathers🔸

Why do you disagree?

David Mathers🔸

Fair point. I personally agree that this has tended to be underdeveloped.

david_reinstein

★ By 2025/2030/2035 will there be a "best AI safety practices playbook" which all leading labs in the US claim to follow?

unjournal.org just released our evaluation package for Schuett et al.'s "Towards best practices in AGI safety and governance".

Please see here.

NunoSempere

Nice

Peter Slattery 🔸

Thanks for this. If easy, can you please curate your suggested questions in a spreadsheet so that I can filter them by priority and type? If you do this, I will share with at least two academics and labs who might do some of the research desired. I may do so anyway, but at the moment, it probably won't be something that they will find time to read unless I can refer them to the parts that are most immediately relevant.

Peter Slattery 🔸

Here is what I eventually extracted and will share, just in case it's useful.

★★★ (RP DG) By what year will at least 15% of patents granted in the US be for designs generated primarily via AI? Reasons for inclusion: both an early sign that AI might be able to design dangerous technology and an indicator that AIs will be economically useful to deploy across diverse industries. Question resolves according to the best estimate by the [resolution council].

★★★ (UF RP) How long will the gap be between the first creation of an AI which could automate 65% of current labour and the availability of an equivalently capable model as a free open-source program?

★★★ (RP) Meta-capabilities question: by 2029 will there be a better way to assess the capabilities of models than testing their performance on question-and-answer benchmarks?

★★★ (RP UF) How much money will the Chinese government cumulatively spend on training AI models between 2024 and 2040 as estimated by the [resolution council]?

★★★ (UF, FE, RP) Consider the first AI model able to individually perform any cognitive labour that a human can. Then, how likely is a deliberately engineered pandemic which kills >20% of the world's population in the 50 years after the first such model is built?

★★★ (UF, FE, RP) How does the probability of the previous question change if models are widely available to citizens and private businesses, compared to if only government and specified trusted private organizations are allowed to use them?

★★★ (FE, RP) What is the total number of EAs in technical AI alignment? Across academia, industry, independent research organizations, ¿government?, etc. See "The academic contribution to AI safety seems large" for an estimate from 2020.

★★★ (FE, RP) What is the total number of non-EAs in technical AI alignment? Across academia, industry, independent research organizations, ¿government?, etc.

★★★ (RP) How likely is it that an AI could get nanomachines built just by making ordinary commercial purchases online, and obtaining the cooperation of <30 human beings without scientific skills above masters degrees in relevant subjects?

★★★ (UF, RP) Take-off speed: after automating 15% of labour, how long will it take until 60% of labour is automated? Question note: 99%+ of labour has already been automated, since most humans don't work in agriculture any more. This question asks about automating 15% and 60% of labour of the type done in 2023; see "recurring terms".

★★★ (FE, RP) How long does it take TSMC to manufacture 100k GPUs? Relevance: Not that high, but a neat Fermi estimate warm up. Might just generally be good for having good models of the world, though.

★★★ (UF, RP) What is the % chance that by 2025/2030/2035/2040 an AI will persuade a human to commit a crime in order to further the AI's purposes? If one wanted to make this question resolvable: Question resolves according to the [resolution council]'s probability that this has happened. This would require a platform that accepts probabilistic resolutions. See also below "When will the US' SEC accuse someone of committing securities fraud substantially aided by AI systems?"

★★★ (RP, FE) What fraction of labour will be automated between 2023 and 2028/2035/2040/2050/2100? Question operationalization: See the "recurring terms" section. For a reference on an adjacent topic, see Phil Trammell's "Economic growth under transformative AI".

NunoSempere

I have extracted top questions to here: https://github.com/NunoSempere/clarivoyance/blob/master/list/top-questions.md with the Linux command at the top of the page. Hope this is helpful enough.
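For readers who would rather have a filterable table than a markdown list, here is a minimal sketch of one way to turn starred question lines into CSV rows. It is not the Linux command from the linked page. The line shape it assumes (`★★★ (TAGS) question text`, with an optional leading `**`) is inferred from the questions quoted in this thread, and the names `parse_question` and `questions_to_csv` are hypothetical.

```python
import csv
import io
import re

# Assumed line shape, based on the questions quoted in this thread:
#   "★★★ (UF, RP) Question text..."  (optionally preceded by stray "**")
QUESTION_RE = re.compile(r"^[*\s]*(★+)\s*\(([^)]*)\)\s*(.+)$")


def parse_question(line):
    """Return priority, tags and text for a starred question line, else None."""
    match = QUESTION_RE.match(line.strip())
    if match is None:
        return None
    stars, tags, text = match.groups()
    # Tags appear both comma-separated and space-separated in the list.
    tag_list = [t for t in re.split(r"[,\s]+", tags) if t]
    return {"priority": len(stars), "tags": tag_list, "question": text.strip()}


def questions_to_csv(lines, min_priority=3):
    """Write questions with at least `min_priority` stars to a CSV string."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["priority", "tags", "question"])
    for line in lines:
        q = parse_question(line)
        if q is not None and q["priority"] >= min_priority:
            writer.writerow([q["priority"], " ".join(q["tags"]), q["question"]])
    return out.getvalue()
```

The resulting CSV can be opened in any spreadsheet tool and filtered on the `priority` and `tags` columns. Lines that do not match the assumed shape are skipped rather than guessed at.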

Peter Slattery 🔸

Thank you.

Ozzie Gooen

Just looking at this recently, for thoughts I've been having here.

Some super quick thoughts:
1. Overall, really like this sort of work. Want to see more like it.
2. I like that you've helped prioritize these questions.
3. I find the questions right now very difficult to parse. There are so many of them, they cover many different topics, it's hard for me to feel like I grasp them or keep many in my head at once.
4. I think that scorable function definitions would help here.

I'm currently working on planning some potential scorable function definitions around this issue, might make a post on that later.

Ozzie Gooen

Some other things that come to mind:
1. I'm nervous about the "when will X labor be automated" questions, as a lot of jobs just become more demanding.
2. Similarly, not sure how valuable patent rates are.
3. I really want better indicators of "how quickly is AI progress happening?" My guess now is that the most reliable ones are things like, "How much effective computation is happening over time?"
4. "Long-term autonomous LLM operation when?" -> I'm nervous about this, as I expect that most long-running processes will have some percent human oversight/intervention, so it's hard to define.

A lot of these concepts are slippery, and I'd generally expect a lot of human+AI hybrid systems for a long time, making it less clear how to measure the AI part specifically.

NunoSempere

Thanks Ozzie

Vasco Grilo🔸

Nice work!

In this adjacent document, we also outline a "resolution council"

The link points to this post.

NunoSempere

Thanks, fixed

Toby Tremlett🔹

I spotted three instances of "this document" not being linked to the relevant document. Let me know if this was a bug :)

Long list of AI questions

Table of Contents

Recommendations

Questions

Recurring terms

Key

Questions relevant to speed of capabilities progress

Questions relevant to safety and alignment

Interpretability

Eliciting Latent Knowledge

Iterated Distillation and Amplification:

Debate (see section 2. here)

General safety

General safety agenda templates

Regulation and Corporate Governance^[16]

Who will be at the forefront of AI research?

Questions about militarization.

Questions about how agent-y and general future AIs will be, and how that affects X-risk from AI

Risks of various kinds from EAs and other people concerned about AI X-risk getting things wrong

General Warning Signs

Chance and Effects of Deliberately Slowing AI Progress

Questions about public and researcher opinion

Security Questions

EA opinion on relevant issues:

AI effects on (non-AI takeover) catastrophic and X-risks in international relations

Miscellaneous

Acknowledgments
