Existential Cybersecurity Risks & AI (A Research Agenda)

This post is co-authored with Ben Garfinkel. It is cross-posted from the CEA blog. A PDF version can be found here. Summary: Some strategic decisions available to the effective altruism m...

If you're agentic, work in biosecurity

sharmaayushmaan🔸·5d ago·7m read

Disclaimer: Although I work on the Groups Team at CEA, I’m writing this in a personal capacity, and this post does not constitute an endorsement by CEA. Agency - the realisation that you really can just do things. TL;DR Biosecurity needs people (of any background) who are agentic and have a high execution velocity and track record....

Marginal Victories: career advising and opportunities for U.S. democracy preservation & political work

Annika Burman 🔸·3d ago·2m read

TL;DR: Marginal Victories is a new initiative to provide 1:1 career advising, opportunities, and resources for people exploring high-leverage U.S. democracy preservation and political work. Built by impact-oriented people doing pro-democracy work, the early MVP is now up at marginalvictories.org. Fill out the 10-minute form now to receive these resources as they become available over the next few...

Recent opportunities to take action

Marginal Victories: career advising and opportunities for U.S. democracy preservation & political work

Annika Burman 🔸·3d ago·2m read

I'm stepping down as Hive's Executive Director, and we're hiring my successor

SofiaBalderson, Hive·3d ago·3m read

Starting an EA group @ SUNY Binghamton

micahzarin·2d ago·1m read

^{^}

T. Shevlane, ‘Structured access: an emerging paradigm for safe AI deployment’, 2022, doi: 10.48550/ARXIV.2201.05159. Available: https://arxiv.org/abs/2201.05159. [Accessed: Sep. 18, 2023]

^{^}

J. E. Barnes, B. Wintrode, and J. Daemmrich, ‘A Babysitter and a Band-Aid Wrapper: Inside the Submarine Spy Case’, The New York Times, Oct. 11, 2021. Available: https://www.nytimes.com/2021/10/11/us/politics/inside-submarine-spy-case.html. [Accessed: Sep. 18, 2023]

^{^}

IBM Security, ‘Cost of a Data Breach 2022’, Armonk, NY, United States, Jul. 2022. Available: https://www.ibm.com/downloads/cas/3R8N1DZJ. [Accessed: Sep. 18, 2023]

^{^}

D. Arp et al., ‘Dos and Don’ts of Machine Learning in Computer Security’, 2020, doi: 10.48550/ARXIV.2010.09470. Available: https://arxiv.org/abs/2010.09470. [Accessed: Sep. 18, 2023]

^{^}

S. Karnouskos, "Artificial Intelligence in Digital Media: The Era of Deepfakes," in IEEE Transactions on Technology and Society, vol. 1, no. 3, pp. 138-147, Sept. 2020, doi: 10.1109/TTS.2020.3001312. Available: https://ieeexplore.ieee.org/document/9123958. [Accessed: Sep. 18, 2023]

^{^}

J. Altmann and F. Sauer, ‘Autonomous Weapon Systems and Strategic Stability’, Survival, vol. 59, no. 5, pp. 117–142, Sep. 2017, doi: 10.1080/00396338.2017.1375263. Available: https://www.tandfonline.com/doi/full/10.1080/00396338.2017.1375263. [Accessed: Sep. 18, 2023]

^{^}

Open AI, ‘Open AI Security Portal’. Available: https://trust.openai.com/. [Accessed: Sep. 18, 2023]

^{^}

Inflection AI, ‘Safety’. Available: https://inflection.ai/safety. [Accessed: Sep. 18, 2023]

^{^}

Although US policy constrains autonomous nuclear command and control, other nuclear powers have no such policies.

Office of U.S. Senator Edward Markey, ‘Markey, Lieu, Beyer, and Buck Introduce Bipartisan Legislation to Prevent AI From Launching a Nuclear Weapon ’. Available: https://www.markey.senate.gov/news/press-releases/markey-lieu-beyer-and-buck-introduce-bipartisan-legislation-to-prevent-ai-from-launching-a-nuclear-weapon. [Accessed Sep. 16, 2023]

^{^}

F. Bajak, ‘Insider Q&A: Artificial intelligence and cybersecurity in military tech’, AP News, May 29, 2023. Available: https://apnews.com/article/ai-cybersecurity-military-tech-weapons-systems-offensive-hacking-cyber-command-ae2a9417909388237d3667d1c61b0f99. [Accessed: Sep. 18, 2023]

^{^}

J. Carlsmith, ‘Is Power-Seeking AI an Existential Risk?’, 2022, doi: 10.48550/ARXIV.2206.13353. Available: https://arxiv.org/abs/2206.13353. [Accessed: Sep. 18, 2023]

^{^}

J. Babcock, J. Kramár, and R. Yampolskiy, ‘The AGI Containment Problem’, in Artificial General Intelligence, B. Steunebrink, P. Wang, and B. Goertzel, Eds., Cham: Springer International Publishing, 2016, pp. 53–63. doi: 10.1007/978-3-319-41649-6_6. Available: http://link.springer.com/10.1007/978-3-319-41649-6_6. [Accessed: Sep. 18, 2023]

^{^}

J. Steinhardt, P. W. W. Koh, and P. S. Liang, ‘Certified Defenses for Data Poisoning Attacks’, in Advances in Neural Information Processing Systems, I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, Eds., Curran Associates, Inc., 2017. Available: https://proceedings.neurips.cc/paper_files/paper/2017/file/9d7311ba459f9e45ed746755a32dcd11-Paper.pdf. [Accessed: Sep. 18, 2023]

^{^}

S. Balla, ‘Fake comments flooded in when the FCC repealed net neutrality. They may count less than you think.’, Washington Post, Dec. 14, 2017. Available: https://www.washingtonpost.com/news/monkey-cage/wp/2017/12/14/there-was-a-flood-of-fake-comments-on-the-fccs-repeal-of-net-neutrality-they-may-count-less-than-you-think/. [Accessed: Sep. 19, 2023]

^{^}

W. Zaremba et al., ‘Democratic Inputs to AI’, OpenAI Blog, May 25, 2023. Available: https://openai.com/blog/democratic-inputs-to-ai. [Accessed: Sep. 18, 2023]

^{^}

D. Klepper and A. Swenson, ‘AI-generated disinformation poses threat of misleading voters in 2024 election’, PBS NewsHour, May 14, 2023. Available: https://www.pbs.org/newshour/politics/ai-generated-disinformation-poses-threat-of-misleading-voters-in-2024-election. [Accessed: Sep. 18, 2023]

^{^}

E. Starker, ‘ Large Scale Analysis of DNS Query Logs Reveals Botnets in the Cloud ’, Azure Blog, Mar. 27, 2017. Available: https://techcommunity.microsoft.com/t5/security-compliance-and-identity/large-scale-analysis-of-dns-query-logs-reveals-botnets-in-the/m-p/57064. [Accessed: Sep. 18, 2023]

^{^}

M. Abdelsalam, R. Krishnan, Y. Huang and R. Sandhu, "Malware Detection in Cloud Infrastructures Using Convolutional Neural Networks," 2018 IEEE 11th International Conference on Cloud Computing (CLOUD), San Francisco, CA, USA, 2018, pp. 162-169, doi: 10.1109/CLOUD.2018.00028. Available: https://ieeexplore.ieee.org/document/8457796. [Accessed: Sep. 18, 2023]

^{^}

D. Simchi-Levi, F. Zhu, and M. Loy, ‘Fixing the U.S. Semiconductor Supply Chain’, Harvard Business Review, Oct. 25, 2022. Available: https://hbr.org/2022/10/fixing-the-u-s-semiconductor-supply-chain. [Accessed: Sep. 18, 2023]

^{^}

C. Van Camp and W. Peeters, ‘A World without Satellite Data as a Result of a Global Cyber-Attack’, Space Policy, vol. 59, p. 101458, Feb. 2022, doi: 10.1016/j.spacepol.2021.101458. Available: https://linkinghub.elsevier.com/retrieve/pii/S0265964621000503. [Accessed: Sep. 18, 2023]

^{^}

M. D. Zabel and C. Reid, ‘A brief history of prions’, Pathogens and Disease, vol. 73, no. 9, p. ftv087, Dec. 2015, doi: 10.1093/femspd/ftv087. Available: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4626585/. [Accessed: Sep. 19, 2023]

^{^}

J. Amano, ‘SEMI Publishes First Cybersecurity Standards ’, Standards Watch, Mar. 07, 2022. Available: https://www.semi.org/en/standards-watch-2022-March/SEMI-publishes-first-cybersecurity-standards. [Accessed: Sep. 19, 2023]

^{^}

P. Rome, ‘Every Satellite Orbiting Earth and Who Owns Them’, Data Acquisition Knowledge Base, Jan. 18, 2022. Available: https://dewesoft.com/blog/every-satellite-orbiting-earth-and-who-owns-them. [Accessed: Sep. 19, 2023]

^{^}

The standards I read in depth are: DI-IPSC-82249, DI-IPSC-82250, DI-IPSC-82251, and DI-IPSC-82252.

Defense Logistics Agency, ‘ASSIST’. Sep. 18, 2023. Available: https://quicksearch.dla.mil/qsSearch.aspx. [Accessed: Sep. 19, 2023]

^{^}

Y. Shavit, ‘What does it take to catch a Chinchilla? Verifying Rules on Large-Scale Neural Network Training via Compute Monitoring’, 2023, doi: 10.48550/ARXIV.2303.11341. Available: https://arxiv.org/abs/2303.11341. [Accessed: Sep. 19, 2023]

Existential Cybersecurity Risks & AI (A Research Agenda)

Existential Cybersecurity Risks & AI (A Research Agenda)

Summary

Top 10 Risks Considered

1: Losing Control of Sensitive AI Data

2: Failures of AI Models in Weapons Control

3: Containment Failure for Advanced AI

4: Disruption of Advanced Model Development

5: Disruption of AI Governance Decisions

6: Disruption of Democratic Decisions via AI-enabled Misinformation

7: Unauthorised Use of Advanced Hardware Clusters

8: Blocking the AI Hardware Supply Chain

9: Curtailing Future Human Potential

10: Unknown Unknown Threats

Common Trends/Insights

Future Research Agenda

Existential Cybersecurity Risks & AI (A Research Agenda)

Existential Cybersecurity Risks & AI (A Research Agenda)

Summary

Top 10 Risks Considered

1: Losing Control of Sensitive AI Data

2: Failures of AI Models in Weapons Control

3: Containment Failure for Advanced AI

4: Disruption of Advanced Model Development

5: Disruption of AI Governance Decisions

6: Disruption of Democratic Decisions via AI-enabled Misinformation

7: Unauthorised Use of Advanced Hardware Clusters

8: Blocking the AI Hardware Supply Chain

9: Curtailing Future Human Potential

10: Unknown Unknown Threats

Common Trends/Insights

**Future Research Agenda**

Future Research Agenda