Oliver Guest 10/31/23 Oliver Guest 10/31/23

International AI Safety Dialogues: Benefits, Risks, and Best Practices

Events that bring together international stakeholders to discuss AI safety are a promising way to reduce AI risks. This report recommends ways to make these events a success.

Ashwin Acharya 10/27/23 Ashwin Acharya 10/27/23

Managing AI Risks in an Era of Rapid Progress

This paper discusses risks from future AI systems and proposes priorities for AI R&D and governance. Its many authors include an IAPS researcher, Turing Prize winners, and a Nobel Memorial Prize winner.

Shaun Ee 10/13/23 Shaun Ee 10/13/23

Adapting Cybersecurity Frameworks to Manage Frontier AI Risks: a Defense-in-Depth Approach

The complex and evolving threat landscape of frontier AI development requires a multi-layered approach to risk management (“defense-in-depth”). By reviewing cybersecurity and AI frameworks, we outline three approaches that can help identify gaps in the management of AI-related risks.

Erich Grunewald 10/5/23 Erich Grunewald 10/5/23

How Expertise in AI hardware Can Help with AI Governance

This article was written for the organization 80,000 Hours by an IAPS researcher. It discusses why and how it may be valuable to build expertise in AI hardware and use that expertise to reduce risks and improve governance decisions.

Erich Grunewald 10/4/23 Erich Grunewald 10/4/23

AI Chip Smuggling into China: Potential Paths, Quantities, and Countermeasures

This report examines the prospect of large-scale smuggling of AI chips into China and proposes six interventions for mitigating that.

Institute for AI Policy and Strategy 10/2/23 Institute for AI Policy and Strategy 10/2/23

Open-Sourcing Highly Capable Foundation Models

This paper, led by the Centre for the Governance of AI, evaluates the risks and benefits of open-sourcing, as well as alternative methods for pursuing open-source objectives.

Joe O'Brien 9/30/23 Joe O'Brien 9/30/23

Deployment Corrections: An Incident Response Framework for Frontier AI Models

This report describes a toolkit that frontier AI developers can use to respond to risks discovered after deployment of a model. We also provide a framework for AI developers to prepare and implement this toolkit.