International AI Safety Dialogues: Benefits, Risks, and Best Practices
Events that bring together international stakeholders to discuss AI safety are a promising way to reduce AI risks. This report recommends ways to make these events a success.
Managing AI Risks in an Era of Rapid Progress
This paper discusses risks from future AI systems and proposes priorities for AI R&D and governance. Its many authors include an IAPS researcher, Turing Prize winners, and a Nobel Memorial Prize winner.
Adapting Cybersecurity Frameworks to Manage Frontier AI Risks: a Defense-in-Depth Approach
The complex and evolving threat landscape of frontier AI development requires a multi-layered approach to risk management (“defense-in-depth”). By reviewing cybersecurity and AI frameworks, we outline three approaches that can help identify gaps in the management of AI-related risks.
How Expertise in AI hardware Can Help with AI Governance
This article was written for the organization 80,000 Hours by an IAPS researcher. It discusses why and how it may be valuable to build expertise in AI hardware and use that expertise to reduce risks and improve governance decisions.
AI Chip Smuggling into China: Potential Paths, Quantities, and Countermeasures
This report examines the prospect of large-scale smuggling of AI chips into China and proposes six interventions for mitigating that.
Open-Sourcing Highly Capable Foundation Models
This paper, led by the Centre for the Governance of AI, evaluates the risks and benefits of open-sourcing, as well as alternative methods for pursuing open-source objectives.
Deployment Corrections: An Incident Response Framework for Frontier AI Models
This report describes a toolkit that frontier AI developers can use to respond to risks discovered after deployment of a model. We also provide a framework for AI developers to prepare and implement this toolkit.