Research Report: Oscar Delaney

Policy Options for Preserving Chain of Thought Monitorability

The most advanced AI models produce detailed reasoning steps in human language—known as "chain of thought" (CoT)—that provide crucial oversight capabilities for ensuring these systems behave as intended. However, competitive pressures may drive developers toward more efficient but non-monitorable architectures that lack a human-readable CoT. This report presents a framework for determining when coordination mechanisms are needed to preserve CoT monitorability.

Oliver Guest

The Future of the AI Summit Series

This is a link post for a paper led by researchers from the Oxford Martin AI Governance Initiative; IAPS researcher Oliver Guest was one of the authors.

Oliver Guest

Bridging the Artificial Intelligence Governance Gap: The United States' and China's Divergent Approaches to Governing General-Purpose Artificial Intelligence

A look at the U.S. and Chinese policy landscapes reveals differences in how the two countries approach the governance of general-purpose artificial intelligence. Three areas of divergence are notable for policymakers: the focus of domestic AI regulation, the key principles underlying domestic AI regulation, and approaches to implementing international AI governance.

Commentary: Sumaya Nur Adan

Key questions for the International Network of AI Safety Institutes

In this commentary, we explore key questions for the International Network of AI Safety Institutes and suggest ways forward ahead of the San Francisco convening on November 20-21, 2024. What should the network work on? How should it be structured in terms of membership and central coordination? How should it fit into the broader international governance landscape?
