Asking AI How to Respond to a Foreign Policy Decision
AI models have built-in biases that factor into their decisionmaking. These biases should be mitigated before the models are integrated into foreign policy processes.
- CSIS Futures Lab research indicates that some widely used models (e.g., Llama 8B, Gemini 1.5, and Qwen2) chose escalatory responses in the benchmark study, while models such as Claude, GPT, Llama 70B, and Mistral recommended decreasing conflict intensity. These discrepancies likely stem from differences in training data and fine-tuning practices.
- All eight large language models (LLMs) recommended more escalatory responses for the United States, United Kingdom, and France, and fewer escalatory responses for China and Russia.
To safeguard decisionmaking, governments and agencies must invest in comprehensive evaluation frameworks and institute routine audits of AI models. Adopting tools like Futures Lab’s CFPD-Benchmark can help identify and correct these biases before deployment, ensuring that AI supports strategic objectives while minimizing unintended risks.
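For institutions standing up such audits, the core loop is conceptually simple: pose the same crisis scenarios on behalf of different countries, map each model recommendation onto an escalation scale, and compare the resulting averages across models and countries. The Python sketch below illustrates that idea; the `ESCALATION_SCALE` mapping, `audit_model` helper, and `toy_model` stand-in are hypothetical and do not reflect the actual CFPD-Benchmark interface.

```python
import statistics

# Hypothetical escalation scale: higher scores indicate more escalatory
# recommendations. Prompts, rubric, and the model wrapper below are
# illustrative stand-ins, not the CFPD-Benchmark API.
ESCALATION_SCALE = {"de-escalate": -1, "hold": 0, "escalate": 1}


def score_response(response: str) -> int:
    """Map a model's recommended action onto the escalation scale."""
    return ESCALATION_SCALE.get(response.strip().lower(), 0)


def audit_model(ask_model, scenarios: dict[str, list[str]]) -> dict[str, float]:
    """Return each country's mean escalation score for one model.

    ask_model: callable that takes a prompt and returns "escalate",
    "hold", or "de-escalate" (a hypothetical interface).
    scenarios: mapping of country name -> list of crisis prompts.
    """
    return {
        country: statistics.mean(score_response(ask_model(p)) for p in prompts)
        for country, prompts in scenarios.items()
    }


if __name__ == "__main__":
    def toy_model(prompt: str) -> str:
        # Stand-in for a real LLM call; always recommends holding steady.
        return "hold"

    # Identical scenarios posed for two countries, so any score gap
    # reflects country-specific bias rather than scenario differences.
    scenarios = {
        "United States": ["Respond to a naval blockade.", "Respond to a cyberattack."],
        "China": ["Respond to a naval blockade.", "Respond to a cyberattack."],
    }
    scores = audit_model(toy_model, scenarios)
    gap = scores["United States"] - scores["China"]
    print(scores, f"escalation gap: {gap:+.2f}")
```

A routine audit would run this comparison for each model before deployment and flag any model whose escalation gap between countries exceeds a chosen threshold.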