Backward Monte Carlo Tree Search: Charting Unsafe Regions in the Belief-Space

Anil Yildiz; Esen Yel; Marcell Vazquez-Chanlatte; Kyle Wray; Mykel J. Kochenderfer; Stefan J. Witwicki

doi:10.1613/jair.1.18011

PDF

Published: Jan 27, 2026

DOI: https://doi.org/10.1613/jair.1.18011

Keywords:

autonomous agents, decision making under uncertainty, markov decisionprocesses, probabilistic reasoning

Anil Yildiz

a:1:{s:5:"en_US";s:39:"Stanford Intelligent Systems Laboratory";}

https://orcid.org/0000-0001-8194-4895

Esen Yel

https://orcid.org/0000-0002-0463-3601

Marcell Vazquez-Chanlatte

https://orcid.org/0000-0002-1248-0000

Kyle Wray

https://orcid.org/0000-0001-6986-9941

Mykel J. Kochenderfer

https://orcid.org/0000-0002-7238-9663

Stefan J. Witwicki

https://orcid.org/0009-0005-3224-9198

Abstract

Safety-critical systems often operate in partially observable environments, where assessing the safety of the underlying policy remains a fundamental challenge. This study focuses on evaluating policies by identifying regions of the belief-space that can lead the system’s policy to an undesirable state with a non-negligible probability. In this paper, we introduce Backward Monte Carlo Tree Search, the first Monte Carlo tree search framework that expands backward in time within the belief-space. The tree search begins from an undesired terminal belief and recursively explores its possible predecessors, constructing a tree of belief transitions that could lead to an unsafe outcome within a given horizon. Evaluations in gridworld and autonomous driving domains show that identifying beliefs from which failures may occur enables runtime risk forecasting and targeted policy retraining, marking a conceptual shift in how safety is validated under uncertainty.

Issue

Vol. 85 (2026)

Section

Articles

Article Sidebar

Main Article Content

Abstract

Article Details