Understanding Sample Generation Strategies for Learning Heuristic Functions in Classical Planning | Journal of Artificial Intelligence Research

PDF

Published: Jun 2, 2024

DOI: https://doi.org/10.1613/jair.1.15742

Keywords:

heuristic search, planning

Rafael V. Bettker

Pedro P. Minini

André G. Pereira

a:1:{s:5:"en_US";s:39:"Federal University of Rio Grande do Sul";}

Marcus Ritt

Abstract

We study the problem of learning good heuristic functions for classical planning tasks with neural networks based on samples represented by states with their cost-to-goal estimates. The heuristic function is learned for a state space and goal condition with the number of samples limited to a fraction of the size of the state space, and must generalize well for all states of the state space with the same goal condition. Our main goal is to better understand the influence of sample generation strategies on the performance of a greedy best-first heuristic search (GBFS) guided by a learned heuristic function. In a set of controlled experiments, we find that two main factors determine the quality of the learned heuristic: the algorithm used to generate the sample set and how close the sample estimates to the perfect cost-to-goal are. These two factors are dependent: having perfect cost-to-goal estimates is insufficient if the samples are not well distributed across the state space. We also study other effects, such as adding samples with high-value estimates. Based on our findings, we propose practical strategies to improve the quality of learned heuristics: three strategies that aim to generate more representative states and two strategies that improve the cost-to-goal estimates. Our practical strategies result in a learned heuristic that, when guiding a GBFS algorithm, increases by more than 30% the mean coverage compared to a baseline learned heuristic.

Issue

Vol. 80 (2024)

Section

Articles

Article Sidebar

Main Article Content

Abstract

Article Details