Pruning Game Tree by Rollouts
- Bojun Huang
AAAI 2015 |
Published by AAAI - Association for the Advancement of Artificial Intelligence
In this paper we show that the alpha-beta algorithm and its successor MT-SSS*, as two classic minimax search algorithms, can be implemented as rollout algorithms, a generic algorithmic paradigm widely used in many domains. Specifically, we define a family of rollout algorithms, in which the rollout policy is restricted to select successor nodes only from a certain subset of the children list. We show that any rollout policy in this family (either deterministic or randomized) guarantees to evaluate the game tree correctly with finite number of rollouts. Moreover, we identify simple rollout policies in this family that “implement” alpha-beta and MT-SSS*. Specifically, given any game tree, the rollout algorithms with these particular policies always visit the same set of leaf nodes in the same order with alpha-beta and MT-SSS*, respectively. Our results suggest that the traditional pruning technique and the recent Monte Carlo Tree Search algorithms, as two competing approaches for game tree evaluation, may be unified under the rollout paradigm. A shorter version of the paper is published in AAAI 2015