Abstract
Tree search is an important component of many decision-making algorithms but often relies on an evaluation function that estimates the desirability of each node. In this paper, we propose to learn which nodes to expand based on a variety of object-level features. We introduce a reward function for this problem based on value of computation estimates with respect to improving the policy for the underlying problem. We apply deep reinforcement learning to this problem in an approach we call Reinforcement Learning for Tree Search (RLTS) and demonstrate that it can yield better performance than baselines in a procedurally generated environment.
Original language | English |
---|---|
Publication status | Accepted/In press - 20 Aug 2023 |
Event | The Bridging the Gap Between AI Planning and Reinforcement Learning Workshop at the International Joint Conference on AI - Macau, China Duration: 20 Aug 2023 → 20 Aug 2023 https://prl-theworkshop.github.io/prl2023-ijcai/ |
Workshop
Workshop | The Bridging the Gap Between AI Planning and Reinforcement Learning Workshop at the International Joint Conference on AI |
---|---|
Abbreviated title | PRL @ IJCAI 2023 |
Country/Territory | China |
City | Macau |
Period | 20/08/2023 → 20/08/2023 |
Internet address |
Keywords
- Reinforcement Learning
- Planning Algorithms
- Tree Search
- Metareasoning