A Hierarchical Goal-Biased Curriculum for Training Reinforcement Learning

Sunandita Patra; Mark Cavolowsky; Onur Kulaksizoglu; Ruoxi Li; Laura Hiatt; Mark Roberts; Dana Nau

doi:10.32473/flairs.v35i.130720

A Hierarchical Goal-Biased Curriculum for Training Reinforcement Learning

Authors

Sunandita Patra University of Maryland, College Park
Mark Cavolowsky University of Maryland, College Park
Onur Kulaksizoglu University of Maryland, College Park
Ruoxi Li University of Maryland, College Park
Laura Hiatt Naval Research Laboratory
Mark Roberts
Dana Nau

DOI:

https://doi.org/10.32473/flairs.v35i.130720

Keywords:

planning and learning, goal biased curriculum, reinforcement learning, curriculum learning, hierarchical planning

Abstract

Hierarchy and curricula are two techniques commonly used to improve training for Reinforcement Learning (RL) agents. Yet few works have examined how to leverage hierarchical planning to generate a curriculum for training RL Options. We formalize a goal skill that extends an RL Option with state-based conditions that must hold during training and execution. We then define a Goal-Skill Network that integrates a Hierarchical Goal Network, a variant of hierarchical planning, with goal skills as the leaves of the network. An automatically generated plan for a Goal-Skill Network correctly orders goal skills such that (1) it is a Goal-Biased Curriculum for training the goal skills, and (2) it can be executed to achieve top-level goals. In a set of six distinct gridworld environments using up to ten goal skills, we demonstrate that these contributions train nearly perfect policies significantly faster than learning a whole policy from scratch.

Downloads

Published

04-05-2022

How to Cite

Patra, S., Cavolowsky, M., Kulaksizoglu, O., Li, R., Hiatt, L., Roberts, M., & Nau, D. (2022). A Hierarchical Goal-Biased Curriculum for Training Reinforcement Learning. The International FLAIRS Conference Proceedings, 35. https://doi.org/10.32473/flairs.v35i.130720

Download Citation

Issue

Vol. 35 (2022): Proceedings of FLAIRS-35

Section

Special Track: Autonomous Robots and Agents

License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

A Hierarchical Goal-Biased Curriculum for Training Reinforcement Learning

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Developed By

Make a Submission

Language