A Hierarchical Goal-Biased Curriculum for Training Reinforcement Learning

Sunandita Patra; Mark Cavolowsky; Onur Kulaksizoglu; Ruoxi Li; Laura Hiatt; Mark Roberts; Dana Nau

doi:10.32473/flairs.v35i.130720

A Hierarchical Goal-Biased Curriculum for Training Reinforcement Learning

Autores

Sunandita Patra University of Maryland, College Park
Mark Cavolowsky University of Maryland, College Park
Onur Kulaksizoglu University of Maryland, College Park
Ruoxi Li University of Maryland, College Park
Laura Hiatt Naval Research Laboratory
Mark Roberts
Dana Nau

DOI:

https://doi.org/10.32473/flairs.v35i.130720

Palavras-chave:

planning and learning, goal biased curriculum, reinforcement learning, curriculum learning, hierarchical planning

Resumo

Hierarchy and curricula are two techniques commonly used to improve training for Reinforcement Learning (RL) agents. Yet few works have examined how to leverage hierarchical planning to generate a curriculum for training RL Options. We formalize a goal skill that extends an RL Option with state-based conditions that must hold during training and execution. We then define a Goal-Skill Network that integrates a Hierarchical Goal Network, a variant of hierarchical planning, with goal skills as the leaves of the network. An automatically generated plan for a Goal-Skill Network correctly orders goal skills such that (1) it is a Goal-Biased Curriculum for training the goal skills, and (2) it can be executed to achieve top-level goals. In a set of six distinct gridworld environments using up to ten goal skills, we demonstrate that these contributions train nearly perfect policies significantly faster than learning a whole policy from scratch.

Downloads

PDF (English)

Publicado

2022-05-04

Como Citar

Patra, S., Cavolowsky, M., Kulaksizoglu, O., Li, R., Hiatt, L., Roberts, M., & Nau, D. (2022). A Hierarchical Goal-Biased Curriculum for Training Reinforcement Learning. The International FLAIRS Conference Proceedings, 35. https://doi.org/10.32473/flairs.v35i.130720

Baixar Citação

Edição

v. 35 (2022): Proceedings of FLAIRS-35

Seção

Special Track: Autonomous Robots and Agents

Licença

Este trabalho está licenciado sob uma licença Creative Commons Attribution-NonCommercial 4.0 International License.

A Hierarchical Goal-Biased Curriculum for Training Reinforcement Learning

Autores

DOI:

Palavras-chave:

Resumo

Downloads

Publicado

Como Citar

Edição

Seção

Licença

Desenvolvido por

Enviar Submissão

Idioma