HUANG, Shengyi; ONTAÑÓN, Santiago. A Closer Look at Invalid Action Masking in Policy Gradient Algorithms. The International FLAIRS Conference Proceedings, [S. l.], v. 35, 2022. DOI: 10.32473/flairs.v35i.130584. Disponível em: https://journals.flvc.org/FLAIRS/article/view/130584. Acesso em: 21 nov. 2024.