[1] S. Huang and S. Ontañón, “A Closer Look at Invalid Action Masking in Policy Gradient Algorithms”, FLAIRS, vol. 35, May 2022.