Huang, Shengyi, and Santiago Ontañón. “A Closer Look at Invalid Action Masking in Policy Gradient Algorithms.” The International FLAIRS Conference Proceedings 35 (May 4, 2022). Accessed February 15, 2025. https://journals.flvc.org/FLAIRS/article/view/130584.