Huang, Shengyi, and Santiago Ontañón. “A Closer Look at Invalid Action Masking in Policy Gradient Algorithms”. The International FLAIRS Conference Proceedings 35 (May 4, 2022). Accessed April 23, 2024. https://journals.flvc.org/FLAIRS/article/view/130584.