Huang, Shengyi, and Santiago Ontañón. 2022. “A Closer Look at Invalid Action Masking in Policy Gradient Algorithms”. The International FLAIRS Conference Proceedings 35 (May). https://doi.org/10.32473/flairs.v35i.130584.