Vadhera, Raghav, and Manfred Huber. “Learning Policies for Neural Network Architecture Optimization Using Reinforcement Learning”. The International FLAIRS Conference Proceedings 36, no. 1 (May 8, 2023). Accessed May 17, 2026. https://journals.flvc.org/FLAIRS/article/view/133380.