Scalable GNN Training for Track Finding

Authors

  • Bipana Bista Youngstown State University
  • Sree Sai Charan Vaitla Youngstown State University
  • Alina Lazar Youngstown State University https://orcid.org/0000-0002-2096-1541

DOI:

https://doi.org/10.32473/flairs.39.1.141823

Keywords:

Graph Neural Networks (GNNs), Distributed Data Parallelism (DDP), Particle Track Reconstruction, High-Energy Physics (HEP)

Abstract

Graph Neural Networks (GNNs) are widely used for particle track finding in High-Energy Physics but are computationally expensive to train on large graph datasets. We study Distributed Data Parallelism (DDP) for accelerating GNN training across multiple GPUs and analyze its impact on runtime and convergence. We evaluate both strong and weak scaling behavior and show that while DDP substantially reduces training time, speedup saturates at larger GPU counts due to communication overhead. In addition, increasing the number of GPUs degrades validation efficiency due to growth in effective batch size. We demonstrate that learning-rate scaling partially mitigates this degradation. Results on the TrackML dataset highlight a trade-off between throughput and model quality that must be addressed for scalable GNN training.

Downloads

Published

06-05-2026

How to Cite

Bista, B., Vaitla, S. S. C., & Lazar, A. (2026). Scalable GNN Training for Track Finding. The International FLAIRS Conference Proceedings, 39(1). https://doi.org/10.32473/flairs.39.1.141823