Multiclass Classification of Solar Flares in Imbalanced Data Using Ensemble Learning and Sampling Methods

Authors

  • Haodi Jiang Sam Houston State University https://orcid.org/0000-0001-6460-408X
  • Ryoma Matsuura University of California, Los Angeles
  • Jason T. L. Wang New Jersey Institute of Technology

DOI:

https://doi.org/10.32473/flairs.37.1.135365

Abstract

Solar flares are intense bursts of radiation across the electromagnetic spectrum on the surface of the Sun. They are categorized into four classes: B, C, M, and X, depending on their intensity, with X-class flares being the strongest. Being able to predict a flare’s class before its occurrence is critical for anticipating the severity of its impact on Earth. We used the Space-weather HMI Active Region Patches (SHARP) parameters available from Stanford’s Joint Science Operations Center (JSOC) to train machine learning models to classify these flares. However, predicting the flare class is a challenging task, as it is a multiclass classification problem
involving imbalanced data due to the small number of X-class flares in a solar cycle. We propose a new method that uses a combination of random undersampling and the synthetic minority oversampling technique (SMOTE) to combat the imbalanced data problem. Furthermore, we develop an ensemble algorithm that uses nine classifiers as base learners and logistic regression as meta-learner. Experimental results show that the proposed method is effective in predicting solar flares, especially the most intense X-class flares, within the next 24 hours.

Downloads

Published

13-05-2024

How to Cite

Jiang, H., Matsuura, R., & Wang, J. T. L. (2024). Multiclass Classification of Solar Flares in Imbalanced Data Using Ensemble Learning and Sampling Methods. The International FLAIRS Conference Proceedings, 37(1). https://doi.org/10.32473/flairs.37.1.135365