Genetic algorithm feature selection resilient to increasing amounts of data imputation

Autores/as

  • Maryam Kebari University of Central Florida
  • Annie S Wu

DOI:

https://doi.org/10.32473/flairs.37.1.135723

Palabras clave:

Genetic Algorithm, Feature Selection

Resumen

This paper investigates the robustness of a genetic algorithm (GA) in feature selection across a dataset with increasing imputed missing values.
Feature selection can be beneficial in predictive modeling to reduce computational costs and potentially improve performance. Beyond these benefits, it also enables a clearer understanding of the algorithm's decision-making processes. In the context of real-world datasets that can contain missing values, feature selection becomes more challenging.
A robust feature selection algorithm should be able to identify the key features despite missing data values.
We investigate the effectiveness of this approach against two other feature selection algorithms on a dataset with increasingly imputed values to determine whether it can sustain good performance with only the selected features.
Our results reveal that compared to the other two methods, the features selected by GA resulted in better classification performance across different imputation rates and methods.

Descargas

Publicado

2024-05-12

Cómo citar

Kebari, M., & Wu, A. S. (2024). Genetic algorithm feature selection resilient to increasing amounts of data imputation. The International FLAIRS Conference Proceedings, 37(1). https://doi.org/10.32473/flairs.37.1.135723