Genetic algorithm feature selection resilient to increasing amounts of data imputation
DOI:
https://doi.org/10.32473/flairs.37.1.135723Palabras clave:
Genetic Algorithm, Feature SelectionResumen
This paper investigates the robustness of a genetic algorithm (GA) in feature selection across a dataset with increasing imputed missing values.
Feature selection can be beneficial in predictive modeling to reduce computational costs and potentially improve performance. Beyond these benefits, it also enables a clearer understanding of the algorithm's decision-making processes. In the context of real-world datasets that can contain missing values, feature selection becomes more challenging.
A robust feature selection algorithm should be able to identify the key features despite missing data values.
We investigate the effectiveness of this approach against two other feature selection algorithms on a dataset with increasingly imputed values to determine whether it can sustain good performance with only the selected features.
Our results reveal that compared to the other two methods, the features selected by GA resulted in better classification performance across different imputation rates and methods.
Descargas
Publicado
Cómo citar
Número
Sección
Licencia
Derechos de autor 2024 Maryam Kebari, Annie S Wu
Esta obra está bajo una licencia internacional Creative Commons Atribución-NoComercial 4.0.