Towards Machine Learning Interpretability for Tabular Data with Mixed Data Types

Prativa Pokhrel; Alina Lazar

doi:10.32473/flairs.v35i.130611

Towards Machine Learning Interpretability for Tabular Data with Mixed Data Types

Authors

Prativa Pokhrel Youngstown State University
Dr. Alina Lazar Youngstown State University

DOI:

https://doi.org/10.32473/flairs.v35i.130611

Abstract

Gradient Boosting (GB) algorithms have been proposed for a variety of automated predictions and classification tasks with applications in many domains. These methods work faster and provide superior performance compared to deep learning methods when applied to tabular datasets. Another advantage is their interpretability. There are many machine learning methods that can train tabular data successfully, however, the inner workings are usually hidden from the user. In this context, SHAP values combined with GB methods, increase model transparency and provide not only consistent feature rankings but also show the contributions of the predictors for individual instances. In this work, we train multiple GB models using several tabular datasets and compare the result in terms of speed, performance, and the global and local models' interpretability.

Downloads

Published

04-05-2022

How to Cite

Pokhrel, P., & Lazar, A. (2022). Towards Machine Learning Interpretability for Tabular Data with Mixed Data Types. The International FLAIRS Conference Proceedings, 35. https://doi.org/10.32473/flairs.v35i.130611