C-SHAP: A Hybrid Method for Fast and Efficient Interpretability
Model interpretability is essential in machine learning, particularly for applications in critical fields like healthcare, where understanding model decisions is paramount. While SHAP (SHapley Additive exPlanations) has proven to be a robust tool for explaining machine learning predictions, its high...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2025-01-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/15/2/672 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832589262237728768 |
---|---|
author | Golshid Ranjbaran Diego Reforgiato Recupero Chanchal K. Roy Kevin A. Schneider |
author_facet | Golshid Ranjbaran Diego Reforgiato Recupero Chanchal K. Roy Kevin A. Schneider |
author_sort | Golshid Ranjbaran |
collection | DOAJ |
description | Model interpretability is essential in machine learning, particularly for applications in critical fields like healthcare, where understanding model decisions is paramount. While SHAP (SHapley Additive exPlanations) has proven to be a robust tool for explaining machine learning predictions, its high computational cost limits its practicality for real-time use. To address this, we introduce C-SHAP (Clustering-Boosted SHAP), a hybrid method that combines SHAP with K-means clustering to reduce execution times significantly while preserving interpretability. C-SHAP excels across various datasets and machine learning methods, matching SHAP’s accuracy in selected features while maintaining an accuracy of 0.73 for Random Forest with substantially faster performance. Notably, in the Diabetes dataset collected by the National Institute of Diabetes and Digestive and Kidney Diseases, C-SHAP reduces the execution time from nearly 2000 s to just 0.21 s, underscoring its potential for scalable, efficient interpretability in time-sensitive applications. Such advancements in interpretability and efficiency may hold value for enhancing decision-making within software-intensive systems, aligning with evolving engineering approaches. |
format | Article |
id | doaj-art-63ebcfbe8ac449ce91563537f0f1378e |
institution | Kabale University |
issn | 2076-3417 |
language | English |
publishDate | 2025-01-01 |
publisher | MDPI AG |
record_format | Article |
series | Applied Sciences |
spelling | doaj-art-63ebcfbe8ac449ce91563537f0f1378e2025-01-24T13:20:23ZengMDPI AGApplied Sciences2076-34172025-01-0115267210.3390/app15020672C-SHAP: A Hybrid Method for Fast and Efficient InterpretabilityGolshid Ranjbaran0Diego Reforgiato Recupero1Chanchal K. Roy2Kevin A. Schneider3Department of Computer Science, University of Saskatchewan, Saskatoon, SK S7N 5A2, CanadaDepartment of Mathematics and Computer Science, University of Cagliari, 09124 Cagliari, ItalyDepartment of Computer Science, University of Saskatchewan, Saskatoon, SK S7N 5A2, CanadaDepartment of Computer Science, University of Saskatchewan, Saskatoon, SK S7N 5A2, CanadaModel interpretability is essential in machine learning, particularly for applications in critical fields like healthcare, where understanding model decisions is paramount. While SHAP (SHapley Additive exPlanations) has proven to be a robust tool for explaining machine learning predictions, its high computational cost limits its practicality for real-time use. To address this, we introduce C-SHAP (Clustering-Boosted SHAP), a hybrid method that combines SHAP with K-means clustering to reduce execution times significantly while preserving interpretability. C-SHAP excels across various datasets and machine learning methods, matching SHAP’s accuracy in selected features while maintaining an accuracy of 0.73 for Random Forest with substantially faster performance. Notably, in the Diabetes dataset collected by the National Institute of Diabetes and Digestive and Kidney Diseases, C-SHAP reduces the execution time from nearly 2000 s to just 0.21 s, underscoring its potential for scalable, efficient interpretability in time-sensitive applications. Such advancements in interpretability and efficiency may hold value for enhancing decision-making within software-intensive systems, aligning with evolving engineering approaches.https://www.mdpi.com/2076-3417/15/2/672interpretability in machine learninginterpretabilitySHAPLIMEC-SHAPcomputational efficiency |
spellingShingle | Golshid Ranjbaran Diego Reforgiato Recupero Chanchal K. Roy Kevin A. Schneider C-SHAP: A Hybrid Method for Fast and Efficient Interpretability Applied Sciences interpretability in machine learning interpretability SHAP LIME C-SHAP computational efficiency |
title | C-SHAP: A Hybrid Method for Fast and Efficient Interpretability |
title_full | C-SHAP: A Hybrid Method for Fast and Efficient Interpretability |
title_fullStr | C-SHAP: A Hybrid Method for Fast and Efficient Interpretability |
title_full_unstemmed | C-SHAP: A Hybrid Method for Fast and Efficient Interpretability |
title_short | C-SHAP: A Hybrid Method for Fast and Efficient Interpretability |
title_sort | c shap a hybrid method for fast and efficient interpretability |
topic | interpretability in machine learning interpretability SHAP LIME C-SHAP computational efficiency |
url | https://www.mdpi.com/2076-3417/15/2/672 |
work_keys_str_mv | AT golshidranjbaran cshapahybridmethodforfastandefficientinterpretability AT diegoreforgiatorecupero cshapahybridmethodforfastandefficientinterpretability AT chanchalkroy cshapahybridmethodforfastandefficientinterpretability AT kevinaschneider cshapahybridmethodforfastandefficientinterpretability |