Model Optimization Analysis of Customer Churn Prediction Using Machine Learning Algorithms with Focus on Feature Reductions

Currently, Customers are struggling to retain their business in today’s competitive markets. Thus, the issue of customer churn becomes a significant challenge for the industries. In order to achieve this, it is vital to have an efficient churn prediction system. In this paper, we discuss methods for...

Full description

Saved in:
Bibliographic Details
Main Authors: Seyed Mohammad Sina Mirabdolbaghi, Babak Amiri
Format: Article
Language:English
Published: Wiley 2022-01-01
Series:Discrete Dynamics in Nature and Society
Online Access:http://dx.doi.org/10.1155/2022/5134356
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832552904224931840
author Seyed Mohammad Sina Mirabdolbaghi
Babak Amiri
author_facet Seyed Mohammad Sina Mirabdolbaghi
Babak Amiri
author_sort Seyed Mohammad Sina Mirabdolbaghi
collection DOAJ
description Currently, Customers are struggling to retain their business in today’s competitive markets. Thus, the issue of customer churn becomes a significant challenge for the industries. In order to achieve this, it is vital to have an efficient churn prediction system. In this paper, we discuss methods for reducing features using PCA, Autoencoders, LDA, T-SNE, and Xgboost. In this paper, a model for predicting light GBM churn is proposed. The model consists of five steps. The first step is to preprocess the data so that missing and corrupt values can be handled and the data can be scaled. Secondly, implementing a comprehensive feature reduction system based on popular algorithms reduces the features and selects the most suitable one. In the third step, light GBM’s hyperparameter is tuned using Bayesian hyperparameter optimization and genetic optimization algorithms. Lastly, interpreting the model and evaluating the impact of the features on model outputs by using the SHAP method, and finally ranking the churners by customer lifetime value. Aside from evaluating and choosing the best feature reduction methods, the proposed method is also evaluated using four famous datasets. It outperforms other ensemble and ML algorithms like AdaBoost, SVM, and decision tree on over seven evaluation metrics: accuracy, area under the curve (AUC), Kappa, Mathews correlation coefficient (MCC), Brier score, F1 score, and EMPC. In light of the evaluation metrics, our model shows a significant improvement in handling imbalanced datasets in churn prediction. Finally, in this paper, interpretability and how the features affect the model’s output are presented by the SHAP method. Then CLV ranking is suggested for better decision-making.
format Article
id doaj-art-38d5eafbf8a94389b3777289ffe22257
institution Kabale University
issn 1607-887X
language English
publishDate 2022-01-01
publisher Wiley
record_format Article
series Discrete Dynamics in Nature and Society
spelling doaj-art-38d5eafbf8a94389b3777289ffe222572025-02-03T05:57:28ZengWileyDiscrete Dynamics in Nature and Society1607-887X2022-01-01202210.1155/2022/5134356Model Optimization Analysis of Customer Churn Prediction Using Machine Learning Algorithms with Focus on Feature ReductionsSeyed Mohammad Sina Mirabdolbaghi0Babak Amiri1Department of Industrial EngineeringIndustrial Engineering SchoolCurrently, Customers are struggling to retain their business in today’s competitive markets. Thus, the issue of customer churn becomes a significant challenge for the industries. In order to achieve this, it is vital to have an efficient churn prediction system. In this paper, we discuss methods for reducing features using PCA, Autoencoders, LDA, T-SNE, and Xgboost. In this paper, a model for predicting light GBM churn is proposed. The model consists of five steps. The first step is to preprocess the data so that missing and corrupt values can be handled and the data can be scaled. Secondly, implementing a comprehensive feature reduction system based on popular algorithms reduces the features and selects the most suitable one. In the third step, light GBM’s hyperparameter is tuned using Bayesian hyperparameter optimization and genetic optimization algorithms. Lastly, interpreting the model and evaluating the impact of the features on model outputs by using the SHAP method, and finally ranking the churners by customer lifetime value. Aside from evaluating and choosing the best feature reduction methods, the proposed method is also evaluated using four famous datasets. It outperforms other ensemble and ML algorithms like AdaBoost, SVM, and decision tree on over seven evaluation metrics: accuracy, area under the curve (AUC), Kappa, Mathews correlation coefficient (MCC), Brier score, F1 score, and EMPC. In light of the evaluation metrics, our model shows a significant improvement in handling imbalanced datasets in churn prediction. Finally, in this paper, interpretability and how the features affect the model’s output are presented by the SHAP method. Then CLV ranking is suggested for better decision-making.http://dx.doi.org/10.1155/2022/5134356
spellingShingle Seyed Mohammad Sina Mirabdolbaghi
Babak Amiri
Model Optimization Analysis of Customer Churn Prediction Using Machine Learning Algorithms with Focus on Feature Reductions
Discrete Dynamics in Nature and Society
title Model Optimization Analysis of Customer Churn Prediction Using Machine Learning Algorithms with Focus on Feature Reductions
title_full Model Optimization Analysis of Customer Churn Prediction Using Machine Learning Algorithms with Focus on Feature Reductions
title_fullStr Model Optimization Analysis of Customer Churn Prediction Using Machine Learning Algorithms with Focus on Feature Reductions
title_full_unstemmed Model Optimization Analysis of Customer Churn Prediction Using Machine Learning Algorithms with Focus on Feature Reductions
title_short Model Optimization Analysis of Customer Churn Prediction Using Machine Learning Algorithms with Focus on Feature Reductions
title_sort model optimization analysis of customer churn prediction using machine learning algorithms with focus on feature reductions
url http://dx.doi.org/10.1155/2022/5134356
work_keys_str_mv AT seyedmohammadsinamirabdolbaghi modeloptimizationanalysisofcustomerchurnpredictionusingmachinelearningalgorithmswithfocusonfeaturereductions
AT babakamiri modeloptimizationanalysisofcustomerchurnpredictionusingmachinelearningalgorithmswithfocusonfeaturereductions