A hybrid machine learning framework by incorporating categorical boosting and manifold learning for financial analysis

The financial analysis is essential to evaluate and assess the financial behavior and risk during the financial activities. However, it is challenging to implement the financial analysis due to the complexity of financial features and their interaction mechanism. This study developed a hybrid machin...

Full description

Saved in:
Bibliographic Details
Main Authors: Yuyang Zhao, Hongbo Zhao
Format: Article
Language:English
Published: Elsevier 2025-03-01
Series:Intelligent Systems with Applications
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2667305324001479
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850230722307031040
author Yuyang Zhao
Hongbo Zhao
author_facet Yuyang Zhao
Hongbo Zhao
author_sort Yuyang Zhao
collection DOAJ
description The financial analysis is essential to evaluate and assess the financial behavior and risk during the financial activities. However, it is challenging to implement the financial analysis due to the complexity of financial features and their interaction mechanism. This study developed a hybrid machine-learning framework incorporating categorical boosting (CatBoost) and manifold learning for financial analysis. CatBoost was employed to capture the financial mechanism and characterize the complex and nonlinear relationship between the financial feature and the associated financial behavior. Manifold learning was utilized to select and extract the critical financial features. The developed framework was verified and illustrated by the synthetic datasets, which are based on the financial model for the loan evaluation. The overall accuracy of the CatBoost model increased from 81.5 % to 99.1 %, and the accuracy for predicting unapproved loans increased from 64 % to 98.88 %. The developed framework significantly improves the prediction accuracy of loan-approved status and characterizes the financial behavior and mechanism well. The developed hybrid framework distinguishes between various financial features and the associated loan-approved status. Based on the developed framework, it also found that credit score and annual income are the two essential features, and the contribution of other features is almost negligible. The developed framework revealed that a credit score of 500 and an annual income of 70,000 are critical thresholds for loan approval, as set by the financial analysis model used to generate the dataset. The results show that the developed framework could extract the financial features and capture the financial mechanism during the financial analysis. It provides a scientific, reasonable, and promising approach to financial analysis and understanding financial behavior.
format Article
id doaj-art-99fd513497624e59ac5ec4601c3dfc16
institution OA Journals
issn 2667-3053
language English
publishDate 2025-03-01
publisher Elsevier
record_format Article
series Intelligent Systems with Applications
spelling doaj-art-99fd513497624e59ac5ec4601c3dfc162025-08-20T02:03:46ZengElsevierIntelligent Systems with Applications2667-30532025-03-012520047310.1016/j.iswa.2024.200473A hybrid machine learning framework by incorporating categorical boosting and manifold learning for financial analysisYuyang Zhao0Hongbo Zhao1Prologis Management LLC, Denver, 80202, USA; Corresponding author.School of Civil Engineering and Geomatics, Shandong University of Technology, Zibo, 255000, ChinaThe financial analysis is essential to evaluate and assess the financial behavior and risk during the financial activities. However, it is challenging to implement the financial analysis due to the complexity of financial features and their interaction mechanism. This study developed a hybrid machine-learning framework incorporating categorical boosting (CatBoost) and manifold learning for financial analysis. CatBoost was employed to capture the financial mechanism and characterize the complex and nonlinear relationship between the financial feature and the associated financial behavior. Manifold learning was utilized to select and extract the critical financial features. The developed framework was verified and illustrated by the synthetic datasets, which are based on the financial model for the loan evaluation. The overall accuracy of the CatBoost model increased from 81.5 % to 99.1 %, and the accuracy for predicting unapproved loans increased from 64 % to 98.88 %. The developed framework significantly improves the prediction accuracy of loan-approved status and characterizes the financial behavior and mechanism well. The developed hybrid framework distinguishes between various financial features and the associated loan-approved status. Based on the developed framework, it also found that credit score and annual income are the two essential features, and the contribution of other features is almost negligible. The developed framework revealed that a credit score of 500 and an annual income of 70,000 are critical thresholds for loan approval, as set by the financial analysis model used to generate the dataset. The results show that the developed framework could extract the financial features and capture the financial mechanism during the financial analysis. It provides a scientific, reasonable, and promising approach to financial analysis and understanding financial behavior.http://www.sciencedirect.com/science/article/pii/S2667305324001479Financial analysisFeature selectionManifold learningMachine learningCategorical boosting
spellingShingle Yuyang Zhao
Hongbo Zhao
A hybrid machine learning framework by incorporating categorical boosting and manifold learning for financial analysis
Intelligent Systems with Applications
Financial analysis
Feature selection
Manifold learning
Machine learning
Categorical boosting
title A hybrid machine learning framework by incorporating categorical boosting and manifold learning for financial analysis
title_full A hybrid machine learning framework by incorporating categorical boosting and manifold learning for financial analysis
title_fullStr A hybrid machine learning framework by incorporating categorical boosting and manifold learning for financial analysis
title_full_unstemmed A hybrid machine learning framework by incorporating categorical boosting and manifold learning for financial analysis
title_short A hybrid machine learning framework by incorporating categorical boosting and manifold learning for financial analysis
title_sort hybrid machine learning framework by incorporating categorical boosting and manifold learning for financial analysis
topic Financial analysis
Feature selection
Manifold learning
Machine learning
Categorical boosting
url http://www.sciencedirect.com/science/article/pii/S2667305324001479
work_keys_str_mv AT yuyangzhao ahybridmachinelearningframeworkbyincorporatingcategoricalboostingandmanifoldlearningforfinancialanalysis
AT hongbozhao ahybridmachinelearningframeworkbyincorporatingcategoricalboostingandmanifoldlearningforfinancialanalysis
AT yuyangzhao hybridmachinelearningframeworkbyincorporatingcategoricalboostingandmanifoldlearningforfinancialanalysis
AT hongbozhao hybridmachinelearningframeworkbyincorporatingcategoricalboostingandmanifoldlearningforfinancialanalysis