Integrated Machine Learning Approaches for Landslide Susceptibility Mapping Along the Pakistan–China Karakoram Highway

The effectiveness of data-driven landslide susceptibility mapping relies on data integrity and advanced geospatial analysis; however, selecting the most suitable method and identifying key regional factors remains a challenging task. To address this, this study assessed the performance of six machin...

Full description

Saved in:
Bibliographic Details
Main Authors: Mohib Ullah, Haijun Qiu, Wenchao Huangfu, Dongdong Yang, Yingdong Wei, Bingzhe Tang
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Land
Subjects:
Online Access:https://www.mdpi.com/2073-445X/14/1/172
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832588128478560256
author Mohib Ullah
Haijun Qiu
Wenchao Huangfu
Dongdong Yang
Yingdong Wei
Bingzhe Tang
author_facet Mohib Ullah
Haijun Qiu
Wenchao Huangfu
Dongdong Yang
Yingdong Wei
Bingzhe Tang
author_sort Mohib Ullah
collection DOAJ
description The effectiveness of data-driven landslide susceptibility mapping relies on data integrity and advanced geospatial analysis; however, selecting the most suitable method and identifying key regional factors remains a challenging task. To address this, this study assessed the performance of six machine learning models, including Convolutional Neural Networks (CNNs), Random Forest (RF), Categorical Boosting (CatBoost), their CNN-based hybrid models (CNN+RF and CNN+CatBoost), and a Stacking Ensemble (SE) combining CNN, RF, and CatBoost in mapping landslide susceptibility along the Karakoram Highway in northern Pakistan. Twelve geospatial factors were examined, categorized into Topography/Geomorphology, Land Cover/Vegetation, Geology, Hydrology, and Anthropogenic Influence. A detailed landslide inventory of 272 occurrences was compiled to train the models. The proposed stacking ensemble and hybrid models improve landslide susceptibility modeling, with the stacking ensemble achieving an AUC of 0.91. Hybrid modeling enhances accuracy, with CNN–RF boosting RF’s AUC from 0.85 to 0.89 and CNN–CatBoost increasing CatBoost’s AUC from 0.87 to 0.90. Chi-square (χ<sup>2</sup>) values (9.8–21.2) and <i>p</i>-values (<0.005) confirm statistical significance across models. This study identifies approximately 20.70% of the area as from high to very high risk, with the SE model excelling in detecting high-risk zones. Key factors influencing landslide susceptibility showed slight variations across the models, while multicollinearity among variables remained minimal. The proposed modeling approach reduces uncertainties, enhances prediction accuracy, and supports decision-makers in implementing effective landslide mitigation strategies.
format Article
id doaj-art-54c5590592bc49b4b1dc9c57a49b598b
institution Kabale University
issn 2073-445X
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Land
spelling doaj-art-54c5590592bc49b4b1dc9c57a49b598b2025-01-24T13:38:11ZengMDPI AGLand2073-445X2025-01-0114117210.3390/land14010172Integrated Machine Learning Approaches for Landslide Susceptibility Mapping Along the Pakistan–China Karakoram HighwayMohib Ullah0Haijun Qiu1Wenchao Huangfu2Dongdong Yang3Yingdong Wei4Bingzhe Tang5Shaanxi Key Laboratory of Earth Surface and Environmental Carrying Capacity, College of Urban and Environmental Sciences, Northwest University, Xi’an 710127, ChinaShaanxi Key Laboratory of Earth Surface and Environmental Carrying Capacity, College of Urban and Environmental Sciences, Northwest University, Xi’an 710127, ChinaShaanxi Key Laboratory of Earth Surface and Environmental Carrying Capacity, College of Urban and Environmental Sciences, Northwest University, Xi’an 710127, ChinaShaanxi Key Laboratory of Earth Surface and Environmental Carrying Capacity, College of Urban and Environmental Sciences, Northwest University, Xi’an 710127, ChinaShaanxi Key Laboratory of Earth Surface and Environmental Carrying Capacity, College of Urban and Environmental Sciences, Northwest University, Xi’an 710127, ChinaShaanxi Key Laboratory of Earth Surface and Environmental Carrying Capacity, College of Urban and Environmental Sciences, Northwest University, Xi’an 710127, ChinaThe effectiveness of data-driven landslide susceptibility mapping relies on data integrity and advanced geospatial analysis; however, selecting the most suitable method and identifying key regional factors remains a challenging task. To address this, this study assessed the performance of six machine learning models, including Convolutional Neural Networks (CNNs), Random Forest (RF), Categorical Boosting (CatBoost), their CNN-based hybrid models (CNN+RF and CNN+CatBoost), and a Stacking Ensemble (SE) combining CNN, RF, and CatBoost in mapping landslide susceptibility along the Karakoram Highway in northern Pakistan. Twelve geospatial factors were examined, categorized into Topography/Geomorphology, Land Cover/Vegetation, Geology, Hydrology, and Anthropogenic Influence. A detailed landslide inventory of 272 occurrences was compiled to train the models. The proposed stacking ensemble and hybrid models improve landslide susceptibility modeling, with the stacking ensemble achieving an AUC of 0.91. Hybrid modeling enhances accuracy, with CNN–RF boosting RF’s AUC from 0.85 to 0.89 and CNN–CatBoost increasing CatBoost’s AUC from 0.87 to 0.90. Chi-square (χ<sup>2</sup>) values (9.8–21.2) and <i>p</i>-values (<0.005) confirm statistical significance across models. This study identifies approximately 20.70% of the area as from high to very high risk, with the SE model excelling in detecting high-risk zones. Key factors influencing landslide susceptibility showed slight variations across the models, while multicollinearity among variables remained minimal. The proposed modeling approach reduces uncertainties, enhances prediction accuracy, and supports decision-makers in implementing effective landslide mitigation strategies.https://www.mdpi.com/2073-445X/14/1/172landslide susceptibilityKarakoram Highwaymachine learning modelsgeo-environmental factorsensemble models
spellingShingle Mohib Ullah
Haijun Qiu
Wenchao Huangfu
Dongdong Yang
Yingdong Wei
Bingzhe Tang
Integrated Machine Learning Approaches for Landslide Susceptibility Mapping Along the Pakistan–China Karakoram Highway
Land
landslide susceptibility
Karakoram Highway
machine learning models
geo-environmental factors
ensemble models
title Integrated Machine Learning Approaches for Landslide Susceptibility Mapping Along the Pakistan–China Karakoram Highway
title_full Integrated Machine Learning Approaches for Landslide Susceptibility Mapping Along the Pakistan–China Karakoram Highway
title_fullStr Integrated Machine Learning Approaches for Landslide Susceptibility Mapping Along the Pakistan–China Karakoram Highway
title_full_unstemmed Integrated Machine Learning Approaches for Landslide Susceptibility Mapping Along the Pakistan–China Karakoram Highway
title_short Integrated Machine Learning Approaches for Landslide Susceptibility Mapping Along the Pakistan–China Karakoram Highway
title_sort integrated machine learning approaches for landslide susceptibility mapping along the pakistan china karakoram highway
topic landslide susceptibility
Karakoram Highway
machine learning models
geo-environmental factors
ensemble models
url https://www.mdpi.com/2073-445X/14/1/172
work_keys_str_mv AT mohibullah integratedmachinelearningapproachesforlandslidesusceptibilitymappingalongthepakistanchinakarakoramhighway
AT haijunqiu integratedmachinelearningapproachesforlandslidesusceptibilitymappingalongthepakistanchinakarakoramhighway
AT wenchaohuangfu integratedmachinelearningapproachesforlandslidesusceptibilitymappingalongthepakistanchinakarakoramhighway
AT dongdongyang integratedmachinelearningapproachesforlandslidesusceptibilitymappingalongthepakistanchinakarakoramhighway
AT yingdongwei integratedmachinelearningapproachesforlandslidesusceptibilitymappingalongthepakistanchinakarakoramhighway
AT bingzhetang integratedmachinelearningapproachesforlandslidesusceptibilitymappingalongthepakistanchinakarakoramhighway