Insights from Optimized Non-Landslide Sampling and SHAP Explainability for Landslide Susceptibility Prediction

The quality of sampling data critically influences landslide susceptibility prediction accuracy. Current studies commonly use a 1:1 ratio of landslide to non-landslide samples, failing to reflect natural geographical variability. This study develops a region-specific framework by integrating SHAP (S...

Full description

Saved in:
Bibliographic Details
Main Authors: Mengyuan Li, Hongling Tian
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/3/1163
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The quality of sampling data critically influences landslide susceptibility prediction accuracy. Current studies commonly use a 1:1 ratio of landslide to non-landslide samples, failing to reflect natural geographical variability. This study develops a region-specific framework by integrating SHAP (SHapley Additive exPlanation) analysis with twelve landslide conditioning factors (LCFs) and three progressive sampling strategies, aiming to create adaptive non-landslide point selection criteria tailored to unique environmental and geological characteristics. The strategies include (1) multi-ratio random sampling (1:1 to 1:200), (2) susceptibility-based sampling adjustments derived from pre-susceptibility analysis, and (3) LCF-based correction using the NDVI threshold identified through SHAP analysis. Results show that LCF-based correction achieved the highest performance, while a 1:5 ratio proved optimal in random sampling, aligning with regional characteristics. This framework demonstrates the importance of region-specific sampling strategies in improving landslide susceptibility prediction.
ISSN:2076-3417