Random Cross-Validation Produces Biased Assessment of Machine Learning Performance in Regional Landslide Susceptibility Prediction

Machine learning (ML) models are extensively used in spatial predictive modeling, including landslide susceptibility prediction. The performance statistics of these models are vital for assessing their reliability, which is typically obtained using the random cross-validation (R-CV) method. However,...

Full description

Saved in:

Bibliographic Details
Main Authors:	Chandan Kumar, Gabriel Walton, Paul Santi, Carlos Luza
Format:	Article
Language:	English
Published:	MDPI AG 2025-01-01
Series:	Remote Sensing
Subjects:	landslide susceptibility mapping machine learning random cross-validation spatial autocorrelation spatial cross-validation
Online Access:	https://www.mdpi.com/2072-4292/17/2/213
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832587594753376256
author	Chandan Kumar Gabriel Walton Paul Santi Carlos Luza
author_facet	Chandan Kumar Gabriel Walton Paul Santi Carlos Luza
author_sort	Chandan Kumar
collection	DOAJ
description	Machine learning (ML) models are extensively used in spatial predictive modeling, including landslide susceptibility prediction. The performance statistics of these models are vital for assessing their reliability, which is typically obtained using the random cross-validation (R-CV) method. However, R-CV has a major drawback, i.e., it ignores the spatial autocorrelation (SAC) inherent in spatial datasets when partitioning the training and testing sets. We assessed the impact of SAC at three crucial phases of ML modeling: hyperparameter tuning, performance evaluation, and learning curve analysis. As an alternative to R-CV, we used spatial cross-validation (S-CV). This method considers SAC when partitioning the training and testing subsets. This experiment was conducted on regional landslide susceptibility prediction using different ML models: logistic regression (LR), k-nearest neighbor (KNN), linear discriminant analysis (LDA), artificial neural networks (ANN), support vector machine (SVM), random forest (RF), and C5.0. The experimental results showed that R-CV often produces optimistic performance estimates, e.g., 6–18% higher than those obtained using the S-CV. R-CV also occasionally fails to reveal the true importance of the hyperparameters of models such as SVM and ANN. Additionally, R-CV falsely portrays a considerable improvement in model performance as the number of variables increases. However, this was not the case when the models were evaluated using S-CV. The impact of SAC was more noticeable in complex models such as SVM, RF, and C5.0 (except for ANN) than in simple models such as LDA and LR (except for KNN). Overall, we recommend S-CV over R-CV for a reliable assessment of ML model performance in large-scale LSM.
format	Article
id	doaj-art-bc4bfcef8be04bc6981e192574e91ef2
institution	Kabale University
issn	2072-4292
language	English
publishDate	2025-01-01
publisher	MDPI AG
record_format	Article
series	Remote Sensing
spelling	doaj-art-bc4bfcef8be04bc6981e192574e91ef22025-01-24T13:47:45ZengMDPI AGRemote Sensing2072-42922025-01-0117221310.3390/rs17020213Random Cross-Validation Produces Biased Assessment of Machine Learning Performance in Regional Landslide Susceptibility PredictionChandan Kumar0Gabriel Walton1Paul Santi2Carlos Luza3Department of Geology and Geological Engineering, Colorado School of Mines, Golden, CO 80401, USADepartment of Geology and Geological Engineering, Colorado School of Mines, Golden, CO 80401, USADepartment of Geology and Geological Engineering, Colorado School of Mines, Golden, CO 80401, USADepartment of Geology, Geophysics and Mines, Universidad Nacional de San Agustín, Arequipa 04000, PeruMachine learning (ML) models are extensively used in spatial predictive modeling, including landslide susceptibility prediction. The performance statistics of these models are vital for assessing their reliability, which is typically obtained using the random cross-validation (R-CV) method. However, R-CV has a major drawback, i.e., it ignores the spatial autocorrelation (SAC) inherent in spatial datasets when partitioning the training and testing sets. We assessed the impact of SAC at three crucial phases of ML modeling: hyperparameter tuning, performance evaluation, and learning curve analysis. As an alternative to R-CV, we used spatial cross-validation (S-CV). This method considers SAC when partitioning the training and testing subsets. This experiment was conducted on regional landslide susceptibility prediction using different ML models: logistic regression (LR), k-nearest neighbor (KNN), linear discriminant analysis (LDA), artificial neural networks (ANN), support vector machine (SVM), random forest (RF), and C5.0. The experimental results showed that R-CV often produces optimistic performance estimates, e.g., 6–18% higher than those obtained using the S-CV. R-CV also occasionally fails to reveal the true importance of the hyperparameters of models such as SVM and ANN. Additionally, R-CV falsely portrays a considerable improvement in model performance as the number of variables increases. However, this was not the case when the models were evaluated using S-CV. The impact of SAC was more noticeable in complex models such as SVM, RF, and C5.0 (except for ANN) than in simple models such as LDA and LR (except for KNN). Overall, we recommend S-CV over R-CV for a reliable assessment of ML model performance in large-scale LSM.https://www.mdpi.com/2072-4292/17/2/213landslide susceptibility mappingmachine learningrandom cross-validationspatial autocorrelationspatial cross-validation
spellingShingle	Chandan Kumar Gabriel Walton Paul Santi Carlos Luza Random Cross-Validation Produces Biased Assessment of Machine Learning Performance in Regional Landslide Susceptibility Prediction Remote Sensing landslide susceptibility mapping machine learning random cross-validation spatial autocorrelation spatial cross-validation
title	Random Cross-Validation Produces Biased Assessment of Machine Learning Performance in Regional Landslide Susceptibility Prediction
title_full	Random Cross-Validation Produces Biased Assessment of Machine Learning Performance in Regional Landslide Susceptibility Prediction
title_fullStr	Random Cross-Validation Produces Biased Assessment of Machine Learning Performance in Regional Landslide Susceptibility Prediction
title_full_unstemmed	Random Cross-Validation Produces Biased Assessment of Machine Learning Performance in Regional Landslide Susceptibility Prediction
title_short	Random Cross-Validation Produces Biased Assessment of Machine Learning Performance in Regional Landslide Susceptibility Prediction
title_sort	random cross validation produces biased assessment of machine learning performance in regional landslide susceptibility prediction
topic	landslide susceptibility mapping machine learning random cross-validation spatial autocorrelation spatial cross-validation
url	https://www.mdpi.com/2072-4292/17/2/213
work_keys_str_mv	AT chandankumar randomcrossvalidationproducesbiasedassessmentofmachinelearningperformanceinregionallandslidesusceptibilityprediction AT gabrielwalton randomcrossvalidationproducesbiasedassessmentofmachinelearningperformanceinregionallandslidesusceptibilityprediction AT paulsanti randomcrossvalidationproducesbiasedassessmentofmachinelearningperformanceinregionallandslidesusceptibilityprediction AT carlosluza randomcrossvalidationproducesbiasedassessmentofmachinelearningperformanceinregionallandslidesusceptibilityprediction

Random Cross-Validation Produces Biased Assessment of Machine Learning Performance in Regional Landslide Susceptibility Prediction

Similar Items