Development of data-driven algal bloom alert models with low temporal resolution data and application to Hong Kong rivers

Study region: The study focuses on the 12 rivers across Hong Kong, that have been facing varying degrees of algal bloom risks. Study focus: Data-driven models are extensively employed to forecast and discern the catalysts of algal blooms, leveraging high temporal resolution data to unveil intricate...

Full description

Saved in:
Bibliographic Details
Main Authors: Shujie Xu, Zhongnan Ye, Shu-Chien Hsu, Xiaoyi Liu, Chunmiao Zheng
Format: Article
Language:English
Published: Elsevier 2025-02-01
Series:Journal of Hydrology: Regional Studies
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2214581824004579
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832591854058602496
author Shujie Xu
Zhongnan Ye
Shu-Chien Hsu
Xiaoyi Liu
Chunmiao Zheng
author_facet Shujie Xu
Zhongnan Ye
Shu-Chien Hsu
Xiaoyi Liu
Chunmiao Zheng
author_sort Shujie Xu
collection DOAJ
description Study region: The study focuses on the 12 rivers across Hong Kong, that have been facing varying degrees of algal bloom risks. Study focus: Data-driven models are extensively employed to forecast and discern the catalysts of algal blooms, leveraging high temporal resolution data to unveil intricate numerical relationships among indicators. The efficacy of these models can be compromised due to the low temporal resolution of water quality monitoring. This study presents a data-driven algal bloom alert model employing machine learning and data discretization techniques to solve the data issue, utilizing monthly water quality data of 12 rivers and daily weather data in Hong Kong. The critical factors associated with algal blooms were then identified by feature importance analysis and false negative analysis of the optimal performance model, guiding potential algal control strategies for Hong Kong. New hydrological insights for the region: Models that integrate data discretization outperformed those using numerical normalization, showing higher recall scores and greater stability across selected algorithms (linear regression, support vector machine, random forest, and decision tree). Permutation importance analysis identified nitrogen compounds and rising temperatures as key triggers of algal blooms, while false negative analysis highlighted total phosphorus and flow as critical factors. This study offers a potential solution for prediction and forecasting without extensive datasets and emphasizes the importance of integrating domain knowledge in both prior and posterior analyses to enhance the interpretability of data-driven models in water science.
format Article
id doaj-art-8e90643867794cbea00cd788129398db
institution Kabale University
issn 2214-5818
language English
publishDate 2025-02-01
publisher Elsevier
record_format Article
series Journal of Hydrology: Regional Studies
spelling doaj-art-8e90643867794cbea00cd788129398db2025-01-22T05:42:03ZengElsevierJournal of Hydrology: Regional Studies2214-58182025-02-0157102108Development of data-driven algal bloom alert models with low temporal resolution data and application to Hong Kong riversShujie Xu0Zhongnan Ye1Shu-Chien Hsu2Xiaoyi Liu3Chunmiao Zheng4Department of Civil and Environmental Engineering, the Hong Kong Polytechnic University, Hong Kong SAR; Guangdong-Hong Kong Joint Laboratory for Soil and Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China; Guangdong Provincial Key Laboratory of Soil and Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, ChinaDepartment of Civil and Environmental Engineering, the Hong Kong Polytechnic University, Hong Kong SARDepartment of Civil and Environmental Engineering, the Hong Kong Polytechnic University, Hong Kong SAR; Corresponding author.Department of Civil and Environmental Engineering, the Hong Kong Polytechnic University, Hong Kong SARGuangdong-Hong Kong Joint Laboratory for Soil and Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China; Guangdong Provincial Key Laboratory of Soil and Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China; Eastern Institute for Advanced Study, Eastern Institute of Technology, Ningbo, China; Corresponding author at: Guangdong-Hong Kong Joint Laboratory for Soil and Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China.Study region: The study focuses on the 12 rivers across Hong Kong, that have been facing varying degrees of algal bloom risks. Study focus: Data-driven models are extensively employed to forecast and discern the catalysts of algal blooms, leveraging high temporal resolution data to unveil intricate numerical relationships among indicators. The efficacy of these models can be compromised due to the low temporal resolution of water quality monitoring. This study presents a data-driven algal bloom alert model employing machine learning and data discretization techniques to solve the data issue, utilizing monthly water quality data of 12 rivers and daily weather data in Hong Kong. The critical factors associated with algal blooms were then identified by feature importance analysis and false negative analysis of the optimal performance model, guiding potential algal control strategies for Hong Kong. New hydrological insights for the region: Models that integrate data discretization outperformed those using numerical normalization, showing higher recall scores and greater stability across selected algorithms (linear regression, support vector machine, random forest, and decision tree). Permutation importance analysis identified nitrogen compounds and rising temperatures as key triggers of algal blooms, while false negative analysis highlighted total phosphorus and flow as critical factors. This study offers a potential solution for prediction and forecasting without extensive datasets and emphasizes the importance of integrating domain knowledge in both prior and posterior analyses to enhance the interpretability of data-driven models in water science.http://www.sciencedirect.com/science/article/pii/S2214581824004579Algal bloomsData-driven modelsData discretizationWater management
spellingShingle Shujie Xu
Zhongnan Ye
Shu-Chien Hsu
Xiaoyi Liu
Chunmiao Zheng
Development of data-driven algal bloom alert models with low temporal resolution data and application to Hong Kong rivers
Journal of Hydrology: Regional Studies
Algal blooms
Data-driven models
Data discretization
Water management
title Development of data-driven algal bloom alert models with low temporal resolution data and application to Hong Kong rivers
title_full Development of data-driven algal bloom alert models with low temporal resolution data and application to Hong Kong rivers
title_fullStr Development of data-driven algal bloom alert models with low temporal resolution data and application to Hong Kong rivers
title_full_unstemmed Development of data-driven algal bloom alert models with low temporal resolution data and application to Hong Kong rivers
title_short Development of data-driven algal bloom alert models with low temporal resolution data and application to Hong Kong rivers
title_sort development of data driven algal bloom alert models with low temporal resolution data and application to hong kong rivers
topic Algal blooms
Data-driven models
Data discretization
Water management
url http://www.sciencedirect.com/science/article/pii/S2214581824004579
work_keys_str_mv AT shujiexu developmentofdatadrivenalgalbloomalertmodelswithlowtemporalresolutiondataandapplicationtohongkongrivers
AT zhongnanye developmentofdatadrivenalgalbloomalertmodelswithlowtemporalresolutiondataandapplicationtohongkongrivers
AT shuchienhsu developmentofdatadrivenalgalbloomalertmodelswithlowtemporalresolutiondataandapplicationtohongkongrivers
AT xiaoyiliu developmentofdatadrivenalgalbloomalertmodelswithlowtemporalresolutiondataandapplicationtohongkongrivers
AT chunmiaozheng developmentofdatadrivenalgalbloomalertmodelswithlowtemporalresolutiondataandapplicationtohongkongrivers