Geographically Aware Air Quality Prediction Through CNN-LSTM-KAN Hybrid Modeling with Climatic and Topographic Differentiation

Air pollution poses a pressing global challenge, particularly in rapidly industrializing nations like China where deteriorating air quality critically endangers public health and sustainable development. To address the heterogeneous patterns of air pollution across diverse geographical and climatic...

Full description

Saved in:
Bibliographic Details
Main Authors: Yue Hu, Yitong Ding, Wenjing Jiang
Format: Article
Language:English
Published: MDPI AG 2025-04-01
Series:Atmosphere
Subjects:
Online Access:https://www.mdpi.com/2073-4433/16/5/513
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Air pollution poses a pressing global challenge, particularly in rapidly industrializing nations like China where deteriorating air quality critically endangers public health and sustainable development. To address the heterogeneous patterns of air pollution across diverse geographical and climatic regions, this study proposes a novel CNN-LSTM-KAN hybrid deep learning framework for high-precision Air Quality Index (AQI) time-series prediction. Through systematic analysis of multi-city AQI datasets encompassing five representative Chinese metropolises—strategically selected to cover diverse climate zones (subtropical to temperate), geographical gradients (coastal to inland), and topographical variations (plains to mountains)—we established three principal methodological advancements. First, Shapiro–Wilk normality testing (<i>p</i> < 0.05) revealed non-Gaussian distribution characteristics in the observational data, providing statistical justification for implementing Gaussian filtering-based noise suppression. Second, our multi-regional validation framework extended beyond conventional single-city approaches, demonstrating model generalizability across distinct environmental contexts. Third, we innovatively integrated Kolmogorov–Arnold Networks (KANs) with attention mechanisms to replace traditional fully connected layers, achieving enhanced feature weighting capacity. Comparative experiments demonstrated superior performance with a 23.6–59.6% reduction in Root-Mean-Square Error (RMSE) relative to baseline LSTM models, along with consistent outperformance over CNN-LSTM hybrids. Cross-regional correlation analyses identified PM2.5/PM10 as dominant predictive factors. The developed model exhibited robust generalization capabilities across geographical divisions (R<sup>2</sup> = 0.92–0.99), establishing a reliable decision-support platform for regionally adaptive air quality early-warning systems. This methodological framework provides valuable insights for addressing spatial heterogeneity in environmental modeling applications.
ISSN:2073-4433