Geographically Aware Air Quality Prediction Through CNN-LSTM-KAN Hybrid Modeling with Climatic and Topographic Differentiation
Air pollution poses a pressing global challenge, particularly in rapidly industrializing nations like China where deteriorating air quality critically endangers public health and sustainable development. To address the heterogeneous patterns of air pollution across diverse geographical and climatic...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-04-01
|
| Series: | Atmosphere |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2073-4433/16/5/513 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Air pollution poses a pressing global challenge, particularly in rapidly industrializing nations like China where deteriorating air quality critically endangers public health and sustainable development. To address the heterogeneous patterns of air pollution across diverse geographical and climatic regions, this study proposes a novel CNN-LSTM-KAN hybrid deep learning framework for high-precision Air Quality Index (AQI) time-series prediction. Through systematic analysis of multi-city AQI datasets encompassing five representative Chinese metropolises—strategically selected to cover diverse climate zones (subtropical to temperate), geographical gradients (coastal to inland), and topographical variations (plains to mountains)—we established three principal methodological advancements. First, Shapiro–Wilk normality testing (<i>p</i> < 0.05) revealed non-Gaussian distribution characteristics in the observational data, providing statistical justification for implementing Gaussian filtering-based noise suppression. Second, our multi-regional validation framework extended beyond conventional single-city approaches, demonstrating model generalizability across distinct environmental contexts. Third, we innovatively integrated Kolmogorov–Arnold Networks (KANs) with attention mechanisms to replace traditional fully connected layers, achieving enhanced feature weighting capacity. Comparative experiments demonstrated superior performance with a 23.6–59.6% reduction in Root-Mean-Square Error (RMSE) relative to baseline LSTM models, along with consistent outperformance over CNN-LSTM hybrids. Cross-regional correlation analyses identified PM2.5/PM10 as dominant predictive factors. The developed model exhibited robust generalization capabilities across geographical divisions (R<sup>2</sup> = 0.92–0.99), establishing a reliable decision-support platform for regionally adaptive air quality early-warning systems. This methodological framework provides valuable insights for addressing spatial heterogeneity in environmental modeling applications. |
|---|---|
| ISSN: | 2073-4433 |