Exploration of transfer learning techniques for the prediction of PM10
Abstract Modelling of pollutants provides valuable insights into air quality dynamics, aiding exposure assessment where direct measurements are not viable. Machine learning (ML) models can be employed to explore such dynamics, including the prediction of air pollution concentrations, yet demanding e...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2025-01-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-025-86550-6 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832585840380870656 |
---|---|
author | Michael Poelzl Roman Kern Simonas Kecorius Mario Lovrić |
author_facet | Michael Poelzl Roman Kern Simonas Kecorius Mario Lovrić |
author_sort | Michael Poelzl |
collection | DOAJ |
description | Abstract Modelling of pollutants provides valuable insights into air quality dynamics, aiding exposure assessment where direct measurements are not viable. Machine learning (ML) models can be employed to explore such dynamics, including the prediction of air pollution concentrations, yet demanding extensive training data. To address this, techniques like transfer learning (TL) leverage knowledge from a model trained on a rich dataset to enhance one trained on a sparse dataset, provided there are similarities in data distribution. In our experimental setup, we utilize meteorological and pollutant data from multiple governmental air quality measurement stations in Graz, Austria, supplemented by data from one station in Zagreb, Croatia to simulate data scarcity. Common ML models such as Random Forests, Multilayer Perceptrons, Long-Short-Term Memory, and Convolutional Neural Networks are explored to predict particulate matter in both cities. Our detailed analysis of PM10 suggests that similarities between the cities and the meteorological features exist and can be further exploited. Hence, TL appears to offer a viable approach to enhance PM10 predictions for the Zagreb station, despite the challenges posed by data scarcity. Our results demonstrate the feasibility of different TL techniques to improve particulate matter prediction on transferring a ML model trained from all stations of Graz and transferred to Zagreb. Through our investigation, we discovered that selectively choosing time spans based on seasonal patterns not only aids in reducing the amount of data needed for successful TL but also significantly improves prediction performance. Specifically, training a Random Forest model using data from all measurement stations in Graz and transferring it with only 20% of the labelled data from Zagreb resulted in a 22% enhancement compared to directly testing the trained model on Zagreb. |
format | Article |
id | doaj-art-453ba8c440264817ad8cac5c06b73f2f |
institution | Kabale University |
issn | 2045-2322 |
language | English |
publishDate | 2025-01-01 |
publisher | Nature Portfolio |
record_format | Article |
series | Scientific Reports |
spelling | doaj-art-453ba8c440264817ad8cac5c06b73f2f2025-01-26T12:25:43ZengNature PortfolioScientific Reports2045-23222025-01-0115111310.1038/s41598-025-86550-6Exploration of transfer learning techniques for the prediction of PM10Michael Poelzl0Roman Kern1Simonas Kecorius2Mario Lovrić3Institute of Interactive Systems and Data Science, Graz University of TechnologyInstitute of Interactive Systems and Data Science, Graz University of TechnologyInstitute of Epidemiology, Helmholtz Zentrum München - German Research Center for Environmental HealthInstitute for Anthropological ResearchAbstract Modelling of pollutants provides valuable insights into air quality dynamics, aiding exposure assessment where direct measurements are not viable. Machine learning (ML) models can be employed to explore such dynamics, including the prediction of air pollution concentrations, yet demanding extensive training data. To address this, techniques like transfer learning (TL) leverage knowledge from a model trained on a rich dataset to enhance one trained on a sparse dataset, provided there are similarities in data distribution. In our experimental setup, we utilize meteorological and pollutant data from multiple governmental air quality measurement stations in Graz, Austria, supplemented by data from one station in Zagreb, Croatia to simulate data scarcity. Common ML models such as Random Forests, Multilayer Perceptrons, Long-Short-Term Memory, and Convolutional Neural Networks are explored to predict particulate matter in both cities. Our detailed analysis of PM10 suggests that similarities between the cities and the meteorological features exist and can be further exploited. Hence, TL appears to offer a viable approach to enhance PM10 predictions for the Zagreb station, despite the challenges posed by data scarcity. Our results demonstrate the feasibility of different TL techniques to improve particulate matter prediction on transferring a ML model trained from all stations of Graz and transferred to Zagreb. Through our investigation, we discovered that selectively choosing time spans based on seasonal patterns not only aids in reducing the amount of data needed for successful TL but also significantly improves prediction performance. Specifically, training a Random Forest model using data from all measurement stations in Graz and transferring it with only 20% of the labelled data from Zagreb resulted in a 22% enhancement compared to directly testing the trained model on Zagreb.https://doi.org/10.1038/s41598-025-86550-6 |
spellingShingle | Michael Poelzl Roman Kern Simonas Kecorius Mario Lovrić Exploration of transfer learning techniques for the prediction of PM10 Scientific Reports |
title | Exploration of transfer learning techniques for the prediction of PM10 |
title_full | Exploration of transfer learning techniques for the prediction of PM10 |
title_fullStr | Exploration of transfer learning techniques for the prediction of PM10 |
title_full_unstemmed | Exploration of transfer learning techniques for the prediction of PM10 |
title_short | Exploration of transfer learning techniques for the prediction of PM10 |
title_sort | exploration of transfer learning techniques for the prediction of pm10 |
url | https://doi.org/10.1038/s41598-025-86550-6 |
work_keys_str_mv | AT michaelpoelzl explorationoftransferlearningtechniquesforthepredictionofpm10 AT romankern explorationoftransferlearningtechniquesforthepredictionofpm10 AT simonaskecorius explorationoftransferlearningtechniquesforthepredictionofpm10 AT mariolovric explorationoftransferlearningtechniquesforthepredictionofpm10 |