Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall Database
Both extreme-excess modeling and extreme-value analysis of precipitation events frequently utilize the Generalized Pareto (GP) distribution to model peaks above a selected threshold. However, selecting an appropriate threshold remains a complex and challenging task, which has discouraged many practi...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2025-01-01
|
Series: | Atmosphere |
Subjects: | |
Online Access: | https://www.mdpi.com/2073-4433/16/1/61 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832589134771781632 |
---|---|
author | Roberto Mínguez |
author_facet | Roberto Mínguez |
author_sort | Roberto Mínguez |
collection | DOAJ |
description | Both extreme-excess modeling and extreme-value analysis of precipitation events frequently utilize the Generalized Pareto (GP) distribution to model peaks above a selected threshold. However, selecting an appropriate threshold remains a complex and challenging task, which has discouraged many practitioners from employing Pareto or Pareto–Poisson distributions for extreme-value analysis. Recent analyses of threshold selection methods proposed in the technical literature, particularly when applied to rainfall records with high quantization levels, have shown that nonparametric methods are often unreliable. Additionally, methods relying on the asymptotic properties of the GP distribution tend to produce unrealistically high threshold estimates. In contrast, graphical methods and goodness-of-fit (GoF) metrics that account for the pre-asymptotic behavior of the GP distribution have demonstrated better performance. Despite these improvements, there remains no automatic and statistically robust methodology for threshold selection. This study develops an automatic, statistically sound procedure for optimal threshold selection, leveraging weighted mean square errors and internally studentized residuals. The proposed method outperforms existing approaches in terms of accuracy, as demonstrated through numerical experiments and its application to real-world data from the NOAA NCDC Daily Rainfall Database. Results indicate that the method not only improves threshold estimation precision but also enhances the reliability of extreme-value analysis for precipitation records, making it a valuable tool for hydrological applications. The findings emphasize the practical implications of the method for analyzing extreme rainfall events and its potential for broader climatological studies. |
format | Article |
id | doaj-art-9f51946863d148b7bc26842fb27826ed |
institution | Kabale University |
issn | 2073-4433 |
language | English |
publishDate | 2025-01-01 |
publisher | MDPI AG |
record_format | Article |
series | Atmosphere |
spelling | doaj-art-9f51946863d148b7bc26842fb27826ed2025-01-24T13:21:53ZengMDPI AGAtmosphere2073-44332025-01-011616110.3390/atmos16010061Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall DatabaseRoberto Mínguez0Department of Statistics, Universidad Carlos III de Madrid, Av.Universidad 30, 28911 Leganés, SpainBoth extreme-excess modeling and extreme-value analysis of precipitation events frequently utilize the Generalized Pareto (GP) distribution to model peaks above a selected threshold. However, selecting an appropriate threshold remains a complex and challenging task, which has discouraged many practitioners from employing Pareto or Pareto–Poisson distributions for extreme-value analysis. Recent analyses of threshold selection methods proposed in the technical literature, particularly when applied to rainfall records with high quantization levels, have shown that nonparametric methods are often unreliable. Additionally, methods relying on the asymptotic properties of the GP distribution tend to produce unrealistically high threshold estimates. In contrast, graphical methods and goodness-of-fit (GoF) metrics that account for the pre-asymptotic behavior of the GP distribution have demonstrated better performance. Despite these improvements, there remains no automatic and statistically robust methodology for threshold selection. This study develops an automatic, statistically sound procedure for optimal threshold selection, leveraging weighted mean square errors and internally studentized residuals. The proposed method outperforms existing approaches in terms of accuracy, as demonstrated through numerical experiments and its application to real-world data from the NOAA NCDC Daily Rainfall Database. Results indicate that the method not only improves threshold estimation precision but also enhances the reliability of extreme-value analysis for precipitation records, making it a valuable tool for hydrological applications. The findings emphasize the practical implications of the method for analyzing extreme rainfall events and its potential for broader climatological studies.https://www.mdpi.com/2073-4433/16/1/61exceedancesPareto distributionprecipitation extremesthreshold selection |
spellingShingle | Roberto Mínguez Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall Database Atmosphere exceedances Pareto distribution precipitation extremes threshold selection |
title | Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall Database |
title_full | Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall Database |
title_fullStr | Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall Database |
title_full_unstemmed | Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall Database |
title_short | Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall Database |
title_sort | automatic threshold selection for generalized pareto and pareto poisson distributions in rainfall analysis a case study using the noaa ncdc daily rainfall database |
topic | exceedances Pareto distribution precipitation extremes threshold selection |
url | https://www.mdpi.com/2073-4433/16/1/61 |
work_keys_str_mv | AT robertominguez automaticthresholdselectionforgeneralizedparetoandparetopoissondistributionsinrainfallanalysisacasestudyusingthenoaancdcdailyrainfalldatabase |