Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall Database

Both extreme-excess modeling and extreme-value analysis of precipitation events frequently utilize the Generalized Pareto (GP) distribution to model peaks above a selected threshold. However, selecting an appropriate threshold remains a complex and challenging task, which has discouraged many practi...

Full description

Saved in:
Bibliographic Details
Main Author: Roberto Mínguez
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Atmosphere
Subjects:
Online Access:https://www.mdpi.com/2073-4433/16/1/61
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832589134771781632
author Roberto Mínguez
author_facet Roberto Mínguez
author_sort Roberto Mínguez
collection DOAJ
description Both extreme-excess modeling and extreme-value analysis of precipitation events frequently utilize the Generalized Pareto (GP) distribution to model peaks above a selected threshold. However, selecting an appropriate threshold remains a complex and challenging task, which has discouraged many practitioners from employing Pareto or Pareto–Poisson distributions for extreme-value analysis. Recent analyses of threshold selection methods proposed in the technical literature, particularly when applied to rainfall records with high quantization levels, have shown that nonparametric methods are often unreliable. Additionally, methods relying on the asymptotic properties of the GP distribution tend to produce unrealistically high threshold estimates. In contrast, graphical methods and goodness-of-fit (GoF) metrics that account for the pre-asymptotic behavior of the GP distribution have demonstrated better performance. Despite these improvements, there remains no automatic and statistically robust methodology for threshold selection. This study develops an automatic, statistically sound procedure for optimal threshold selection, leveraging weighted mean square errors and internally studentized residuals. The proposed method outperforms existing approaches in terms of accuracy, as demonstrated through numerical experiments and its application to real-world data from the NOAA NCDC Daily Rainfall Database. Results indicate that the method not only improves threshold estimation precision but also enhances the reliability of extreme-value analysis for precipitation records, making it a valuable tool for hydrological applications. The findings emphasize the practical implications of the method for analyzing extreme rainfall events and its potential for broader climatological studies.
format Article
id doaj-art-9f51946863d148b7bc26842fb27826ed
institution Kabale University
issn 2073-4433
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Atmosphere
spelling doaj-art-9f51946863d148b7bc26842fb27826ed2025-01-24T13:21:53ZengMDPI AGAtmosphere2073-44332025-01-011616110.3390/atmos16010061Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall DatabaseRoberto Mínguez0Department of Statistics, Universidad Carlos III de Madrid, Av.Universidad 30, 28911 Leganés, SpainBoth extreme-excess modeling and extreme-value analysis of precipitation events frequently utilize the Generalized Pareto (GP) distribution to model peaks above a selected threshold. However, selecting an appropriate threshold remains a complex and challenging task, which has discouraged many practitioners from employing Pareto or Pareto–Poisson distributions for extreme-value analysis. Recent analyses of threshold selection methods proposed in the technical literature, particularly when applied to rainfall records with high quantization levels, have shown that nonparametric methods are often unreliable. Additionally, methods relying on the asymptotic properties of the GP distribution tend to produce unrealistically high threshold estimates. In contrast, graphical methods and goodness-of-fit (GoF) metrics that account for the pre-asymptotic behavior of the GP distribution have demonstrated better performance. Despite these improvements, there remains no automatic and statistically robust methodology for threshold selection. This study develops an automatic, statistically sound procedure for optimal threshold selection, leveraging weighted mean square errors and internally studentized residuals. The proposed method outperforms existing approaches in terms of accuracy, as demonstrated through numerical experiments and its application to real-world data from the NOAA NCDC Daily Rainfall Database. Results indicate that the method not only improves threshold estimation precision but also enhances the reliability of extreme-value analysis for precipitation records, making it a valuable tool for hydrological applications. The findings emphasize the practical implications of the method for analyzing extreme rainfall events and its potential for broader climatological studies.https://www.mdpi.com/2073-4433/16/1/61exceedancesPareto distributionprecipitation extremesthreshold selection
spellingShingle Roberto Mínguez
Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall Database
Atmosphere
exceedances
Pareto distribution
precipitation extremes
threshold selection
title Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall Database
title_full Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall Database
title_fullStr Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall Database
title_full_unstemmed Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall Database
title_short Automatic Threshold Selection for Generalized Pareto and Pareto–Poisson Distributions in Rainfall Analysis: A Case Study Using the NOAA NCDC Daily Rainfall Database
title_sort automatic threshold selection for generalized pareto and pareto poisson distributions in rainfall analysis a case study using the noaa ncdc daily rainfall database
topic exceedances
Pareto distribution
precipitation extremes
threshold selection
url https://www.mdpi.com/2073-4433/16/1/61
work_keys_str_mv AT robertominguez automaticthresholdselectionforgeneralizedparetoandparetopoissondistributionsinrainfallanalysisacasestudyusingthenoaancdcdailyrainfalldatabase