On the estimation of inverse-probability-of-censoring weights for the evaluation of survival prediction error.

Inverse probability weighting (IPW) is a popular method for making inferences regarding unobserved or unobservable data of a target population based on observed data. This paper considers IPW applied to right-censored time-to-event data. We investigate the behavior of the inverse-probability-of-cens...

Bibliographic Details
Main Authors: Thomas Prince, Andrea Bommert, Jörg Rahnenführer, Matthias Schmid
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2025-01-01
Series: PLoS ONE
Online Access: https://doi.org/10.1371/journal.pone.0318349
collection DOAJ
description Inverse probability weighting (IPW) is a popular method for making inferences regarding unobserved or unobservable data of a target population based on observed data. This paper considers IPW applied to right-censored time-to-event data. We investigate the behavior of the inverse-probability-of-censoring weighted (IPCW) Brier score, which is frequently used to assess the predictive performance of time-to-event models. A key requirement of the IPCW Brier score is the estimation of the censoring distribution, which is needed to compute the weights. The established paradigm of splitting a dataset into a training and a test set for model fitting and evaluation raises the question of which of these datasets to use to fit the censoring model. There seems to be considerable disagreement between authors with regard to this issue, and no standard has been established so far. To shed light on this important question, we conducted a comprehensive experimental study exploring various data scenarios and estimation schemes. We found that it is generally of little importance which dataset is used to model the censoring distribution. However, in some circumstances, such as in the case of a covariate-dependent censoring process, a small sample size, or when dealing with noisy data, it may be advisable to use the test set instead of the training set to model the censoring distribution. A detailed set of practical recommendations concludes our paper.
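To make the question raised in the description concrete, the following is a minimal Python sketch of an IPCW Brier score in which the censoring distribution can be fitted on either the training set or the test set. It is not taken from the paper. It assumes the commonly used form of the score, a covariate-free Kaplan-Meier estimate of the censoring survival function G, and a simple convention for tied times. The function names `km_censoring` and `ipcw_brier` and all data values are hypothetical.

```python
from bisect import bisect_left, bisect_right


def km_censoring(times, events):
    """Kaplan-Meier estimate of the censoring survival function G(t) = P(C > t).

    events[i] is 1 for an observed event and 0 for a censored observation,
    so censoring (events[i] == 0) plays the role of the 'event' here.
    Assumption: at tied times, everyone observed at that time counts as at risk.
    """
    data = sorted(zip(times, events))
    n = len(data)
    step_times, step_values = [], []
    g, i = 1.0, 0
    while i < n:
        t = data[i][0]
        at_risk = n - i
        n_censored = 0
        j = i
        while j < n and data[j][0] == t:
            n_censored += 1 - data[j][1]
            j += 1
        if n_censored > 0:
            g *= 1.0 - n_censored / at_risk
            step_times.append(t)
            step_values.append(g)
        i = j

    def G(t, left_limit=False):
        # Count the steps at times <= t, or < t for the left limit G(t-).
        k = bisect_left(step_times, t) if left_limit else bisect_right(step_times, t)
        return step_values[k - 1] if k > 0 else 1.0

    return G


def ipcw_brier(t, times, events, surv_pred, G):
    """IPCW Brier score at time t.

    surv_pred[i] is the model's predicted probability of surviving beyond t.
    Observations censored before t get weight zero. No guard against G == 0
    is included, which can occur when G is fitted on a different dataset.
    """
    total = 0.0
    for T, d, s in zip(times, events, surv_pred):
        if T <= t and d == 1:
            total += s ** 2 / G(T, left_limit=True)
        elif T > t:
            total += (1.0 - s) ** 2 / G(t)
    return total / len(times)


# Hypothetical toy data: (time, event indicator) for a training and a test set.
train_times, train_events = [1, 2, 3, 5, 6, 8], [1, 0, 1, 0, 1, 1]
test_times, test_events = [1, 2, 3, 4], [1, 0, 1, 1]
test_pred = [0.2, 0.3, 0.8, 0.9]  # predicted P(T > 2.5) for the test subjects

G_train = km_censoring(train_times, train_events)
G_test = km_censoring(test_times, test_events)

bs_train_weights = ipcw_brier(2.5, test_times, test_events, test_pred, G_train)
bs_test_weights = ipcw_brier(2.5, test_times, test_events, test_pred, G_test)
print(bs_train_weights, bs_test_weights)
```

The two calls to `ipcw_brier` differ only in which dataset supplied the censoring estimate, which is the choice the study compares. A covariate-dependent censoring model would replace `km_censoring` with a regression-type estimate of G.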
id doaj-art-ca930d73b9274ff39fed78b4057e629b
institution Kabale University
issn 1932-6203
spelling doaj-art-ca930d73b9274ff39fed78b4057e629b | 2025-02-05T05:31:40Z | eng | Public Library of Science (PLoS) | PLoS ONE | 1932-6203 | 2025-01-01 | 20 | 1 | e0318349 | 10.1371/journal.pone.0318349