On the estimation of inverse-probability-of-censoring weights for the evaluation of survival prediction error.

Inverse probability weighting (IPW) is a popular method for making inferences regarding unobserved or unobservable data of a target population based on observed data. This paper considers IPW applied to right-censored time-to-event data. We investigate the behavior of the inverse-probability-of-cens...

Full description

Saved in:

Bibliographic Details
Main Authors:	Thomas Prince, Andrea Bommert, Jörg Rahnenführer, Matthias Schmid
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2025-01-01
Series:	PLoS ONE
Online Access:	https://doi.org/10.1371/journal.pone.0318349
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832540281723944960
author	Thomas Prince Andrea Bommert Jörg Rahnenführer Matthias Schmid
author_facet	Thomas Prince Andrea Bommert Jörg Rahnenführer Matthias Schmid
author_sort	Thomas Prince
collection	DOAJ
description	Inverse probability weighting (IPW) is a popular method for making inferences regarding unobserved or unobservable data of a target population based on observed data. This paper considers IPW applied to right-censored time-to-event data. We investigate the behavior of the inverse-probability-of-censoring weighted (IPCW) Brier score, which is frequently used to assess the predictive performance of time-to-event models. A key requirement of the IPCW Brier score is the estimation of the censoring distribution, which is needed to compute the weights. The established paradigm of splitting a dataset into a training and a test set for model fitting and evaluation raises the question which of these datasets to use in order to fit the censoring model. There seems to be considerable disagreement between authors with regards to this issue, and no standard has been established so far. To shed light on this important question, we conducted a comprehensive experimental study exploring various data scenarios and estimation schemes. We found that it is generally of little importance which dataset is used to model the censoring distribution. However, in some circumstances, such as in the case of a covariate-dependent censoring process, a small sample size, or when dealing with noisy data, it may be advisable to use the test set instead of the training set to model the censoring distribution. A detailed set of practical recommendations concludes our paper.
format	Article
id	doaj-art-ca930d73b9274ff39fed78b4057e629b
institution	Kabale University
issn	1932-6203
language	English
publishDate	2025-01-01
publisher	Public Library of Science (PLoS)
record_format	Article
series	PLoS ONE
spelling	doaj-art-ca930d73b9274ff39fed78b4057e629b2025-02-05T05:31:40ZengPublic Library of Science (PLoS)PLoS ONE1932-62032025-01-01201e031834910.1371/journal.pone.0318349On the estimation of inverse-probability-of-censoring weights for the evaluation of survival prediction error.Thomas PrinceAndrea BommertJörg RahnenführerMatthias SchmidInverse probability weighting (IPW) is a popular method for making inferences regarding unobserved or unobservable data of a target population based on observed data. This paper considers IPW applied to right-censored time-to-event data. We investigate the behavior of the inverse-probability-of-censoring weighted (IPCW) Brier score, which is frequently used to assess the predictive performance of time-to-event models. A key requirement of the IPCW Brier score is the estimation of the censoring distribution, which is needed to compute the weights. The established paradigm of splitting a dataset into a training and a test set for model fitting and evaluation raises the question which of these datasets to use in order to fit the censoring model. There seems to be considerable disagreement between authors with regards to this issue, and no standard has been established so far. To shed light on this important question, we conducted a comprehensive experimental study exploring various data scenarios and estimation schemes. We found that it is generally of little importance which dataset is used to model the censoring distribution. However, in some circumstances, such as in the case of a covariate-dependent censoring process, a small sample size, or when dealing with noisy data, it may be advisable to use the test set instead of the training set to model the censoring distribution. A detailed set of practical recommendations concludes our paper.https://doi.org/10.1371/journal.pone.0318349
spellingShingle	Thomas Prince Andrea Bommert Jörg Rahnenführer Matthias Schmid On the estimation of inverse-probability-of-censoring weights for the evaluation of survival prediction error. PLoS ONE
title	On the estimation of inverse-probability-of-censoring weights for the evaluation of survival prediction error.
title_full	On the estimation of inverse-probability-of-censoring weights for the evaluation of survival prediction error.
title_fullStr	On the estimation of inverse-probability-of-censoring weights for the evaluation of survival prediction error.
title_full_unstemmed	On the estimation of inverse-probability-of-censoring weights for the evaluation of survival prediction error.
title_short	On the estimation of inverse-probability-of-censoring weights for the evaluation of survival prediction error.
title_sort	on the estimation of inverse probability of censoring weights for the evaluation of survival prediction error
url	https://doi.org/10.1371/journal.pone.0318349
work_keys_str_mv	AT thomasprince ontheestimationofinverseprobabilityofcensoringweightsfortheevaluationofsurvivalpredictionerror AT andreabommert ontheestimationofinverseprobabilityofcensoringweightsfortheevaluationofsurvivalpredictionerror AT jorgrahnenfuhrer ontheestimationofinverseprobabilityofcensoringweightsfortheevaluationofsurvivalpredictionerror AT matthiasschmid ontheestimationofinverseprobabilityofcensoringweightsfortheevaluationofsurvivalpredictionerror

On the estimation of inverse-probability-of-censoring weights for the evaluation of survival prediction error.

Similar Items