Sami Haidar-Wehbe
Holdout Sets for Safe Predictive Model Updating
Haidar-Wehbe, Sami; Emerson, Samuel R; Aslett, Louis Jm; Liley, James
Authors
Sam Emerson samuel.r.emerson@durham.ac.uk
PGR Student Doctor of Philosophy
Dr Louis Aslett louis.aslett@durham.ac.uk
Associate Professor
Dr James Liley james.liley@durham.ac.uk
Assistant Professor
Abstract
Predictive risk scores for adverse outcomes are increasingly crucial in guiding health interventions. Such scores may need to be periodically updated due to change in the distributions they model. However, directly updating risk scores used to guide intervention can lead to biased risk estimates. To address this, we propose updating using a 'holdout set'-a subset of the population that does not receive interventions guided by the risk score. Balancing the holdout set size is essential to ensure good performance of the updated risk score whilst minimising the number of held out samples. We prove that this approach reduces adverse outcome frequency to an asymptot-ically optimal level and argue that often there is no competitive alternative. We describe conditions under which an optimal holdout size (OHS) can be readily identified, and introduce parametric and semi-parametric algorithms for OHS estimation. We apply our methods to the ASPRE risk score for pre-eclampsia to recommend a plan for updating it in the presence of change in the underlying data distribution. We show that, in order to minimise the number of pre-eclampsia cases over time, this is best achieved using a holdout set of around 10,000 individuals.
Citation
Haidar-Wehbe, S., Emerson, S. R., Aslett, L. J., & Liley, J. (in press). Holdout Sets for Safe Predictive Model Updating. Annals of Applied Statistics,
Journal Article Type | Article |
---|---|
Acceptance Date | Oct 23, 2024 |
Deposit Date | Oct 23, 2024 |
Journal | Annals of Applied Statistics |
Print ISSN | 1932-6157 |
Electronic ISSN | 1941-7330 |
Publisher | Institute of Mathematical Statistics |
Peer Reviewed | Peer Reviewed |
Public URL | https://durham-repository.worktribe.com/output/2980683 |
Publisher URL | https://projecteuclid.org/journals/annals-of-applied-statistics |
This file is under embargo due to copyright reasons.
You might also like
Ethical considerations of use of hold-out sets in clinical prediction model management
(2024)
Journal Article
kalis: a modern implementation of the Li & Stephens model for local ancestry inference in R
(2024)
Journal Article
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search