Home  | Publications | SZB+25

Examining Marginal Properness in the External Validation of Survival Models With Squared and Logarithmic Losses

MCML Authors

Abstract

Scoring rules promote rational and honest decision-making, which is important for model evaluation and becoming increasingly important for automated procedures such as 'AutoML'. In this paper we survey common squared and logarithmic scoring rules for survival analysis, with a focus on their theoretical and empirical properness. We introduce a marginal definition of properness and show that both the Integrated Survival Brier Score (ISBS) and the Right-Censored Log-Likelihood (RCLL) are theoretically improper under this definition. We also investigate a new class of losses that may inform future survival scoring rules. Simulation experiments reveal that both the ISBS and RCLL behave as proper scoring rules in practice. The RCLL showed no violations across all settings, while ISBS exhibited only minor, negligible violations at extremely small sample sizes, suggesting one can trust results from historical experiments. As such we advocate for both the RCLL and ISBS in external validation of models, including in automated procedures. However, we note practical challenges in estimating these losses including estimation of censoring distributions and densities; as such further research is required to advance development of robust and honest evaluation in survival analysis.

misc


Preprint

May. 2025

Authors

R. Sonabend • J. Zobolas • R. Bin • P. Kopper • L. BurkA. Bender

Links


In Collaboration

partnerlogo

Research Area

 A1 | Statistical Foundations & Explainability

BibTeXKey: SZB+25

Back to Top