As the application of machine learning methods continues to gain relevance in the field of official statistics, the accurate estimation of model performance becomes increasingly crucial. Resampling methods, such as cross-validation and bootstrap, offer powerful tools for estimating the generalization error of predictive models. This chapter explores the concept of generalization error and delves into the intricate challenges associated with conducting inference on it.Building upon the foundational insights, we provide a comprehensive survey of existing methodologies in the literature, offering assessments and recommendations for both uncertainty estimation via confidence intervals and the derivation of point estimates within the context of nonstandard data structures, in particular, non-i.i.d. data.
article SBF+25a
BibTeXKey: SBF+25a