Home  | Publications | CK25a

Pre-Trained Nonresponse Prediction in Panel Surveys With Machine Learning

MCML Authors

Link to Profile Christoph Kern

Christoph Kern

Prof. Dr.

Associate

Abstract

While predictive modeling for unit nonresponse in panel surveys has been explored in various contexts, it is still under-researched how practitioners can best adopt these techniques. Currently, practitioners need to wait until they accumulate enough data in their panel to train and evaluate their own modeling options. This paper presents a novel “cross-training” technique in which we show that the indicators of nonresponse are so ubiquitous across studies that it is viable to train a model on one panel study and apply it to a different one. The practical benefit of this approach is that newly commencing panels can potentially make better nonresponse predictions in the early waves because these pre-trained models make use of more data. We demonstrate this technique with five panel surveys which encompass a variety of survey designs: the Socio-Economic Panel (SOEP), the German Internet Usage Panel (GIP), the GESIS Panel, the Mannheim Corona Study (MCS), and the Family Demographic Panel (FREDA). We demonstrate that nonresponse history and demographics, paired with tree-based modeling methods, make highly accurate and generalizable predictions across studies, despite differences in panel design. We show how cross-training can effectively predict nonresponse in early panel waves where attrition is typically highest.

article


Survey Research Methods

19.2. Aug. 2025.

Authors

J. Collins • C. Kern

Links

DOI

Research Area

 C4 | Computational Social Sciences

BibTeXKey: CK25a

Back to Top