Home  | Publications | BPB20a

Collecting Empirical Data About Hyperparameters for Data Driven AutoML

MCML Authors

Abstract

All optimization needs some kind of prior over the functions it is optimizing over. We used a large computing cluster to collect empirical data about the behavior of ML performance, by randomly sampling hyperparameter values and performing cross-validation. We also collected information about cross-validation error by performing some evaluations multiple times, and information about progression of performance with respect to training data size by performing some evaluations on data subsets. We present how we collected data, make some preliminary analyses on the surrogate models that can be built with them, and give an outlook over interesting analyses this should enable.

inproceedings


AutoML @ICML 2020

7th Workshop on Automated Machine Learning at the 37th International Conference on Machine Learning. Virtual, Jul 18, 2020.

Authors

M. BinderF. PfistererB. Bischl

Links

PDF

Research Area

 A1 | Statistical Foundations & Explainability

BibTeXKey: BPB20a

Back to Top