Home  | Events

08

Feb

Teaser image to The highs and lows of performance evaluation: Towards a measurement theory for machine learning

Munich AI Lectures

The Highs and Lows of Performance Evaluation: Towards a Measurement Theory for Machine Learning

Peter Flach, University of Bristol

   08.02.2023

   5:00 pm - 7:00 pm

   Livestream on YouTube

The understanding of performance evaluation measures for machine-learned classifiers has progressed, but gaps persist, leading to questionable evaluation practices. This raises concerns about the trustworthiness of systems utilizing these algorithms. To address this, a proper measurement theory of machine learning is proposed, akin to measurement theory in other domains. This theory would explore relevant concatenation operations in data science and AI and their implications for measurement scales. The need for latent-variable models is highlighted to capture key properties like classification ability and dataset difficulty, enabling causal inferences in machine learning experiments.


Related

Link to Distilling Heterogeneous Treatment Effects: Stable Subgroup Estimation in Causal Inference

AI Keynote Series  •  20.11.2025  •  Online via Zoom

Distilling Heterogeneous Treatment Effects: Stable Subgroup Estimation in Causal Inference

Join the lecture with Melody Huang from Political Science and Statistics & Data Science at Yale University.


Link to Personalized Care Through Causal & Federated Learning: From Data to Decisions

AI Keynote Series  •  13.11.2025  •  Online via Zoom

Personalized Care Through Causal & Federated Learning: From Data to Decisions

Join the lecture with Julie Josse from French National Instiute for Research in Digital Science and Technology (Inria).


Back to Top