23.07.2025
How Reliable Are Machine Learning Methods? With Anne-Laure Boulesteix and Milena Wünsch
Research Film
Often a new machine learning method claims to outperform the last. Whether it’s in bioinformatics, finance, or image recognition, the message is the same: this algorithm is faster, more accurate, more powerful. But can we trust those claims?
«It’s not just about the algorithms. It’s about how we compare them—and what we choose to report or ignore.»
Milena Wünsch
MCML Junior Member
Beneath the surface of many benchmarking studies lies a quiet problem: subtle biases that skew comparisons and inflate performance. These issues often go unnoticed — but they can have real consequences, especially when such models are used to inform research or high-stakes decisions.
«It doesn’t matter whether the bias is deliberate or not. It still shapes how methods are judged and used.»
Anne-Laure Boulesteix
MCML PI
Anne-Laure Boulesteix, Professor of Biometry at LMU and MCML PI, and Milena Wünsch, PhD student at LMU and MCML, study how seemingly harmless methodological choices can lead to misleading results.
One common issue: when a method fails on a dataset, researchers may simply drop it from the analysis. While convenient, this can introduce bias and overstate performance.
Bias can also arise from less obvious sources — like spending more time tuning one method, being more familiar with a tool, or unconsciously interpreting results in its favor.
With so many studies promoting the “next best” algorithm, it’s hard to know which results to trust. Researchers may end up using a method that only looked good due to biased comparisons. Still, the researchers are hopeful. In recent years, the methodological machine learning community has made real progress — pushing for better standards, more transparency, and more careful benchmarking.
©MCML
The film was produced and edited by Nicole Huminski and Nikolai Huber.
Related
22.01.2026
From Global to Regional Explanations: Understanding Models More Locally
JMLR research shows how subgroup-specific explanations reveal hidden patterns that global model explanations often miss.
©MCML
22.01.2026
Research Stay at University of St. Gallen
Andrea Maldonado spent two weeks at HSG with the MCML AI X-Change Program, researching ambiguity in IoT-based process mining.
20.01.2026
MCML PI Matthias Nießner Featured in WirtschaftsWoche on Spaitial AI
MCML PI Matthias Nießner featured in WirtschaftsWoche for Spaitial AI, creating realistic 3D models of rooms and interiors in seconds.
19.01.2026
MCML at AAAI 2026
MCML researchers are represented with 11 papers at AAAI 2026 (8 Main, and 3 Workshops).