
23.07.2025


How Reliable Are Machine Learning Methods? With Anne-Laure Boulesteix and Milena Wünsch

Research Film

New machine learning methods routinely claim to outperform their predecessors. Whether in bioinformatics, finance, or image recognition, the message is the same: this algorithm is faster, more accurate, more powerful. But can we trust those claims?

«It’s not just about the algorithms. It’s about how we compare them—and what we choose to report or ignore.»


Milena Wünsch

MCML Junior Member

Beneath the surface of many benchmarking studies lies a quiet problem: subtle biases that skew comparisons and inflate performance. These issues often go unnoticed — but they can have real consequences, especially when such models are used to inform research or high-stakes decisions.

«It doesn’t matter whether the bias is deliberate or not. It still shapes how methods are judged and used.»


Anne-Laure Boulesteix

MCML PI

Anne-Laure Boulesteix, Professor of Biometry at LMU and MCML PI, and Milena Wünsch, PhD student at LMU and MCML, study how seemingly harmless methodological choices can lead to misleading results.

One common issue: when a method fails on a dataset, researchers may simply drop it from the analysis. While convenient, this can introduce bias and overstate performance.
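A small sketch can make the effect concrete. The numbers, the method names, and the fallback score of 0.5 below are hypothetical and chosen only for illustration. They are not taken from the researchers' work, and scoring failures with a fallback is just one possible way to account for them, not necessarily the one the researchers recommend.

```python
from statistics import mean

# Hypothetical accuracies for two methods on five datasets.
# None marks a run that failed (for example, it crashed or did not converge).
results = {
    "method_a": [0.80, 0.82, 0.78, 0.81, 0.79],
    "method_b": [0.90, 0.88, None, None, 0.89],
}


def mean_dropping_failures(scores):
    """Average only over the datasets where the method produced a result."""
    return mean(s for s in scores if s is not None)


def mean_with_fallback(scores, fallback):
    """Average over all datasets, scoring each failed run as `fallback`."""
    return mean(fallback if s is None else s for s in scores)


if __name__ == "__main__":
    for name, scores in results.items():
        dropped = mean_dropping_failures(scores)
        # Assumption: a failed run is scored at 0.5, an illustrative
        # chance-level accuracy for a balanced binary task.
        penalised = mean_with_fallback(scores, fallback=0.5)
        print(f"{name}: failures dropped = {dropped:.3f}, "
              f"failures scored at 0.5 = {penalised:.3f}")
```

In this toy setup, method_b looks clearly better than method_a when its two failed runs are silently left out, and worse once those failures are counted. Which treatment is appropriate depends on the study, but the choice should be reported rather than hidden.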

Bias can also arise from less obvious sources — like spending more time tuning one method, being more familiar with a tool, or unconsciously interpreting results in its favor.

With so many studies promoting the “next best” algorithm, it’s hard to know which results to trust. Practitioners may end up using a method that only looked good due to biased comparisons. Still, Boulesteix and Wünsch are hopeful. In recent years, the methodological machine learning community has made real progress, pushing for better standards, more transparency, and more careful benchmarking.

The film was produced and edited by Nicole Huminski and Nikolai Huber.

 
