Home | Research | Groups | Stefan Bauer

Research Group Stefan Bauer

Stefan Bauer

Prof. Dr.

Principal Investigator

A1 | Statistical Foundations & Explainability

Algorithmic Machine Learning & Explainable AI

Stefan Bauer

is an Associate Professor of Algorithmic Machine Learning & Explainable AI at TU Munich and senior PI at Helmholtz AI.

He works on developing algorithms that learn causal relationships from high-dimensional inputs, explain their decisions, and adapt quickly to new problems. All these requirements are key prerequisites for robust and transformative AI-based technologies with various downstream applications.

Team members @MCML

PostDocs

Andrea Dittadi

Dr.

A1 | Statistical Foundations & Explainability
→ Group Stefan Bauer

Algorithmic Machine Learning & Explainable AI

PhD Students

Emmanouil Angelis

A1 | Statistical Foundations & Explainability
→ Group Stefan Bauer

Algorithmic Machine Learning & Explainable AI

Vincent Pauline

A1 | Statistical Foundations & Explainability
→ Group Stefan Bauer

Algorithmic Machine Learning & Explainable AI

Publications @MCML

2025

[5]

A. Uselis, A. Dittadi and S. J. Oh.
Does Data Scaling Lead to Visual Compositional Generalization?
ICML 2025 - 42nd International Conference on Machine Learning. Vancouver, Canada, Jul 13-19, 2025. To be published. Preprint available. URL GitHub

Abstract

Compositional understanding is crucial for human intelligence, yet it remains unclear whether contemporary vision models exhibit it. The dominant machine learning paradigm is built on the premise that scaling data and model sizes will improve out-of-distribution performance, including compositional generalization. We test this premise through controlled experiments that systematically vary data scale, concept diversity, and combination coverage. We find that compositional generalization is driven by data diversity, not mere data scale. Increased combinatorial coverage forces models to discover a linearly factored representational structure, where concepts decompose into additive components. We prove this structure is key to efficiency, enabling perfect generalization from few observed combinations. Evaluating pretrained models (DINO, CLIP), we find above-random yet imperfect performance, suggesting partial presence of this structure. Our work motivates stronger emphasis on constructing diverse datasets for compositional generalization, and considering the importance of representational structure that enables efficient compositional learning.

MCML Authors

Andrea Dittadi

Dr.

A1 | Statistical Foundations & Explainability
→ Group Stefan Bauer

Algorithmic Machine Learning & Explainable AI

[4]

A. Tejada-Lapuerta, P. Bertin, S. Bauer, H. Aliee, Y. Bengio and F. J. Theis.
Causal machine learning for single-cell genomics.
Nature Genetics (Mar. 2025). DOI

Abstract

Advances in single-cell ‘-omics’ allow unprecedented insights into the transcriptional profiles of individual cells and, when combined with large-scale perturbation screens, enable measuring of the effect of targeted perturbations on the whole transcriptome. These advances provide an opportunity to better understand the causative role of genes in complex biological processes. In this Perspective, we delineate the application of causal machine learning to single-cell genomics and its associated challenges. We first present the causal model that is most commonly applied to single-cell biology and then identify and discuss potential approaches to three open problems: the lack of generalization of models to novel experimental conditions, the complexity of interpreting learned models, and the difficulty of learning cell dynamics.

MCML Authors

Stefan Bauer

Prof. Dr.

A1 | Statistical Foundations & Explainability

Algorithmic Machine Learning & Explainable AI

Fabian Theis

Prof. Dr.

C2 | Biology

Mathematical Modelling of Biological Systems

[3]

P. Bertin, J. D. Viviano, A. Tejada-Lapuerta, W. Wang, S. Bauer, F. J. Theis and Y. Bengio.
A scalable gene network model of regulatory dynamics in single cells.
Preprint (Mar. 2025). arXiv

Abstract

Single-cell data provide high-dimensional measurements of the transcriptional states of cells, but extracting insights into the regulatory functions of genes, particularly identifying transcriptional mechanisms affected by biological perturbations, remains a challenge. Many perturbations induce compensatory cellular responses, making it difficult to distinguish direct from indirect effects on gene regulation. Modeling how gene regulatory functions shape the temporal dynamics of these responses is key to improving our understanding of biological perturbations. Dynamical models based on differential equations offer a principled way to capture transcriptional dynamics, but their application to single-cell data has been hindered by computational constraints, stochasticity, sparsity, and noise. Existing methods either rely on low-dimensional representations or make strong simplifying assumptions, limiting their ability to model transcriptional dynamics at scale. We introduce a Functional and Learnable model of Cell dynamicS, FLeCS, that incorporates gene network structure into coupled differential equations to model gene regulatory functions. Given (pseudo)time-series single-cell data, FLeCS accurately infers cell dynamics at scale, provides improved functional insights into transcriptional mechanisms perturbed by gene knockouts, both in myeloid differentiation and K562 Perturb-seq experiments, and simulates single-cell trajectories of A549 cells following small-molecule perturbations.

MCML Authors

Stefan Bauer

Prof. Dr.

A1 | Statistical Foundations & Explainability

Algorithmic Machine Learning & Explainable AI

Fabian Theis

Prof. Dr.

C2 | Biology

Mathematical Modelling of Biological Systems

[2]

T. Willem, V. A. Shitov, M. D. Luecken, N. Kilbertus, S. Bauer, M. Piraud, A. Buyx and F. J. Theis.
Biases in machine-learning models of human single-cell data.
Nature Cell Biology (Feb. 2025). DOI

Abstract

Recent machine-learning (ML)-based advances in single-cell data science have enabled the stratification of human tissue donors at single-cell resolution, promising to provide valuable diagnostic and prognostic insights. However, such insights are susceptible to biases. Here we discuss various biases that emerge along the pipeline of ML-based single-cell analysis, ranging from societal biases affecting whose samples are collected, to clinical and cohort biases that influence the generalizability of single-cell datasets, biases stemming from single-cell sequencing, ML biases specific to (weakly supervised or unsupervised) ML models trained on human single-cell samples and biases during the interpretation of results from ML models. We end by providing methods for single-cell data scientists to assess and mitigate biases, and call for efforts to address the root causes of biases.

MCML Authors

Niki Kilbertus

Prof. Dr.

A3 | Computational Models

Ethics in Systems Design and Machine Learning

Stefan Bauer

Prof. Dr.

A1 | Statistical Foundations & Explainability

Algorithmic Machine Learning & Explainable AI

2024

[1]

B. M. G. Nielsen, L. Gresele and A. Dittadi.
Challenges in Explaining Representational Similarity through Identifiability.
UniReps @NeurIPS 2024 - 2nd Workshop on Unifying Representations in Neural Models at the 37th Conference on Neural Information Processing Systems (NeurIPS 2023). Vancouver, Canada, Dec 10-15, 2024. URL

Abstract

The phenomenon of different deep learning models producing similar data representations has garnered significant attention, raising the question of why such representational similarity occurs. Identifiability theory offers a partial explanation: for a broad class of discriminative models, including many popular in representation learning, those assigning equal likelihood to the observations yield representations that are equal up to a linear transformation, if a suitable diversity condition holds. In this work, we identify two key challenges in applying identifiability theory to explain representational similarity. First, the assumption of exact likelihood equality is rarely satisfied by practical models trained with different initializations. To address this, we describe how the representations of two models deviate from being linear transformations of each other, based on their difference in log-likelihoods. Second, we demonstrate that even models with similar and near-optimal loss values can produce highly dissimilar representations due to an underappreciated difference between loss and likelihood. Our findings highlight key open questions and point to future research directions for advancing the theoretical understanding of representational similarity.

MCML Authors

Andrea Dittadi

Dr.

A1 | Statistical Foundations & Explainability
→ Group Stefan Bauer

Algorithmic Machine Learning & Explainable AI

Research Group Stefan Bauer

Stefan Bauer

Team members @MCML

PostDocs

PhD Students

Recent News @MCML

MCML Researchers With 24 Papers at ICML 2025

MCML Researchers With 91 Papers in Highly-Ranked Journals

MCML Researchers With 31 Papers at NeurIPS 2024

Several MCML PIs Receive BMBF Funding

Publications @MCML

2025

2024