Home  | Publications | WSL+25

Biases in Machine-Learning Models of Human Single-Cell Data

MCML Authors

Link to Profile Niki Kilbertus PI Matchmaking

Niki Kilbertus

Prof. Dr.

Principal Investigator

Link to Profile Stefan Bauer PI Matchmaking

Stefan Bauer

Prof. Dr.

Principal Investigator

Abstract

Recent machine-learning (ML)-based advances in single-cell data science have enabled the stratification of human tissue donors at single-cell resolution, promising to provide valuable diagnostic and prognostic insights. However, such insights are susceptible to biases. Here we discuss various biases that emerge along the pipeline of ML-based single-cell analysis, ranging from societal biases affecting whose samples are collected, to clinical and cohort biases that influence the generalizability of single-cell datasets, biases stemming from single-cell sequencing, ML biases specific to (weakly supervised or unsupervised) ML models trained on human single-cell samples and biases during the interpretation of results from ML models. We end by providing methods for single-cell data scientists to assess and mitigate biases, and call for efforts to address the root causes of biases.

article


Nature Cell Biology

Feb. 2025.
Top Journal

Authors

T. Willem • V. A. Shitov • M. D. Luecken • N. KilbertusS. Bauer • M. Piraud • A. Buyx • F. J. Theis

Links

DOI

Research Areas

 A1 | Statistical Foundations & Explainability

 A3 | Computational Models

BibTeXKey: WSL+25

Back to Top