Home | Publications | BWR+25a

X-SiT: Inherently Interpretable Surface Vision Transformers for Dementia Diagnosis

MCML Authors

Fabian Bongratz

→ Group Christian Wachinger
Artificial Intelligence in Medical Imaging

Tom Nuno Wolf

→ Group Christian Wachinger
Artificial Intelligence in Medical Imaging

Christian Wachinger

Prof. Dr.

Principal Investigator

Artificial Intelligence in Medical Imaging

Abstract

Interpretable models are crucial for supporting clinical decision-making, driving advances in their development and application for medical images. However, the nature of 3D volumetric data makes it inherently challenging to visualize and interpret intricate and complex structures like the cerebral cortex. Cortical surface renderings, on the other hand, provide a more accessible and understandable 3D representation of brain anatomy, facilitating visualization and interactive exploration. Motivated by this advantage and the widespread use of surface data for studying neurological disorders, we present the eXplainable Surface Vision Transformer (X-SiT). This is the first inherently interpretable neural network that offers human-understandable predictions based on interpretable cortical features. As part of X-SiT, we introduce a prototypical surface patch decoder for classifying surface patch embeddings, incorporating case-based reasoning with spatially corresponding cortical prototypes. The results demonstrate state-of-the-art performance in detecting Alzheimer's disease and frontotemporal dementia while additionally providing informative prototypes that align with known disease patterns and reveal classification errors.

inproceedings BWR+25a

MICCAI 2025

28th International Conference on Medical Image Computing and Computer Assisted Intervention. Daejeon, Republic of Korea, Sep 23-27, 2025.

Authors

F. Bongratz • T. N. Wolf • J. G. Ramon • C. Wachinger

Links

DOI

Research Area

C1 | Medicine

BibTeXKey: BWR+25a

#p-wachinger