Research Group Zeynep Akata


Zeynep Akata

Prof. Dr.

Principal Investigator

Zeynep Akata is a Liesel Beckmann Distinguished Professor of Computer Science at TUM and the director of the Institute for Explainable Machine Learning at Helmholtz Munich.

Zeynep Akata’s field of research is explainable machine learning. Her goal is to build transparent algorithms that make comprehensible decisions. Her approach combines methods from computer vision, machine learning, and natural language processing. Her scientific vision is to create a self-explanatory artificial intelligence that can learn from minimal feedback and interact reliably with humans.

Team members @MCML

PostDocs

Dr. Stephan Alaniz
Dr. Quentin Bouniot
Maria Alejandra Bravo Sarmiento
Kirill Bykov

PhD Students

Jessica Bader
Massimo Bini
Luca Eyring
Leander Girrbach
Yiran Huang
Shyamgopal Karthik
Jae Myung Kim
Sanghwan Kim
Mateusz Pach
Razieh Rezaei
Simon Roschmann
Karsten Roth
Rui Xiao

Recent News @MCML

MCML at ICCV 2025

MCML at ICML 2025

26.06.2025

Zeynep Akata Receives 2025 ZukunftsWissen Prize

MCML at CVPR 2025

MCML at ICLR 2025

Publications @MCML

2025


[42]
Y. Huang • L. Thede • M. Mancini • W. Xu • Z. Akata
Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study.
GCPR 2025 - German Conference on Pattern Recognition. Freiburg, Germany, Oct 23-26, 2025. To be published. Preprint available. arXiv

[41]
S. N. Rai • S. Karthik • M.-I. Georgescu • B. Caputo • C. Masone • Z. Akata
Road Obstacle Video Segmentation.
GCPR 2025 - German Conference on Pattern Recognition. Freiburg, Germany, Oct 23-26, 2025. To be published. Preprint available. arXiv

[40] A* Conference
J. Bader • L. Girrbach • S. Alaniz • Z. Akata
SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions.
ICCV 2025 - IEEE/CVF International Conference on Computer Vision. Honolulu, Hawai’i, Oct 19-23, 2025. To be published. Preprint available. URL GitHub

[39] A* Conference
S. Karthik • H. Coskun • Z. Akata • S. Tulyakov • J. Ren • A. Kag
Scalable Ranked Preference Optimization for Text-to-Image Generation.
ICCV 2025 - IEEE/CVF International Conference on Computer Vision. Honolulu, Hawai’i, Oct 19-23, 2025. To be published. Preprint available. arXiv

[38]
L. Girrbach • S. Alaniz • G. Smith • T. Darrell • Z. Akata
Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models.
Preprint (Oct. 2025). arXiv

[37]
S. Kim • R. Xiao • S. Alaniz • Y. Xian • Z. Akata
Training-free Uncertainty Guidance for Complex Visual Tasks with MLLMs.
Preprint (Oct. 2025). arXiv

[36]
J. Bader • M. Pach • M. A. Bravo • S. Belongie • Z. Akata
Stitch: Training-Free Position Control in Multimodal Diffusion Transformers.
Preprint (Sep. 2025). arXiv GitHub

[35]
L. Girrbach • C.-P. Su • T. Saanum • R. Socher • E. Schulz • Z. Akata
Reference-Free Rating of LLM Responses via Latent Information.
Preprint (Sep. 2025). arXiv

[34]
L. Eyring • S. Karthik • A. Dosovitskiy • N. Ruiz • Z. Akata
Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models.
Preprint (Aug. 2025). arXiv GitHub

[33] A* Conference
L. Thede • K. Roth • M. Bethge • Z. Akata • T. Hartvigsen
WikiBigEdit: Understanding the Limits of Lifelong Knowledge Editing in LLMs.
ICML 2025 - 42nd International Conference on Machine Learning. Vancouver, Canada, Jul 13-19, 2025. URL

[32]
P. Spohn • L. Girrbach • J. Bader • Z. Akata
Align-then-Unlearn: Embedding Alignment for LLM Unlearning.
MUGen @ICML 2025 - Workshop on Machine Unlearning for Generative AI at the 42nd International Conference on Machine Learning. Vancouver, Canada, Jul 13-19, 2025. URL

[31] A* Conference
Q. Bouniot • I. Redko • A. Mallasto • C. Laclau • O. Struckmeier • K. Arndt • M. Heinonen • V. Kyrki • S. Kaski
From Alexnet to Transformers: Measuring the Non-linearity of Deep Neural Networks with Affine Optimal Transport.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

[30] A* Conference
S. Dziadzio • V. Udandarao • K. Roth • A. Prabhu • Z. Akata • S. Albanie • M. Bethge
How to Merge Your Multimodal Models Over Time?
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

[29] A* Conference
S. Kim • R. Xiao • M.-I. Georgescu • S. Alaniz • Z. Akata
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

[28] A* Conference
K. Roth • Z. Akata • D. Damen • I. Balažević • O. J. Hénaff
Context-Aware Multimodal Pretraining.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

[27] A* Conference
R. Xiao • S. Kim • M.-I. Georgescu • Z. Akata • S. Alaniz
FLAIR: VLM with Fine-grained Language-informed Image Representations.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI GitHub

[26]
S. Roschmann • Q. Bouniot • V. Feofanov • I. Redko • Z. Akata
Time Series Representations for Classification Lie Hidden in Pretrained Vision Transformers.
Preprint (Jun. 2025). arXiv

[25]
R. Skorobogat • K. Roth • M.-I. Georgescu • Z. Akata
Subspace-Boosted Model Merging.
Preprint (Jun. 2025). arXiv

[24]
J. Kim • S. Alaniz • C. Schmid • Z. Akata
LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance.
Preprint (May 2025). arXiv GitHub

[23] A* Conference
M. Bini • L. Girrbach • Z. Akata
Decoupling Angles and Strength in Low-rank Adaptation.
ICLR 2025 - 13th International Conference on Learning Representations. Singapore, Apr 24-28, 2025. URL GitHub

[22] A* Conference
Q. Bouniot • P. Mozharovskyi • F. d'Alché-Buc
Tailoring Mixup to Data for Calibration.
ICLR 2025 - 13th International Conference on Learning Representations. Singapore, Apr 24-28, 2025. URL

[21] A* Conference
L. Girrbach • Y. Huang • S. Alaniz • T. Darrell • Z. Akata
Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs).
ICLR 2025 - 13th International Conference on Learning Representations. Singapore, Apr 24-28, 2025. URL

[20] A* Conference
T. Uscidda • L. Eyring • K. Roth • F. J. Theis • Z. Akata • M. Cuturi
Disentangled Representation Learning with the Gromov-Monge Gap.
ICLR 2025 - 13th International Conference on Learning Representations. Singapore, Apr 24-28, 2025. URL

[19]
S. Dziadzio • V. Udandarao • K. Roth • A. Prabhu • Z. Akata • S. Albanie • M. Bethge
How to Merge Multimodal Models Over Time?
MCDC @ICLR 2025 - Workshop on Modularity for Collaborative, Decentralized, and Continual Deep Learning at the 13th International Conference on Learning Representations. Singapore, Apr 24-28, 2025. URL

[18]
M. Pach • S. Karthik • Q. Bouniot • S. Belongie • Z. Akata
Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models.
Preprint (Apr. 2025). arXiv

[17]
L. Girrbach • S. Alaniz • G. Smith • Z. Akata
A Large Scale Analysis of Gender Biases in Text-to-Image Generative Models.
Preprint (Mar. 2025). arXiv

[16]
S. Wu • S. Alaniz • E. Schulz • Z. Akata
Discovering Chunks in Neural Embeddings for Interpretability.
Preprint (Feb. 2025). arXiv

[15] Top Journal
M. Binz • S. Alaniz • A. Roskies • B.  • C. T. Bergstrom • C. Allen • D. Schad • D. Wulff • J. D.  • Q. Zhang • R. M. Shiffrin • S. J. Gershman • V. Popov • E. M. Bender • M. Marelli • M. M. Botvinick • Z. Akata • E. Schulz
How should the advancement of large language models affect the practice of science?
Proceedings of the National Academy of Sciences 122.5. Jan. 2025. DOI

2024


[14] A* Conference
L. Eyring • S. Karthik • K. Roth • A. Dosovitskiy • Z. Akata
ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization.
NeurIPS 2024 - 38th Conference on Neural Information Processing Systems. Vancouver, Canada, Dec 10-15, 2024. URL GitHub

[13] A* Conference
V. Udandarao • K. Roth • S. Dziadzio • A. Prabhu • M. Cherti • O. Vinyals • O. Hénaff • S. Albanie • Z. Akata • M. Bethge
A Practitioner's Guide to Real-World Continual Multimodal Pretraining.
NeurIPS 2024 - 38th Conference on Neural Information Processing Systems. Vancouver, Canada, Dec 10-15, 2024. URL GitHub

[12]
A. Höhl • I. Obadic • M.-Á. Fernández-Torres • H. Najjar • D. Oliveira • Z. Akata • A. Dengel • X. Zhu
Opening the Black Box: A systematic review on explainable artificial intelligence in remote sensing.
IEEE Geoscience and Remote Sensing Magazine 12.4. Dec. 2024. DOI

[11]
A. Baumann • R. Li • M. Klasson • S. Mentu • S. Karthik • Z. Akata • A. Solin • M. Trapp
Post-hoc Probabilistic Vision-Language Models.
Preprint (Dec. 2024). arXiv

[10]
Y. Yeganeh • R. Xiao • G. Guvercin • N. Navab • A. Farshad
Conformable Convolution for Topologically Aware Learning of Complex Anatomical Structures.
Preprint (Dec. 2024). arXiv

[9] A* Conference
A. Christensen • N. Mojab • K. Patel • K. Ahuja • Z. Akata • O. Winther • O. Gonzalez-Franco • A. Colaco
Geometry Fidelity for Spherical Images.
ECCV 2024 - 18th European Conference on Computer Vision. Milano, Italy, Sep 29-Oct 04, 2024. DOI

[8] A* Conference
T. Hummel • S. Karthik • M.-I. Georgescu • Z. Akata
EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval.
ECCV 2024 - 18th European Conference on Computer Vision. Milano, Italy, Sep 29-Oct 04, 2024. DOI GitHub

[7] A* Conference
J. M. Kim • J. Bader • S. Alaniz • C. Schmid • Z. Akata
DataDream: Few-shot Guided Dataset Generation.
ECCV 2024 - 18th European Conference on Computer Vision. Milano, Italy, Sep 29-Oct 04, 2024. DOI GitHub

[6]
L. Thede • K. Roth • O. J. Hénaff • M. Bethge • Z. Akata
Reflecting on the State of Rehearsal-free Continual Learning with Pretrained Models.
CoLLAs 2024 - 3rd Conference on Lifelong Learning Agents. Pisa, Italy, Aug 11-14, 2024. URL

[5]
M. Dani • M. J. Prakash • Z. Akata • S. Liebe
SemioLLM: Assessing Large Language Models for Semiological Analysis in Epilepsy Research.
AI4Science @ICML 2024 - AI for Science Workshop at the 41st International Conference on Machine Learning. Vienna, Austria, Jul 21-27, 2024. URL

[4] A* Conference
M. Bini • K. Roth • Z. Akata • A. Khoreva
ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections.
ICML 2024 - 41st International Conference on Machine Learning. Vienna, Austria, Jul 21-27, 2024. URL GitHub

[3]
T. Uscidda • L. Eyring • K. Roth • F. J. Theis • Z. Akata • M. Cuturi
Disentangled Representation Learning through Geometry Preservation with the Gromov-Monge Gap.
SPIGM @ICML 2024 - Workshop on Structured Probabilistic Inference & Generative Modeling at the 41st International Conference on Machine Learning. Vienna, Austria, Jul 21-27, 2024. arXiv

2023


[2]
Y. Yeganeh • A. Farshad • G. Guevercin • A. Abu-zer • R. Xiao • Y. Tang • E. Adeli • N. Navab
SCOPE: Structural Continuity Preservation for Medical Image Segmentation.
GRAIL @MICCAI 2023 - 5th Workshop on GRaphs in biomedicAl Image anaLysis at the 26th International Conference on Medical Image Computing and Computer Assisted Intervention. Vancouver, Canada, Oct 08-12, 2023. DOI

[1]
Y. Yeganeh • G. Güvercin • R. Xiao • A. Abuzer • E. Adeli • A. Farshad • N. Navab
SCOPE: Structural Continuity Preservation for Retinal Vessel Segmentation.
GRAIL @MICCAI 2023 - 5th Workshop on GRaphs in biomedicAl Image anaLysis at the 26th International Conference on Medical Image Computing and Computer Assisted Intervention. Vancouver, Canada, Oct 08-12, 2023. DOI