27.02.2025
MCML at WACV 2025
Eight Accepted Papers
IEEE/CVF Winter Conference on Applications of Computer Vision, Tucson, AZ, USA, Feb 28-Mar 04, 2025
We are happy to announce that MCML researchers have contributed a total of 8 papers to WACV 2025. Congrats to our researchers!
Main Track (8 papers)
R. Amoroso • G. Zhang • R. Koner • L. Baraldi • R. Cucchiara • V. Tresp
Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries.
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. DOI
Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries.
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. DOI
A. H. Berger • L. Lux • S. Shit • I. Ezhov • G. Kaissis • M. J. Menten • D. Rückert • J. C. Paetzold
Cross-Domain and Cross-Dimension Learning for Image-to-Graph Transformers.
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. DOI
Cross-Domain and Cross-Dimension Learning for Image-to-Graph Transformers.
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. DOI
S. Chen • Z. Han • B. He • J. Liu • M. Buckley • Y. Qin • P. Torr • V. Tresp • J. Gu
Can Multimodal Large Language Models Truly Perform Multimodal In-Context Learning?
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. DOI URL
Can Multimodal Large Language Models Truly Perform Multimodal In-Context Learning?
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. DOI URL
F. Fundel • J. Schusterbauer • V. T. Hu • B. Ommer
Distillation of Diffusion Features for Semantic Correspondence.
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. DOI
Distillation of Diffusion Features for Semantic Correspondence.
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. DOI
F. Hofherr • B. Haefner • D. Cremers
On Neural BRDFs: A Thorough Comparison of State-of-the-Art Approaches.
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. Oral Presentation. DOI
On Neural BRDFs: A Thorough Comparison of State-of-the-Art Approaches.
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. Oral Presentation. DOI
Y. Li • M. Ghahremani • Y. Wally • C. Wachinger
DiaMond: Dementia Diagnosis with Multi-Modal Vision Transformers Using MRI and PET.
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. DOI
DiaMond: Dementia Diagnosis with Multi-Modal Vision Transformers Using MRI and PET.
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. DOI
O. Wysocki • Y. Tan • T. Froech • Y. Xia • M. Wysocki • L. Hoegner • D. Cremers • C. Holst
ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset.
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. DOI GitHub
ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset.
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. DOI GitHub
Y. Zhang • H. Chen • A. Frikha • Y. Yang • D. Krompass • G. Zhang • J. Gu • V. Tresp
CL-Cross VQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering.
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. DOI
CL-Cross VQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering.
WACV 2025 - IEEE/CVF Winter Conference on Applications of Computer Vision. Tucson, AZ, USA, Feb 28-Mar 04, 2025. DOI
Related
28.04.2026
Björn Ommer: How AI Can Transform Society if We Use It Responsibly
MCML PI Björn Ommer explains the philosophy behind Stable Diffusion and why his team focuses on efficiency.
23.04.2026
Development of a New AI Foundation Model
Bavaria invests €50M in Nvidia GPUs for AI infrastructure with MCML PI Björn Ommer highlighting efficient and accessible AI development.
23.04.2026
When Vision AI Hallucinates Details
Why do vision-language models invent details? Our PI Zeynep Akata and her team present a fix for AI hallucinations at CVPR 2026.