Home  | News

10.06.2025

Tiny logo
Teaser image to MCML at CVPR 2025

35 Accepted Papers (29 Main, and 6 Workshops)

IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, Jun 11-15, 2025

We are happy to announce that MCML researchers have contributed a total of 35 papers to CVPR 2025: 29 Main, and 6 Workshop papers. Congrats to our researchers!

Main Track (29 papers)

S. A. Baumann • F. Krause • M. Neumayr • N. Stracke • M. Sevi • V. T. HuB. Ommer
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI GitHub

Q. Bouniot • I. Redko • A. Mallasto • C. Laclau • O. Struckmeier • K. Arndt • M. Heinonen • V. Kyrki • S. Kaski
From Alexnet to Transformers: Measuring the Non-linearity of Deep Neural Networks with Affine Optimal Transport.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

H. Chen • H. Li • Y. ZhangG. ZhangJ. Bi • P. Torr • J. Gu • D. Krompass • V. Tresp
FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

C. CurreliD. MuhleA. Saroha • Z. Ye • R. MarinD. Cremers
Nonisotropic Gaussian Diffusion for Realistic 3D Human Motion Prediction.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI GitHub

Z. Chen • Y. Wang • L. Nan • X. Zhu
Parametric Point Cloud Completion for Polygonal Surface Reconstruction.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI GitHub

S. Dziadzio • V. Udandarao • K. Roth • A. Prabhu • Z. Akata • S. Albanie • M. Bethge
How to Merge Your Multimodal Models Over Time?
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

T. DagèsS. WeberY.-W. E. Lin • R. Talmon • D. Cremers • M. Lindenbaum • A. M. Bruckstein • R. Kimmel
Finsler Multi-Dimensional Scaling: Manifold Learning for Asymmetric Dimensionality Reduction and Embedding.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

T. Hannan • M. M. Islam • J. Gu • T. Seidl • G. Bertasius
ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI GitHub

O. Hahn • C. ReichN. AraslanovD. Cremers • C. Rupprecht • S. Roth
Scene-Centric Unsupervised Panoptic Segmentation.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI GitHub

S. KimR. XiaoM.-I. GeorgescuS. AlanizZ. Akata
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

T. Liu • Z. Lai • J. Wang • G. ZhangS. Chen • P. Torr • V. Demberg • V. Tresp • J. Gu
Multimodal Pragmatic Jailbreak on Text-to-image Models.
CVPR 2025 - 2nd Workshop on Responsible Generative AI at IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. Best Paper Award. URL GitHub

W. Li • H. Xu • J. HuangH. Jung • P. K. Yu • N. NavabB. Busam
GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI GitHub

D. Mildenberger • P. Hager • D. RückertM. J. Menten
A Tale of Two Classes: Adapting Supervised Contrastive Learning to Binary Imbalanced Datasets.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

E. ÖzsoyC. Pellegrini • T. Czempiel • F. TristramK. YuanD. Bani-Harouni • U. Eck • B. BusamM. KeicherN. Navab
MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI GitHub

R. Qorbani • G. Villani • T. Panagiotakopoulos • M. B. Colomer • L. Härenstam-Nielsen • M. Segu • P. L. Dovesi • J. Karlgren • D. Cremers • F. Tombari • M. Poggi
Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

K. RothZ. Akata • D. Damen • I. Balažević • O. J. Hénaff
Context-Aware Multimodal Pretraining.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

P. Roetzer • V. EhmD. Cremers • Z. Lähner • F. Bernard
Higher-Order Ratio Cycles for Fast and Globally Optimal Shape Matching.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

D. SchnausN. AraslanovD. Cremers
It's a (Blind) Match! Towards Vision-Language Correspondence without Parallel Data.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

N. Stracke • S. A. Baumann • K. Bauer • F. Fundel • B. Ommer
CleanDIFT: Diffusion Features without Noise.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

L. Sang • Z. Canfes • D. Cao • R. Marin • F. Bernard • D. Cremers
4Deform: Neural Surface Deformation for Robust Shape Interpolation.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

J. Schusterbauer • M. Gui • F. Fundel • B. Ommer
Diff2Flow: Training Flow Matching Models via Diffusion Model Alignment.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

D. SinitsynL. Härenstam-NielsenD. Cremers
PRaDA: Projective Radial Distortion Averaging.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

F. WimbauerW. ChenD. Muhle • C. Rupprecht • D. Cremers
AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

Y. Xie • V. Ehm • P. Roetzer • N. Amrani • M. Gao • F. Bernard • D. Cremers
EchoMatch: Partial-to-Partial Shape Matching via Correspondence Reflection.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

R. XiaoS. KimM.-I. GeorgescuZ. AkataS. Alaniz
FLAIR: VLM with Fine-grained Language-informed Image Representations.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI GitHub

Y. YeganehA. Farshad • I. Charisiadis • M. Hasny • M. Hartenberger • B. OmmerN. Navab • E. Adeli
Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. Highlight Paper. DOI

Y. Yuan • Y. XiaD. Cremers • M. Sester
SparseAlign: a Fully Sparse Framework for Cooperative Object Detection.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

D. Zhu • Y. Di • S. Gavranovic • S. Ilic
SeaLion: Semantic Part-Aware Latent Point Diffusion Models for 3D Generation.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

G. Zhang • M. L. A. Fok • J. Ma • Y. XiaD. Cremers • P. Torr • V. Tresp • J. Gu
Localizing Events in Videos with Multimodal Queries.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

Workshops (6 papers)

L. BastianM. RashedN. Navab • T. Birdal
Continuous-Time SO(3) Forecasting with Savitzky--Golay Neural Controlled Differential Equations.
4DVision @CVPR 2025 - Workshop on 4D Vision: Modeling the Dynamic World at IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. arXiv

Y. Luo • R. Hoffmann • Y. Xia • O. Wysocki • B. Schwab • T. H. Kolbe • D. Cremers
RADLER: Radar Object Detection Leveraging Semantic 3D City Models and Self-Supervised Radar-Image Learning.
PBVS @CVPR 2025 - 21st IEEE Workshop on Perception Beyond the Visible Spectrum at IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI GitHub

E. ÖzsoyF. HolmC. Pellegrini • T. Czempiel • M. Saleh • N. NavabB. Busam
Location-Free Scene Graph Generation.
MULA @CVPR 2025 - 8th Multimodal Learning and Applications Workshop at IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

W. Tang • W. Li • X. Liang • O. Wysocki • F. Biljecki • C. Holst • B. Jutzi
Texture2LoD3: Enabling LoD3 Building Reconstruction With Panoramic Images.
USM3D @CVPR 2025 - 2nd Workshop on Urban Scene Modeling at IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI GitHub

L. Waldmann • A. Shah • Y. Wang • N. LehmannA. J. StewartZ. XiongX. ZhuS. Bauer • J. Chuang
Panopticon: Advancing Any-Sensor Foundation Models for Earth Observation.
EARTHVISION @CVPR 2025 - Workshop EarthVision: Large Scale Computer Vision for Remote Sensing Imagery at IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI

D. Zverev • T. Wiedemer • A. Prabhu • M. Bethge • W. Brendel • A. S. Koepke
VGGSounder: Audio-Visual Evaluations for Foundation Models.
Sight and Sound @CVPR 2025 - Workshop Sight and Sound at IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. PDF

#research #top-tier-work #akata #bauer-s #busam #cremers #jegelka #koepke #marin-riccardo #menten #navab #ommer #rueckert #seidl #thuerey #tresp #wang-xi #zhu
Subscribe to RSS News feed

Related

Link to Fabian Theis Featured in Handelsblatt on the Future of AI in Precision Medicine

16.12.2025

Fabian Theis Featured in Handelsblatt on the Future of AI in Precision Medicine

MCML PI Fabian Theis discusses AI-driven precision medicine and its growing impact on individualized healthcare and biomedical research.

Link to Gitta Kutyniok Featured in VDI Nachrichten on AI Ethics

16.12.2025

Gitta Kutyniok Featured in VDI Nachrichten on AI Ethics

Gitta Kutyniok discusses measurable criteria for ethical AI, promoting safe and responsible autonomous decision-making.

Link to Hinrich Schütze Featured in WirtschaftsWoche on Innovative AI Approaches

16.12.2025

Hinrich Schütze Featured in WirtschaftsWoche on Innovative AI Approaches

Hinrich Schütze discusses Giotto.ai’s efficient AI models, highlighting memory separation and context-aware decoding to improve robustness.

Link to Xiaoxiang Zhu Featured in Focus Online on Global 3D Building Atlas

16.12.2025

Xiaoxiang Zhu Featured in Focus Online on Global 3D Building Atlas

Xiaoxiang Zhu maps 2.75B buildings in 3D, revealing global urbanization, housing, and social inequalities using AI.

Link to From Sitting Dog to Standing: A New Way to Morph 3D Shapes

11.12.2025

From Sitting Dog to Standing: A New Way to Morph 3D Shapes

ICLR 2025 work by Lu Sang and Daniel Cremers in collaboration with U Bonn enables smooth, physics-aware 3D shape deformation from point clouds.

Back to Top