
Research Group Xi Wang

Dr. Xi Wang

JRG Leader Egocentric Vision

Computer Vision & Artificial Intelligence

Xi Wang leads the MCML Junior Research Group ‘Egocentric Vision’ at TU Munich.

Xi Wang and her team conduct research in egocentric vision, focusing on learning from first-person human videos to understand behavior patterns and extract information useful for robotics applications. Ongoing projects include 3D reconstruction using Gaussian splatting and multimodal learning with vision-language models. Funded as a BMBF project, the group maintains close ties with MCML and actively seeks collaborations that connect egocentric vision with other research domains beyond its own focus.

Team members @MCML

PhD Students

Abhishek Saroha

→ Group Xi Wang
Computer Vision & Artificial Intelligence

Dominik Schnaus

→ Group Xi Wang
Computer Vision & Artificial Intelligence

Ganlin Zhang

→ Group Xi Wang
Computer Vision & Artificial Intelligence

Publications @MCML

2026

[22]

C. Curreli • F. Hofherr • D. Muhle • A. Saroha • R. Marin • D. Cremers
EquiFusion: Kinematics-Agnostic Human Motion Prediction via Equivariant Latent Diffusion.
ECCV 2026 - 19th European Conference on Computer Vision. Malmö, Sweden, Sep 08-12, 2026. To be published. Preprint available. arXiv URL

[21]

C. Koke • Y. Shen • A. Saroha • M. Eisenberger • B. Rieck • M. M. Bronstein • D. Cremers
Graph Neural Networks Are Not Continuous Across Graph Resolutions.
ICML 2026 - 43rd International Conference on Machine Learning. Seoul, South Korea, Jul 06-11, 2026. To be published. Preprint available. URL GitHub

[20]

F. Dröge • C. Curreli • A. Saroha • D. Cremers
Benchmarking Single-Step Inpainting Methods for Multi-Object 3D Gaussian Splatting Scenes.
CVEU @CVPR 2026 - Workshop on AI for Creative Visual Content Generation Editing and Understanding at the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Denver, CO, USA, Jun 03-07, 2026. arXiv URL

[19]

A. Saroha • H. Zeng • X. Zuo • D. Cremers • X. Wang
EgoFlow: Gradient-Guided Flow Matching for Egocentric 6DoF Object Motion Generation.
CVPR 2026 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Denver, CO, USA, Jun 03-07, 2026. To be published. Preprint available. arXiv GitHub

[18]

G. Zhang • W. Chen • D. Cremers • X. Wang
BA-T: An Iterative Transformer for Two-View Bundle Adjustment.
Preprint (Jun. 2026). arXiv

[17]

W. Chen • C. Zheng • G. Zhang • A. Vedaldi • D. Cremers
NOVA3R: Non-pixel-aligned Visual Transformer for Amodal 3D Reconstruction.
ICLR 2026 - 14th International Conference on Learning Representations. Rio de Janeiro, Brazil, Apr 23-27, 2026. To be published. Preprint available. URL

[16]

H. Zeng • A. Saroha • D. Cremers • X. Wang
GMT: Goal-Conditioned Multimodal Transformer for 6-DOF Object Trajectory Synthesis in 3D Scenes.
3DV 2026 - 13th International Conference on 3D Vision. Vancouver, Canada, Mar 20-23, 2026. To be published. Preprint available. URL

[15]

Y. Wang • Y. Miao • W. Zhao • W. Yang • Z. Wang • J. Pajarinen • L. Van Gool • D. P. Paudel • J. Kannala • X. Wang • A. Solin
PAWS: Perception of Articulation in the Wild at Scale from Egocentric Videos.
Preprint (Mar. 2026). arXiv GitHub

[14]

S. Qian • G. Zhang • S. Wu • D. Cremers
Flow4R: Unifying 4D Reconstruction and Tracking with Scene Flow.
Preprint (Feb. 2026). arXiv

2025

[13]

O. Kuzyk • Z. Li • M. Pollefeys • X. Wang
VisualChef: Generating Visual Aids in Cooking via Mask Inpainting.
GCPR 2025 - German Conference on Pattern Recognition. Freiburg, Germany, Oct 23-26, 2025. DOI

[12]

W. Chen • G. Zhang • F. Wimbauer • R. Wang • N. Araslanov • A. Vedaldi • D. Cremers
Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction.
ICCV 2025 - IEEE/CVF International Conference on Computer Vision. Honolulu, Hawai’i, Oct 19-23, 2025. DOI

[11]

G. Zhang • S. Qian • X. Wang • D. Cremers
ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association.
Preprint (Sep. 2025). arXiv GitHub

[10]

D. Schnaus • N. Araslanov • D. Cremers
It's a (Blind) Match! Towards Vision-Language Correspondence without Parallel Data.
CVPR 2025 - IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025. DOI GitHub

[9]

C. Koke • Y. Shen • A. Saroha • M. Eisenberger • B. Rieck • M. M. Bronstein • D. Cremers
Graph Networks struggle with variable Scale.
ICBINB @ICLR 2025 - Workshop I Can’t Believe It’s Not Better: Challenges in Applied Deep Learning at the 13th International Conference on Learning Representations. Singapore, Apr 24-28, 2025. URL

[8]

C. Koke • D. Schnaus • Y. Shen • A. Saroha • M. Eisenberger • B. Rieck • M. M. Bronstein • D. Cremers
On multi-scale Graph Representation Learning.
LMRL @ICLR 2025 - Workshop on Learning Meaningful Representations of Life at the 13th International Conference on Learning Representations. Singapore, Apr 24-28, 2025. URL

[7]

C. Koke • Y. Shen • A. Saroha • M. Eisenberger • B. Rieck • M. M. Bronstein • D. Cremers
On Incorporating Scale into Graph Networks.
MLMP @ICLR 2025 - Workshop on Machine Learning Multiscale Processes at the 13th International Conference on Learning Representations. Singapore, Apr 24-28, 2025. Best Paper Award. URL

[6]

N. P. A. Vu • A. Saroha • O. Litany • D. Cremers
GAS-NeRF: Geometry-Aware Stylization of Dynamic Radiance Fields.
Preprint (Mar. 2025). arXiv

[5]

A. Saroha • F. Hofherr • M. Gladkova • C. Curreli • O. Litany • D. Cremers
ZDySS -- Zero-Shot Dynamic Scene Stylization using Gaussian Splatting.
Preprint (Jan. 2025). arXiv

2024

[4]

L. Sang • A. Saroha • M. Gao • D. Cremers
Enhancing Surface Neural Implicits with Curvature-Guided Sampling and Uncertainty-Augmented Representations.
GCPR 2024 - German Conference on Pattern Recognition. Munich, Germany, Oct 10-13, 2024. DOI

[3]

A. Saroha • M. Gladkova • C. Curreli • D. Muhle • T. Yenamandra • D. Cremers
Gaussian Splatting in Style.
GCPR 2024 - German Conference on Pattern Recognition. Munich, Germany, Oct 10-13, 2024. DOI

[2]

L. Härenstam-Nielsen • L. Sang • A. Saroha • N. Araslanov • D. Cremers
DiffCD: A Symmetric Differentiable Chamfer Distance for Neural Implicit Surface Fitting.
ECCV 2024 - 18th European Conference on Computer Vision. Milano, Italy, Sep 29-Oct 04, 2024. DOI GitHub

[1]

L. Sang • M. Gao • A. Saroha • D. Cremers
Enhancing Surface Neural Implicits with Curvature-Guided Sampling and Uncertainty-Augmented Representations.
Wild3D @ECCV 2024 - Workshop 3D Modeling, Reconstruction, and Generation in the Wild at the 18th European Conference on Computer Vision. Milano, Italy, Sep 29-Oct 04, 2024. URL

2025-03-14 - Last modified: 2026-07-03


