Home | Publications | LXH+25

GCE-Pose: Global Context Enhancement for Category-Level Object Pose Estimation

MCML Authors

Weihang Li

→ Group Benjamin Busam
Photogrammetry and Remote Sensing

Junwen Huang

Dr.

→ Group Benjamin Busam
Photogrammetry and Remote Sensing

Hyunjun Jung

Dr.

→ Group Benjamin Busam
Photogrammetry and Remote Sensing

Nassir Navab

Prof. Dr.

Core PI

Computer Aided Medical Procedures & Augmented Reality

Benjamin Busam

Prof. Dr.

Core PI

Photogrammetry and Remote Sensing

Abstract

A key challenge in model-free category-level pose estimation is the extraction of contextual object features that generalize across varying instances within a specific category. Recent approaches leverage foundational features to capture semantic and geometry cues from data. However, these approaches fail under partial visibility. We overcome this with a first-complete-then-aggregate strategy for feature extraction utilizing class priors. In this paper, we present GCE-Pose, a method that enhances pose estimation for novel instances by integrating category-level global context prior. GCE-Pose performs semantic shape reconstruction with a proposed Semantic Shape Reconstruction (SSR) module. Given an unseen partial RGB-D object instance, our SSR module reconstructs the instance's global geometry and semantics by deforming category-specific 3D semantic prototypes through a learned deep Linear Shape Model. We further introduce a Global Context Enhanced (GCE) feature fusion module that effectively fuses features from partial RGB-D observations and the reconstructed global context. Extensive experiments validate the impact of our global context prior and the effectiveness of the GCE fusion module, demonstrating that GCE-Pose significantly outperforms existing methods on challenging real-world datasets HouseCat6D and NOCS-REAL275.

inproceedings LXH+25

CVPR 2025

IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA, Jun 11-15, 2025.

Authors

W. Li • H. Xu • J. Huang • H. Jung • P. K. Yu • N. Navab • B. Busam

Links

DOI GitHub

In Collaboration

XYZ Robotics

Research Areas

B1 | Computer Vision

C1 | Medicine

BibTeXKey: LXH+25

#p-busam #p-navab