Munich AI Lectures

Advances in Dense Video Captioning, Vision-Guided Navigation and Robot Manipulation

Cordelia Schmid, Inria Institute / Google Research

   25.01.2024

   5:00 pm - 7:00 pm

   TUM Campus Munich, Room 0790, Arcisstraße 21, 80333 München

On behalf of our partners at the Bavarian AI network baiosphere, the MCML cordially invites you to the Munich AI Lectures.

Cordelia Schmid is a pioneer in AI research. She invented image recognition methods that enabled computers to semantically interpret image and video content. Her computer vision algorithms are key to the development of robotic assistants that will be able to recognize their surroundings and respond to spoken commands. Her work has been honored with major awards, most recently the Körber Prize, which is endowed with one million euros.

In this talk, she presents Vid2Seq, a model for dense video captioning that predicts temporal boundaries and textual descriptions from video and speech, and a retrieval-augmented visual language model that achieves state-of-the-art results in video question answering and image captioning. She also introduces the History Aware Multimodal Transformer (HAMT) for vision-guided navigation and robot manipulation, demonstrating its superior performance on benchmarks and in real-world applications with the Tiago robot and UR5 arm.

Organized by:

baiosphere

Bavarian Academy of Sciences and Humanities

Helmholtz Munich

LMU Munich

TUM

AI-HUB LMU

ELLIS Munich Unit

Konrad Zuse School of Excellence in Reliable AI

MCML

Munich Data Science Institute TUM

Munich Institute of Robotics and Machine Intelligence TUM


Related

Colloquium  •  15.01.2025  •  LMU Department of Statistics and via Zoom

Additive Density-on-Scalar Regression in Bayes Hilbert Spaces With an Application to Gender Economics

15.01.25, 4-6 pm: LMU Statistics Colloquium with Sonja Greven (HU Berlin).


AI Keynote Series  •  09.01.2025  •  Online via Zoom

Experimental Designs for A/B Testing in Marketplaces

09.01.25, 12-1:30 pm: AI Keynote Series with Chengchun Shi from London School of Economics and Political Science.