Home  | Publications | WCL+25

Multimodal Emotion Recognition in Conversations: A Survey of Methods, Trends, Challenges and Prospects

MCML Authors

Abstract

While text-based emotion recognition methods have achieved notable success, real-world dialogue systems often demand a more nuanced emotional understanding than any single modality can offer. Multimodal Emotion Recognition in Conversations (MERC) has thus emerged as a crucial direction for enhancing the naturalness and emotional understanding of human-computer interaction. Its goal is to accurately recognize emotions by integrating information from various modalities such as text, speech, and visual signals.<br>This survey offers a systematic overview of MERC, including its motivations, core tasks, representative methods, and evaluation strategies. We further examine recent trends, highlight key challenges, and outline future directions. As interest in emotionally intelligent systems grows, this survey provides timely guidance for advancing MERC research.

inproceedings


Findings @EMNLP 2025

Findings of the Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. To be published. Preprint available.
Conference logo
A* Conference

Authors

C. Wu • Y. Cai • Y. Liu • P. Zhu • Y. Xue • Z. Gong • J. Hirschberg • B. Ma

Links


Research Area

 C4 | Computational Social Sciences

BibTeXKey: WCL+25

Back to Top