Home | Research | Groups | Björn Schuller

Research Group Björn Schuller

Link to website at TUM

Björn Schuller

Prof. Dr.

Core PI

Health Informatics

Björn Schuller

is professor of Health Informatics at TU Munich.

His research combines computer science with modern health care and medicine. The main focus lies in the acquisition, analysis, and interpretation of biosignals including in daily life, such as those generated in monitoring heart activity, metabolism, or neuronal activities. Additionally, acoustic, visual, and a variety of other parameters are also evaluated. The goal is prevention, diagnosis, as well as decision support and intervention through efficient, transparent, and trustworthy methods of current Artificial Intelligence.

Team members @MCML

PostDocs

Link to website

Anton Batliner

Dr.

→ Group Björn Schuller
Health Informatics

Link to website

Manuel Milling

Dr.

→ Group Björn Schuller
Health Informatics

Link to website

Florian Pokorny

Dr.

→ Group Björn Schuller
Health Informatics

Link to website

Esther Rituerto-González

Dr.

→ Group Björn Schuller
Health Informatics

Link to website

Andreas Triantafyllopoulos

Dr.

→ Group Björn Schuller
Health Informatics

Link to website

Jiadong Wang

Dr.

→ Group Björn Schuller
Health Informatics

PhD Students

Link to website

Simon Bucher

→ Group Björn Schuller
Health Informatics

Link to website

Alexander Gebhard

→ Group Björn Schuller
Health Informatics

Link to website

Monica Gonzalez Machorro

→ Group Björn Schuller
Health Informatics

Xin Jing

Xin Jing

→ Group Björn Schuller
Health Informatics

Link to website

Alexander Kathan

→ Group Björn Schuller
Health Informatics

Link to website

Nikolai Körber

→ Group Björn Schuller
Health Informatics

Link to website

Mina Adel Fahmy Nessiem

→ Group Björn Schuller
Health Informatics

Link to website

Filip Packań

→ Group Björn Schuller
Health Informatics

Link to website

Simon Pistrosch

→ Group Björn Schuller
Health Informatics

Link to website

Michelle Schlicher

→ Group Björn Schuller
Health Informatics

Link to website

Anika Spiesberger

→ Group Björn Schuller
Health Informatics

Link to website

Mujtaba Hussain Razvi Syed

→ Group Björn Schuller
Health Informatics

Link to website

Iosif Tsangko

→ Group Björn Schuller
Health Informatics

Link to website

Zipeng Zhang

→ Group Björn Schuller
Health Informatics

Recent News @MCML

Link to MCML at ICML 2026

03.07.2026

MCML at ICML 2026

86 Accepted Papers (71 Main, and 15 Workshops)

Learn more

Link to MCML at EACL 2026

23.03.2026

MCML at EACL 2026

13 Accepted Papers (9 Main, and 4 Findings)

Learn more

Link to MCML Researchers in Highly-Ranked Journals

02.01.2026

MCML Researchers in Highly-Ranked Journals

86 Papers in 2026 Highlight Scientific Impact

Learn more

Link to Björn Schuller Guest on ZDF’s Terra X

08.12.2025

Björn Schuller Guest on ZDF’s Terra X

Emotion Recognition and Empathic AI

Learn more

Show all news of this group

Publications @MCML

2026

[124]

M. Gonzalez-Machorro • R. von Heynitz • J. Hanslmeier • F. Grimm • A.-I. Deac • A. Gründel • I. Cordts • B. W. Schuller
Towards Speech Impairment Prediction in German-Speaking Individuals with Amyotrophic Lateral Sclerosis.
Interspeech 2026 - 27th Annual Conference of the International Speech Communication Association. Sydney, Australia, Sep 27-Oct 01, 2026. To be published. Preprint available. arXiv URL

[123]

M. Krahn • L. Bastian • V. K. Garg • B. W. Schuller • T. Birdal
Collapsed Effective Operators for Higher-order Structures.
ICML 2026 - 43rd International Conference on Machine Learning. Seoul, South Korea, Jul 06-11, 2026. To be published. Preprint available. URL

[122]

N. N. Schmitt • M. D. Schlicher • A. Triantafyllopoulos • C. Gawrilow • C. Eickhoff • B. W. Schuller • J. Löchner
Monitoring Students' Well-Being through Journal Analysis -- Study Protocol of an Explorative Approach using Natural Language Processing on Typed and Transcribed Entries to Monitor and Generate Personalized Feedback.
Frontiers in Digital Health. Jul. 2026. To be published. URL

[121]

Q. Sun • Y. Chang • Y. Li • X. Shao • Z. Zhang • B. W. Schuller
CHARM: Charge Calibration and Acoustic Rescue for LLM-based Multimodal Sarcasm Detection.
Preprint (Jul. 2026). arXiv GitHub

[120]

M. R. Wrobel • K. D. Bartl-Pokorny • T. Zorcec • D. E. Barkana • H. Kose • M. Milling • B. Robins • B. W. Schuller • A. Landowska
Guidelines for Emotion Recognition in Robot-Supported Interventions in Autism.
International Journal of Social Robotics 18.68. Jun. 2026. DOI

[119]

M. Moldovan • A. Batliner • T. M. Berghaus • B. W. Schuller • A. Triantafyllopoulos
CoughPhase-CLR: Designing an acoustics-informed foundation model for coughing sound classification.
Preprint (Jun. 2026). arXiv GitHub

[118]

Q. Sun • Y. Chang • Z. Zhang • B. W. Schuller
SIGMA: Saliency-Guided Sparse Mask Attacks for Speech Emotion Recognition.
Preprint (Jun. 2026). arXiv

[117]

I. Tsangko • A. Triantafyllopoulos • B. W. Schuller
Acoustic Cue Alignment in Audio Language Models for Speech Emotion Recognition.
Preprint (Jun. 2026). arXiv

[116]

A. Triantafyllopoulos • I. Tsangko • B. W. Schuller
Bringing Multimodal Foundation Models to Hearing Aids.
ICASSP 2026 - IEEE International Conference on Acoustics, Speech and Signal Processing. Barcelona, Spain, May 04-08, 2026. DOI

[115]

M. Milling • A. Triantafyllopoulos • S. Rampp • A. Akman • B. W. Schuller
Leveraging Sample Difficulty in Computer Audition Analysis.
IEEE Access 11.2023. May. 2026. DOI

[114]

A. Gebhard • A. Triantafyllopoulos • D. Arend • S. Müller • S. Schmidt • M. Scherer-Lorenzen • B. W. Schuller
CoarseSoundNet: Building a reliable model for ecological soundscape analysis.
Preprint (May. 2026). arXiv GitHub

[113]

A. Triantafyllopoulos • J. Šťastný • A. Terpinas • T. Liu • Y. Wang • B. W. Schuller
A conceptual framework for learning to listen by reward: Curiosity-driven search for novel sources.
Preprint (May. 2026). arXiv

[112]

Q. Sun • Y. Li • A. Javadov • X. Wu • B. W. Schuller
Prior-Aligned Frequency-Domain Explanations for Heart Sound Classification: A Scale-Consistent Attribution Approach.
Frontiers in Artificial Intelligence 9. Apr. 2026. DOI

[111]

A. Triantafyllopoulos • A. Batliner • B. W. Schuller
Charting 15 years of progress in deep learning for speech emotion recognition: A replication study.
IEEE Transactions on Affective Computing Early Access. Apr. 2026. DOI

[110]

Y. Zuo • B. Chen • C. Maksimovic • J. Qi • J. Hu • B. W. Schuller
Looking Alike Does Not Mean Judging Alike: Dissociation Between Aesthetic Consistency and Gaze Consistency.
International Journal of Human–Computer Interaction. Apr. 2026. DOI

[109]

Z. Yu • S. Escalera • D.-P. Fan • B. W. Schuller • P. H. S. Torr
Editorial for Special Issue on Subtle Visual Computing.
Machine Intelligence Research 23. Apr. 2026. DOI

[108]

M. Milling • A. Triantafyllopoulosa • S. D. N. Rampp • B. W. Schuller
A frequency analysis of filterbank initialisation and noise augmentation for LEAF.
Scientific Reports 16.13410. Apr. 2026. DOI GitHub

[107]

Y. Li • Q. Sun • H. Li • L. Specia • B. W. Schuller
Explainable detection of machine generated music and early systematic evaluation.
Scientific Reports 16.13757. Apr. 2026. DOI GitHub

[106]

L. Christ • S. Amiriparian
Training-Free Text Emotion Tagging via LLM-Based Best-Worst Scaling.
Findings @EACL 2026 - Findings of the 19th Conference of the European Chapter of the Association for Computational Linguistics. Rabat, Morocco, Mar 24-29, 2026. DOI

[105]

J. K. Throm • M. Milling • A. Triantafyllopoulos • A. Kathan • A. F. Dörsam • J. Löchner • B. W. Schuller • K. E. Giel
Affective Dimensions in Maternal Voice During Child Feeding in Mothers With and Without Eating Disorder History—Findings From a Machine Learning Analysis of Speech Data.
European Eating Disorders Review 34.2. Mar. 2026. DOI

[104]

Y. Ni • R. Liang • X. Hao • J. Cheng • Q. Wang • C. Huang • C. Zou • W. Zhou • W. Ding • B. W. Schuller
Affine Modulation-based Audiogram Fusion Network for Joint Noise Reduction and Hearing Loss Compensation.
Information Fusion 127. Part A.103726. Mar. 2026. DOI GitHub

[103]

X. Jing • A. Triantafyllopoulos • J. Wang • S. Amiriparian • J. Luo • B. W. Schuller
EmoSURA: Towards Accurate Evaluation of Detailed and Long-Context Emotional Speech Captions.
Preprint (Mar. 2026). arXiv

[102]

M. Milling • A. Triantafyllopoulos • A. Gebhard • S. Rampp • B. W. Schuller
How Class Ontology and Data Scale Affect Audio Transfer Learning.
Preprint (Mar. 2026). arXiv

[101]

S. Pistrosch • K. Avramidis • Z. Ren • T. Feng • J. Lee • M. Gonzalez-Machorro • A. Batliner • T. Schultz • S. Narayanan • B. W. Schuller
Affect Decoding in Phonated and Silent Speech Production from Surface EMG.
Preprint (Mar. 2026). arXiv

[100]

Y. Li • H. Li • L. Specia • B. W. Schuller
M6: multi-generator, multi-domain, multi-lingual and cultural, multi-genres, multi-instrument machine-generated music detection databases.
Scientific Reports In press. Feb. 2026. DOI

[99]

B. W. Schuller • A. Mallol-Ragolta • A. P. Almansa • I. Tsangko • M. M. Amin • A. Semertzidou • L. Christ • S. Amiriparian
Affective Computing Has Changed: The Foundation Model Disruption.
npj Artificial Intelligence 2.16. Jan. 2026. DOI

[98]

X. Jing • J. Wang • A. Triantafyllopoulos • M. Gerczuk • S. Amiriparian • J. Luo • B. W. Schuller
SmoothCLAP: Soft-Target Enhanced Contrastive Language-Audio Pretraining for Affective Computing.
Preprint (Jan. 2026). arXiv

2025

[97]

A. Mallol-Ragolta • B. W. Schuller
ProtoCLAP – Prototypical Contrastive Language-Audio Pretraining.
ASRU 2025 - IEEE Automatic Speech Recognition and Understanding Workshop. Honolulu, HI, USA, Dec 06-10, 2025. DOI

[96]

M. Milling
Training Dynamics in Deep Learning for Computer Audition and Beyond.
Dissertation TU München. Dec. 2025. URL

[95]

L. Christ • S. Amiriparian • B. W. Schuller • S. Müller
Automatically Predicting Social Perception From Faces Across 35 Dimensions.
IEEE Access 13. Dec. 2025. DOI

[94]

J. Han • Z. Gao • S. Gao • J. Liu • H. Chen • Z. Zhang • B. W. Schuller
Pioneering Multimodal Emotion Recognition in the Era of Large Models: From Closed Sets to Open Vocabularies.
Preprint (Dec. 2025). arXiv

[93]

A. Triantafyllopoulos • A. Spiesberger • I. Tsangko • X. Jing • V. Distler • F. Dietz • F. Alt • B. W. Schuller
Vishing: Detecting social engineering in spoken communication — A first survey & urgent roadmap to address an emerging societal challenge.
Computer Speech and Language 94.101802. Nov. 2025. DOI

[92]

D. Arend • A. Gebhard • A. Triantafyllopoulos • B. W. Schuller • M. Scherer-Lorenzen • S. Müller
Soundscape-based evaluation of small-scale forest management interventions.
Forest Ecology and Management 596.123067. Nov. 2025. DOI

[91]

I. Tsangko • A. Triantafyllopoulos • A. Abdelmoula • A. Mallol-Ragolta • B. W. Schuller
Reading Smiles: Proxy Bias in Foundation Models for Facial Emotion Recognition.
IEEE Access 13. Nov. 2025. DOI

[90]

Z. Wang • Y. Tan • H. Zhang • R. Wang • B. Hu • Y. Yamamoto • K. Qian • B. W. Schuller
Can Information Representations Inspired by the Human Auditory Perception Benefit Computer Audition-based Disease Detection? An Interpretable Comparative Study.
IEEE Journal of Biomedical and Health Informatics 30.6. Nov. 2025. DOI

[89]

Y. Li • Z. Wei • H. Yu • J. Xue • H. Zhou • B. W. Schuller
DOTA-ME-CS: Daily Oriented Text Audio-Mandarin English-Code Switching Dataset.
Preprint (Nov. 2025). arXiv

[88]

A. Gebhard • A. Triantafyllopoulos • I. Tsangko • B. W. Schuller
Towards Audio-based Zero-Shot Action Recognition in Kitchen Environments.
DCASE 2025 - Workshop on Detection and Classification of Acoustic Scenes and Events. Barcelona, Spain, Oct 30-31, 2025. DOI

[87]

J. Löchner • M. Bolivar • L. Booth • S. Canella • M. Calobro • J. Firth • A. Garcia-Palacios • A. Kyritsaka • L. B. Sander • C. Seiferth • L. Teesson • J. Tyrowicz • L. Vogel • E. Wheeler • J. Wolstein • B. W. Schuller
Multidisciplinary perspectives on personalised prevention in youth mental health.
Frontiers in Digital Health 7. Oct. 2025. DOI

[86]

M. Schlicher • Y. Li • S. M. K. Murthy • Q. Sun • B. W. Schuller
Emotionally Adaptive Support: A Narrative Review of Affective Computing for Mental Health.
Frontiers in Digital Health 7. Oct. 2025. DOI

[85]

J. Cheng • R. Liang • Y. Ni • C. Xu • J. Li • W. Zhou • R. Liu • B. W. Schuller • X. Hao
I2RF-TFCKD: Intra-Inter Representation Fusion with Time-Frequency Calibration Knowledge Distillation for Speech Enhancement.
Preprint (Oct. 2025). arXiv

[84]

A. Triantafyllopoulos • I. Tsangko • M. Müller • H. Schröter • B. W. Schuller
Speaker vs Noise Conditioning for Adaptive Speech Enhancement.
ITG-SC 2025 - 16th ITG Conference on Speech Communication. Berlin, Germany, Sep 24-26, 2025. URL

[83]

F. B. Pokorny • K. D. Bartl-Pokorny
Editorial: Artificial intelligence for child health and wellbeing.
Frontiers in Digital Health 7. Sep. 2025. DOI

[82]

X. Xu • B. W. Schuller • E. André • E. Cambria
Guest Editorial Extremely Low-Resource Autonomous Affective Learning.
IEEE Transactions on Affective Computing 16.3. Sep. 2025. DOI

[81]

J. Ding • Q. Sun • A. Akman • B. W. Schuller
Cross-Dialect Bird Species Recognition with Dialect-Calibrated Augmentation.
Preprint (Sep. 2025). arXiv

[80]

S. M. K. Murthy • U. Airsang • K. Rajamani • B. W. Schuller
From Code to Carbon: Being Energy Efficient while Coding.
ICSRF 2025 - International Conference on Sustainable & Resilient Futures: Bridging Science, Policy, & Practice . Kollam, India, Aug 29-Sep 01, 2025. DOI

[79]

M. Gonzalez-Machorro • U. Reichel • P. Hecker • H. Hammer • H. Sagha • F. Eyben • R. Hoepner • B. W. Schuller
Speech-Based Depressive Mood Detection in the Presence of Multiple Sclerosis: A Cross-Corpus and Cross-Lingual Study.
ICNLSP 2025 - 8th International Conference on Natural Language and Speech Processing. Odense, Denmark, Aug 25-27, 2025. URL

[78]

U. Reichel • M. Gonzalez Machorro • L. Ehlen • P. Hecker • D. Peitz • C. Werner • F. Burkhardt • C. Kohlschein • F. Eyben • B. W. Schuller
Domain adaptation and question-answer pooling for Aphasia modelling.
ICNLSP 2025 - 8th International Conference on Natural Language and Speech Processing. Odense, Denmark, Aug 25-27, 2025. URL

[77]

Y. Li • S. Shao • M. Milling • B. W. Schuller
Large language models for depression recognition in spoken language integrating psychological knowledge.
Frontiers in Computer Science. Aug. 2025. DOI GitHub

[76]

H. Zhang • F. Tian • Y. Tan • L. Shen • E. Li • J. Ma • J. Liu • K. Qian • J. Li • B. Hu • Y. Yamamoto • B. W. Schuller
Towards Practical Colorectal Cancer Diagnosis: A Bowel Sound-Based System with Portable Sensor and On-Board Lightweight AI Model.
IEEE Internet of Things Journal 12.21. Aug. 2025. DOI

[75]

A. Triantafyllopoulos • I. Tsangko • A. Gebhard • A. Mesaros • T. Virtanen • B. W. Schuller
Computer Audition: From Task-Specific Machine Learning to Foundation Models.
Proceedings of the IEEE 113.4. Aug. 2025. DOI

[74]

Y. Li • Q. Sun • M. Schlicher • Y. W. Lim • B. W. Schuller
Artificial Emotion: A Survey of Theories and Debates on Realising Emotion in Artificial Intelligence.
Preprint (Aug. 2025). arXiv

[73]

Z. Ren • S. Pistrosch • B. Coşkun • K. Scheck • A. Batliner • B. W. Schuller • T. Schultz
An Introduction to Silent Paralinguistics.
Preprint (Aug. 2025). arXiv

[72]

A. Gonzalez-Machorro • R. von Heynitz • K. Scherzer • I. Cordts • B. W. Schuller
Detection of Amyotrophic Lateral Sclerosis with Computer Audition: An Impact Analysis of Different Speech Tasks.
EMBC 2025 - 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Copenhagen, Denmark, Jul 14-18, 2025. DOI

[71]

Z. Ge • X. Xu • H. Guo • B. W. Schuller
Multi-Task Partially Spoofed Speech Detection Using a Dual-View Graph Neural Network Assisted Segment-Level Module.
IEEE Transactions on Audio, Speech and Language Processing 33. Jul. 2025. DOI

[70]

Y. Yang • R. Liang • Y. Ni • Y. Xie • C. Zou • B. W. Schuller
A Non-intrusive Speech Quality Evaluation Framework for Hearing Aids Based on Speech Label Assistance and Multi-task Learning Strategy.
IEEE Transactions on Audio, Speech and Language Processing 33. Jul. 2025. DOI

[69]

M. Keinert • S. Pistrosch • A. Mallol-Ragolta • B. W. Schuller • M. Berking
Facial Emotion Recognition of 16 Distinct Emotions From Smartphone Videos: Comparative Study of Machine Learning and Human Performance.
Journal of Medical Internet Research 27. Jul. 2025. DOI

[68]

S. M. K. Murthy • K. Rajamani • S. T. Rajamani • Y. Li • Q. Sun • B. W. Schuller
Automatic Contouring of Spinal Vertebrae on X-Ray using a Novel Sandwich U-Net Architecture.
Preprint (Jul. 2025). arXiv

[67]

A. Mallol-Ragolta • M. Gonzalez-Machorro • R. von Heynitz • K. Scherzer • I. Cordts • B. W. Schuller
Early Detection of ALS in Absence of Speech Impairments with Computer Audition.
AIME 2025 - 23rd International Conference on Artificial Intelligence in Medicine. Pavia, Italy, Jun 23-26, 2025. DOI

[66]

I. Tsangko • A. Triantafyllopoulos • E. Kyriakidis • G. Margetis • B. W. Schuller
Large Language Models for the Analysis of Project Proposals.
AI-HCI 2025 - 6th International Conference on Artificial Intelligence in Human Computer Interaction. Gothenburg, Sweden, Jun 22-27, 2025. DOI

[65]

K. D. Bartl-Pokorny • A. Mallol-Ragolta • A. Spiesberger • A. Semertzidou • J. Löchner • F. B. Pokorny • B. W. Schuller
'Hey Smartphone, Am I Ill?' Detecting Diseases From The Voice.
Frontiers Frontiers for Young Minds. Jun. 2025. URL

[64]

L. Mamede • R. C. Sabàb • S. Van Coillie • J. Prevot • S. Sánchez-Ramón • C. Poli • A. Barasa • B. W. Schuller • A. Hendel • N. Garcelon • C. Boersma • P. Lee • C. Booth • L. D. Notarangelo • J. Drabwell • N. L. Rider • F. Staal • S. O. Burns • M. van Hagen • M. Pergrnt • J. G. Rivière • N. Mahlaoui
Navigating disruption in the PID landscape: embracing opportunities and anticipating threats in the next ten years.
Frontiers in Immunology 16. May. 2025. DOI

[63]

X. Jing • J. Wang • I. Tsangko • A. Triantafyllopoulos • B. W. Schuller
MELT: Towards Automated Multimodal Emotion Data Annotation by Leveraging LLM Embedded Knowledge.
Preprint (May. 2025). arXiv

[62]

X. Jing • K. Zhou • A. Triantafyllopoulos • B. W. Schuller
Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models.
ICASSP 2025 - IEEE International Conference on Acoustics, Speech and Signal Processing. Hyderabad, India, Apr 06-11, 2025. DOI

[61]

I. Tsangko • A. Triantafyllopoulos • M. Müller • H. Schröter • B. W. Schuller
DFingerNet: Noise-Adaptive Speech Enhancement for Hearing Aids.
ICASSP 2025 - IEEE International Conference on Acoustics, Speech and Signal Processing. Hyderabad, India, Apr 06-11, 2025. DOI

[60]

W. Qi • X. Xu • K. Qian • B. W. Schuller • G. Fortino • A. Aliverti
A Review of AIoT-Based Human Activity Recognition: From Application to Technique.
IEEE Journal of Biomedical and Health Informatics 29.4. Apr. 2025. DOI

[59]

X. Qiu • W. Qiu • Y. Zhang • K. Qian • C. Li • B. Hu • B. W. Schuller • Y. Yamamoto
FedKDC: Consensus-Driven Knowledge Distillation for Personalized Federated Learning in EEG-Based Emotion Recognition.
IEEE Journal of Biomedical and Health Informatics 29.8. Apr. 2025. DOI GitHub

[58]

L. Christ • S. Amiriparian • A. Kathan • N. Müller • A. König • B. W. Schuller
Towards Multimodal Prediction of Spontaneous Humor: A Novel Dataset and First Results.
IEEE Transactions on Affective Computing 16.2. Apr. 2025. DOI

[57]

L. Shen • H. Zhang • C. Zhu • R. Li • K. Qian • F. Tian • B. Hu • B. W. Schuller • Y. Yamamoto
Enhancing Emotion Regulation in Mental Disorder Treatment: An AIGC-based Closed-Loop Music Intervention System.
IEEE Transactions on Affective Computing 16.3. Apr. 2025. DOI

[56]

Z. Li • Z. Wang • X. Xu • Y. Chen • B. W. Schuller
Unsupervised Domain-Adaptive Semantic Segmentation for Surgical Instruments Leveraging Dropout-Enhanced Dual Heads and Coarse-Grained Classification Branch.
IEEE Transactions on Medical Robotics and Bionics 7.3. Apr. 2025. DOI

[55]

L. Zhu • R. Wang • X. Jin • Y. Li • F. Tian • R. Cai • K. Qian • X. Hu • B. Hu • Y. Yamamoto • B. W. Schuller
Explainable Depression Classification Based on EEG Feature Selection from Audio Stimuli.
IEEE Transactions on Neural Systems and Rehabilitation Engineering 33. Apr. 2025. DOI

[54]

J. Xie • Y. Wang • X. Qian • J. Zhang • B. W. Schuller
Improving Bird Vocalization Recognition in Open-Set Cross-Corpus Scenarios with Semantic Feature Reconstruction and Dual Strategy Scoring.
IEEE Signal Processing Letters 32. Mar. 2025. DOI

[53]

Y. Li • M. Milling • B. W. Schuller
Neuroplasticity in Artificial Intelligence -- An Overview and Inspirations on Drop In & Out Learning.
Preprint (Mar. 2025). arXiv

[52]

Y. Li • Q. Sun • S. M. K. Murthy • E. Alturki • B. W. Schuller
GatedxLSTM: A Multimodal Affective Computing Approach for Emotion Recognition in Conversations.
Preprint (Mar. 2025). arXiv

[51]

Q. Sun • A. Akman • B. W. Schuller
Explainable Artificial Intelligence for Medical Applications: A Review.
ACM Transactions on Computing for Healthcare 6.2. Feb. 2025. DOI

[50]

Z. Sun • J. Kang • K. Qian • B. W. Schuller • B. Hu
Creating Healthier Living Environments: The Role of Soundscapes in Promoting Mental Health and Well-Being.
IEEE Transactions on Computational Social Systems 12.1. Feb. 2025. DOI

[49]

J. F. Bauer • L. Schindler-Gmelch • M. Gerczuk • B. W. Schuller • M. Berking
Prosody-focused feedback enhances the efficacy of anti-depressive self-statements in depressed individuals – A randomized controlled trial.
Behaviour Research and Therapy 184.104667. Jan. 2025. DOI

[48]

M. Milling • S. D. Rampp • A. Triantafyllopoulos • M. P. Plaza • J. O. Brunner • C. Traidl-Hoffmann • B. W. Schuller • A. Damialis
Automating airborne pollen classification: Identifying and interpreting hard samples for classifiers.
Heliyon 11.2. Jan. 2025. DOI GitHub

[47]

F. Tian • H. Zhang • Y. Tan • L. Zhu • L. Shen • K. Qian • B. Hu • B. W. Schuller • Y. Yamamoto
An On-Board Executable Multi-Feature Transfer-Enhanced Fusion Model for Three-Lead EEG Sensor-Assisted Depression Diagnosis.
IEEE Journal of Biomedical and Health Informatics 29.1. Jan. 2025. DOI

[46]

A. Akman • Q. Sun • B. W. Schuller
Improving Audio Explanations using Audio Language Models.
IEEE Signal Processing Letters 32. Jan. 2025. DOI

[45]

Y. Sun • Y. Zhou • X. Xu • J. Qi • F. Xu • Z. Ren • B. W. Schuller
Weakly-Supervised Depression Detection in Speech Through Self-Learning Based Label Correction.
IEEE Transactions on Audio, Speech and Language Processing 33. Jan. 2025. DOI

[44]

W. Mayr • A. Triantafyllopoulos • A. Batliner • B. W. Schuller • T. M. Berghaus
Assessing the Clinical and Functional Status of COPD Patients Using Speech Analysis During and After Exacerbation.
International Journal of Chronic Obstructive Pulmonary Disease 20. Jan. 2025. DOI

[43]

S. Iqbal • X. Zhong • M. A. Khan • Z. Wu • N. A. Almujally • W. Liu • B. W. Schuller • A.
Cross-modal invariant learning with latent diffusion for reliable medical diagnosis under dynamic shifts.
Neurocomputing 665.132111. Jan. 2025. DOI

[42]

Z. Yang • M. Song • X. Jing • H. Zhang • K. Qian • B. Hu • K. Tamada • T. Takumi • B. W. Schuller • Y. Yamamoto
MADUV: The 1st INTERSPEECH Mice Autism Detection via Ultrasound Vocalization Challenge.
Preprint (Jan. 2025). arXiv

2024

[41]

A. Triantafyllopoulos • B. W. Schuller
Hearing aids in the era of foundation models.
GMS Zeitschrift für Audiologie 6.28. Dec. 2024. DOI

[40]

Q. Sun • A. Akman • X. Jing • M. Milling • B. W. Schuller
Audio-based Kinship Verification Using Age Domain Conversion.
IEEE Signal Processing Letters 32. Dec. 2024. DOI

[39]

L. Shen • H. Zhang • C. Zhu • R. Li • K. Qian • W. Meng • F. Tian • B. Hu • B. W. Schuller • Y. Yamamoto
A First Look at Generative Artificial Intelligence Based Music Therapy for Mental Disorders.
IEEE Transactions on Consumer Electronics 71.3. Dec. 2024. DOI

[38]

Y. Li • M. Milling • L. Specia • B. W. Schuller
From Audio Deepfake Detection to AI-Generated Music Detection -- A Pathway and Overview.
Preprint (Dec. 2024). arXiv

[37]

Q. Sun • Y. Li • E. Alturki • S. M. K. Murthy • B. W. Schuller
Towards Friendly AI: A Comprehensive Review and New Perspectives on Human-AI Alignment.
Preprint (Dec. 2024). arXiv

[36]

A. Kathan • S. Amiriparian • L. Christ • S. Eulitz • B. W. Schuller
Automatic Speech-Based Charisma Recognition and the Impact of Integrating Auxiliary Characteristics.
TELEPRESENCE 2024 - IEEE Conference on Telepresence. Pasadena, CA, USA, Nov 16-17, 2024. DOI

[35]

S. Amiriparian • M. Gerczuk • J. Lutz • W. Strube • I. Papazova • A. Hasan • A. Kathan • B. W. Schuller
Non-Invasive Suicide Risk Prediction Through Speech Analysis.
EHB 2024 - 12th E-Health and Bioengineering Conference. IASI, Romania, Nov 14-15, 2024. DOI

[34]

A. Mallol-Ragolta • M. Milling • B. W. Schuller
Multi-Triplet Loss-Based Models for Categorical Depression Recognition from Speech.
IberSPEECH 2024 - 7th Conference IberSPEECH 2024. Aveiro, Portugal, Nov 11-13, 2024. PDF

[33]

A. Mallol-Ragolta • A. Spiesberger • A. B. Salvador • B. W. Schuller
Prototypical Networks for Speech Emotion Recognition in Spanish.
IberSPEECH 2024 - 7th Conference IberSPEECH 2024. Aveiro, Portugal, Nov 11-13, 2024. PDF

[32]

A. Mallol-Ragolta • A. Spiesberger • B. W. Schuller
Face Mask Type and Coverage Area Recognition from Speech with Prototypical Networks.
IberSPEECH 2024 - 7th Conference IberSPEECH 2024. Aveiro, Portugal, Nov 11-13, 2024. PDF

[31]

K. D. Bartl-Pokorny • C. Zitta • M. Beirit • G. Vogrinec • B. W. Schuller • F. B. Pokorny
Focused review on artificial intelligence for disease detection in infants.
Frontiers in Digital Health 6. Nov. 2024. DOI

[30]

S. Rampp • M. Milling • A. Triantafyllopoulos • B. W. Schuller
Does the Definition of Difficulty Matter? Scoring Functions and their Role for Curriculum Learning.
Preprint (Nov. 2024). arXiv

[29]

S. Rampp • A. Triantafyllopoulos • M. Milling • B. W. Schuller
autrainer: A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks.
Preprint (Nov. 2024). arXiv

[28]

K. R. Scherer • F. Burkhardt • U. D. Reichel • F. Eyben • B. W. Schuller
Using voice analysis as an early indicator of risk for depression in young adults.
Preprint (Nov. 2024). arXiv

[27]

A. Triantafyllopoulos • Y. Terhorst • I. Tsangko • F. B. Pokorny • K. D. Bartl-Pokorny • L. Seizer • A. Klein • J. Chim • D. Atzil-Slonim • M. Liakata • M. Bühner • J. Löchner • B. W. Schuller
Large language models for mental health.
Preprint (Nov. 2024). arXiv

[26]

S. Amiriparian • L. Christ • A. Kathan • M. Gerczuk • N. Müller • S. Klug • L. Stappen • A. König • E. Cambria • B. W. Schuller • S. Eulitz
The MuSe 2024 Multimodal Sentiment Analysis Challenge: Social Perception and Humor Recognition.
MuSe @MM 2024 - 5th on Multimodal Sentiment Analysis Challenge and Workshop: Social Perception and Humor at the 32nd ACM International Conference on Multimedia. Melbourne, Australia , Oct 28-Nov 01, 2024. DOI

[25]

M. M. Amin • R. Mao • E. Cambria • B. W. Schuller
A Wide Evaluation of ChatGPT on Affective Computing Tasks.
IEEE Transactions on Affective Computing 15.4. Oct. 2024. DOI

[24]

M. M. Amin • B. W. Schuller
On Prompt Sensitivity of ChatGPT in Affective Computing.
ACII 2024 - 12th International Conference on Affective Computing and Intelligent Interaction. Glasgow, UK, Sep 15-18, 2024. DOI

[23]

A. Triantafyllopoulos • L. Christ • A. Gebhard • X. Jing • A. Kathan • M. Milling • I. Tsangko • S. Amiriparian • B. W. Schuller
Beyond deep learning: Charting the next frontiers of affective computing.
Intelligent Computing 3.0089. Sep. 2024. DOI

[22]

S. Amiriparian • F. Packań • M. Gerczuk • B. W. Schuller
ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets.
Interspeech 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. DOI

[21]

L. Christ • S. Amiriparian • F. Hawighorst • A.-K. Schill • A. Boutalikakis • L. Graf-Vlachy • A. König • B. W. Schuller
This Paper Had the Smartest Reviewers -- Flattery Detection Utilising an Audio-Textual Transformer-Based Approach.
Interspeech 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. DOI

[20]

M. Gerczuk • S. Amiriparian • J. Lutz • W. Strube • I. Papazova • A. Hasan • B. W. Schuller
Exploring Gender-Specific Speech Patterns in Automatic Suicide Risk Assessment.
Interspeech 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. DOI

[19]

S. Kalabakov • M. Gonzalez-Machorro • F. Eyben • B. W. Schuller • B. Arnrich
A Comparative Analysis of Federated Learning for Speech-Based Cognitive Decline Detection.
Interspeech 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. PDF

[18]

A. Kathan • M. Bürger • A. Triantafyllopoulos • S. Milkus • R. Musil • B. W. Schuller • S. Amiriparian
Real-world PTSD Recognition: A Cross-corpus and Cross-linguistic Evaluation.
Interspeech 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. DOI

[17]

O. Schrüfer • M. Milling • F. Burkhardt • F. Eyben • B. W. Schuller
Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition.
Interspeech 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. PDF

[16]

A. Spiesberger • A. Triantafyllopoulos • A. Kathan • A. Semertzidou • C. Gawrilow • T. Reinelt • W. Rauch • B. W. Schuller
'So... my child...' -- How Child ADHD Influences the Way Parents Talk.
Interspeech 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. PDF

[15]

A. Triantafyllopoulos • A. Batliner • S. Rampp • M. Milling • B. W. Schuller
INTERSPEECH 2009 Emotion Challenge Revisited: Benchmarking 15 Years of Progress in Speech Emotion Recognition.
Interspeech 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. DOI

[14]

A. Triantafyllopoulos • B. W. Schuller
Enrolment-based personalisation for improving individual-level fairness in speech emotion recognition.
Interspeech 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. PDF

[13]

M. Milling • S. Liu • A. Triantafyllopoulos • I. Aslan • B. W. Schuller
Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance.
Journal of Computer Science and Technology 39. Sep. 2024. DOI

[12]

A. Triantafyllopoulos • A. Gebhard • M. Milling • S. Rampp • B. W. Schuller
An Automatic Analysis of Ultrasound Vocalisations for the Prediction of Interaction Context in Captive Egyptian Fruit Bats.
EUSIPCO 2024 - 32nd European Signal Processing Conference. Lyon, France, Aug 26-30, 2024. DOI

[11]

L. Christ • S. Amiriparian • M. Milling • I. Aslan • B. W. Schuller
Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning.
Findings @ACL 2024 - Findings of the 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. DOI

[10]

Z. Ren • Y. Chang • T. T. Nguyen • Y. Tan • K. Qian • B. W. Schuller
A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era.
IEEE Computational Intelligence Magazine 19.3. Aug. 2024. DOI

[9]

A. Kathan • S. Amiriparian • A. Triantafyllopoulos • A. Gebhard • S. Milkus • J. Hohmann • P. Muderlak • J. Schottdorf • R. Musil • B. W. Schuller
Personalised Speech-Based PTSD Prediction Using Weighted-Instance Learning.
EMBC 2024 - 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Orlando, FL, USA, Jul 15-19, 2024. DOI

[8]

S. T. Rajamani • K. Rajamani • A. J • K. R • B. W. Schuller
CBAM_SAUNet: A novel attention U-Net for effective segmentation of corner cases.
EMBC 2024 - 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Orlando, FL, USA, Jul 15-19, 2024. DOI

[7]

A. Spiesberger • A. Mallol-Ragolta • A. Triantafyllopoulos • B. W. Schuller
Towards Predicting Menstrual Cycle Phases Exploiting Paralinguistic Features.
EMBC 2024 - 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Orlando, FL, USA, Jul 15-19, 2024. DOI

[6]

W. Qiu • Y. Feng • Y. Li • Y. Chang • K. Qian • B. Hu • Y. Yamamoto • B. W. Schuller
Fed-MStacking: Heterogeneous Federated Learning With Stacking Misaligned Labels for Abnormal Heart Sound Detection.
IEEE Journal of Biomedical and Health Informatics 28.9. Jul. 2024. DOI

[5]

P. Purucker • C. Reil • A. Höß • B. W. Schuller
Deep Neural Quality of Service Prediction for Unmanned Aircraft System Communications.
IWCMC 2024 - 20th International Wireless Communications and Mobile Computing Conference. Cyprus, Greece, May 27-31, 2024. DOI

[4]

W. Qiu • C. Quan • L. Zhu • Y. Yu • Z. Wang • Y. Ma • M. Sun • Y. Chang • K. Qian • B. Hu • Y. Yamamoto • B. W. Schuller
Heart Sound Abnormality Detection From Multi-Institutional Collaboration: Introducing a Federated Learning Framework.
IEEE Transactions on Biomedical Engineering 71.10. May. 2024. DOI

[3]

A. Triantafyllopoulos • B. W. Schuller
Expressivity and Speech Synthesis.
Oxford Handbook of Expressivity in Language. Apr. 2024. arXiv URL

[2]

A. Mallol-Ragolta • B. W. Schuller
Coupling Sentiment and Arousal Analysis Towards an Affective Dialogue Manager.
IEEE Access 12. Feb. 2024. DOI

[1]

J. Xie • Y. Shi • D. Ni • M. Milling • S. Liu • J. Zhang • K. Qian • B. W. Schuller
Automatic Bird Sound Source Separation Based on Passive Acoustic Devices in Wild Environment.
IEEE Internet of Things Journal 11.9. Jan. 2024. DOI

©all images: LMU | TUM

2024-12-27 - Last modified: 2026-07-03