Research Group Björn Schuller
Björn Schuller
is professor of Health Informatics at TU Munich.
His research combines computer science with modern health care and medicine. The main focus lies in the acquisition, analysis, and interpretation of biosignals including in daily life, such as those generated in monitoring heart activity, metabolism, or neuronal activities. Additionally, acoustic, visual, and a variety of other parameters are also evaluated. The goal is prevention, diagnosis, as well as decision support and intervention through efficient, transparent, and trustworthy methods of current Artificial Intelligence.
Team members @MCML
PostDocs
PhD Students
Xin Jing
→ Group Björn Schuller
Health Informatics
Recent News @MCML
Publications @MCML
2026
[112]
M. Krahn • L. Bastian • V. K. Garg • B. W. Schuller • T. Birdal
Collapsed Effective Operators for Higher-order Structures.
ICML 2026 - 43rd International Conference on Machine Learning. Seoul, South Korea, Jul 06-11, 2026. To be published. Preprint available. URL
Collapsed Effective Operators for Higher-order Structures.
ICML 2026 - 43rd International Conference on Machine Learning. Seoul, South Korea, Jul 06-11, 2026. To be published. Preprint available. URL
[111]
A. Triantafyllopoulos • I. Tsangko • B. Schuller
Bringing Multimodal Foundation Models to Hearing Aids.
ICASSP 2026 - IEEE International Conference on Acoustics, Speech and Signal Processing. Barcelona, Spain, May 04-08, 2026. DOI
Bringing Multimodal Foundation Models to Hearing Aids.
ICASSP 2026 - IEEE International Conference on Acoustics, Speech and Signal Processing. Barcelona, Spain, May 04-08, 2026. DOI
[110]
M. Milling • A. Triantafyllopoulos • S. Rampp • A. Akman • B. W. Schuller
Leveraging Sample Difficulty in Computer Audition Analysis.
IEEE Access 11.2023. May. 2026. DOI
Leveraging Sample Difficulty in Computer Audition Analysis.
IEEE Access 11.2023. May. 2026. DOI
[109]
Q. Sun • Y. Li • A. Javadov • X. Wu • B. W. Schuller
Prior-Aligned Frequency-Domain Explanations for Heart Sound Classification: A Scale-Consistent Attribution Approach.
Frontiers in Artificial Intelligence Early Access. Apr. 2026. URL
Prior-Aligned Frequency-Domain Explanations for Heart Sound Classification: A Scale-Consistent Attribution Approach.
Frontiers in Artificial Intelligence Early Access. Apr. 2026. URL
[108]
A. Triantafyllopoulos • A. Batliner • B. W. Schuller
Charting 15 years of progress in deep learning for speech emotion recognition: A replication study.
IEEE Transactions on Affective Computing Early Access. Apr. 2026. DOI
Charting 15 years of progress in deep learning for speech emotion recognition: A replication study.
IEEE Transactions on Affective Computing Early Access. Apr. 2026. DOI
[107]
Y. Zuo • B. Chen • C. Maksimovic • J. Qi • J. Hu • B. W. Schuller
Looking Alike Does Not Mean Judging Alike: Dissociation Between Aesthetic Consistency and Gaze Consistency.
International Journal of Human–Computer Interaction. Apr. 2026. DOI
Looking Alike Does Not Mean Judging Alike: Dissociation Between Aesthetic Consistency and Gaze Consistency.
International Journal of Human–Computer Interaction. Apr. 2026. DOI
[106]
Z. Yu • S. Escalera • D.-P. Fan • B. W. Schuller • P. H. S. Torr
Editorial for Special Issue on Subtle Visual Computing.
Machine Intelligence Research 23. Apr. 2026. DOI
Editorial for Special Issue on Subtle Visual Computing.
Machine Intelligence Research 23. Apr. 2026. DOI
[105]
M. Milling • A. Triantafyllopoulosa • S. D. N. Rampp • B. W. Schuller
A frequency analysis of filterbank initialisation and noise augmentation for LEAF.
Scientific Reports 16.13410. Apr. 2026. DOI GitHub
A frequency analysis of filterbank initialisation and noise augmentation for LEAF.
Scientific Reports 16.13410. Apr. 2026. DOI GitHub
[104]
Y. Li • Q. Sun • H. Li • L. Specia • B. W. Schuller
Explainable detection of machine generated music and early systematic evaluation.
Scientific Reports 16.13757. Apr. 2026. DOI GitHub
Explainable detection of machine generated music and early systematic evaluation.
Scientific Reports 16.13757. Apr. 2026. DOI GitHub
[103]
L. Christ • S. Amiriparian
Training-Free Text Emotion Tagging via LLM-Based Best-Worst Scaling.
Findings @EACL 2026 - Findings of the 19th Conference of the European Chapter of the Association for Computational Linguistics. Rabat, Morocco, Mar 24-29, 2026. DOI
Training-Free Text Emotion Tagging via LLM-Based Best-Worst Scaling.
Findings @EACL 2026 - Findings of the 19th Conference of the European Chapter of the Association for Computational Linguistics. Rabat, Morocco, Mar 24-29, 2026. DOI
[102]
J. K. Throm • M. Milling • A. Triantafyllopoulos • A. Kathan • A. F. Dörsam • J. Löchner • B. W. Schuller • K. E. Giel
Affective Dimensions in Maternal Voice During Child Feeding in Mothers With and Without Eating Disorder History—Findings From a Machine Learning Analysis of Speech Data.
European Eating Disorders Review 34.2. Mar. 2026. DOI
Affective Dimensions in Maternal Voice During Child Feeding in Mothers With and Without Eating Disorder History—Findings From a Machine Learning Analysis of Speech Data.
European Eating Disorders Review 34.2. Mar. 2026. DOI
[101]
Y. Ni • R. Liang • X. Hao • J. Cheng • Q. Wang • C. Huang • C. Zou • W. Zhou • W. Ding • B. W. Schuller
Affine Modulation-based Audiogram Fusion Network for Joint Noise Reduction and Hearing Loss Compensation.
Information Fusion 127. Part A.103726. Mar. 2026. DOI GitHub
Affine Modulation-based Audiogram Fusion Network for Joint Noise Reduction and Hearing Loss Compensation.
Information Fusion 127. Part A.103726. Mar. 2026. DOI GitHub
[100]
X. Jing • A. Triantafyllopoulos • J. Wang • S. Amiriparian • J. Luo • B. W. Schuller
EmoSURA: Towards Accurate Evaluation of Detailed and Long-Context Emotional Speech Captions.
Preprint (Mar. 2026). arXiv
EmoSURA: Towards Accurate Evaluation of Detailed and Long-Context Emotional Speech Captions.
Preprint (Mar. 2026). arXiv
[99]
M. Milling • A. Triantafyllopoulos • A. Gebhard • S. Rampp • B. W. Schuller
How Class Ontology and Data Scale Affect Audio Transfer Learning.
Preprint (Mar. 2026). arXiv
How Class Ontology and Data Scale Affect Audio Transfer Learning.
Preprint (Mar. 2026). arXiv
[98]
S. Pistrosch • K. Avramidis • Z. Ren • T. Feng • J. Lee • M. Gonzalez-Machorro • A. Batliner • T. Schultz • S. Narayanan • B. W. Schuller
Affect Decoding in Phonated and Silent Speech Production from Surface EMG.
Preprint (Mar. 2026). arXiv
Affect Decoding in Phonated and Silent Speech Production from Surface EMG.
Preprint (Mar. 2026). arXiv
[97]
Y. Li • H. Li • L. Specia • B. W. Schuller
M6: multi-generator, multi-domain, multi-lingual and cultural, multi-genres, multi-instrument machine-generated music detection databases.
Scientific Reports In press. Feb. 2026. DOI
M6: multi-generator, multi-domain, multi-lingual and cultural, multi-genres, multi-instrument machine-generated music detection databases.
Scientific Reports In press. Feb. 2026. DOI
[96]
B. W. Schuller • A. Mallol-Ragolta • A. P. Almansa • I. Tsangko • M. M. Amin • A. Semertzidou • L. Christ • S. Amiriparian
Affective Computing Has Changed: The Foundation Model Disruption.
npj Artificial Intelligence 2.16. Jan. 2026. DOI
Affective Computing Has Changed: The Foundation Model Disruption.
npj Artificial Intelligence 2.16. Jan. 2026. DOI
[95]
X. Jing • J. Wang • A. Triantafyllopoulos • M. Gerczuk • S. Amiriparian • J. Luo • B. W. Schuller
SmoothCLAP: Soft-Target Enhanced Contrastive Language-Audio Pretraining for Affective Computing.
Preprint (Jan. 2026). arXiv
SmoothCLAP: Soft-Target Enhanced Contrastive Language-Audio Pretraining for Affective Computing.
Preprint (Jan. 2026). arXiv
2025
[94]
M. Milling
Training Dynamics in Deep Learning for Computer Audition and Beyond.
Dissertation TU München. Dec. 2025. URL
Training Dynamics in Deep Learning for Computer Audition and Beyond.
Dissertation TU München. Dec. 2025. URL
[93]
L. Christ • S. Amiriparian • B. W. Schuller • S. Müller
Automatically Predicting Social Perception From Faces Across 35 Dimensions.
IEEE Access 13. Dec. 2025. DOI
Automatically Predicting Social Perception From Faces Across 35 Dimensions.
IEEE Access 13. Dec. 2025. DOI
[92]
J. Han • Z. Gao • S. Gao • J. Liu • H. Chen • Z. Zhang • B. W. Schuller
Pioneering Multimodal Emotion Recognition in the Era of Large Models: From Closed Sets to Open Vocabularies.
Preprint (Dec. 2025). arXiv
Pioneering Multimodal Emotion Recognition in the Era of Large Models: From Closed Sets to Open Vocabularies.
Preprint (Dec. 2025). arXiv
[91]
A. Triantafyllopoulos • A. Spiesberger • I. Tsangko • X. Jing • V. Distler • F. Dietz • F. Alt • B. W. Schuller
Vishing: Detecting social engineering in spoken communication — A first survey & urgent roadmap to address an emerging societal challenge.
Computer Speech and Language 94.101802. Nov. 2025. DOI
Vishing: Detecting social engineering in spoken communication — A first survey & urgent roadmap to address an emerging societal challenge.
Computer Speech and Language 94.101802. Nov. 2025. DOI
[90]
I. Tsangko • A. Triantafyllopoulos • A. Abdelmoula • A. Mallol-Ragolta • B. W. Schuller
Reading Smiles: Proxy Bias in Foundation Models for Facial Emotion Recognition.
IEEE Access 13. Nov. 2025. DOI
Reading Smiles: Proxy Bias in Foundation Models for Facial Emotion Recognition.
IEEE Access 13. Nov. 2025. DOI
[89]
Z. Wang • Y. Tan • H. Zhang • R. Wang • B. Hu • Y. Yamamoto • K. Qian • B. W. Schuller
Can Information Representations Inspired by the Human Auditory Perception Benefit Computer Audition-based Disease Detection? An Interpretable Comparative Study.
IEEE Journal of Biomedical and Health Informatics Early Access. Nov. 2025. DOI
Can Information Representations Inspired by the Human Auditory Perception Benefit Computer Audition-based Disease Detection? An Interpretable Comparative Study.
IEEE Journal of Biomedical and Health Informatics Early Access. Nov. 2025. DOI
[88]
Y. Li • Z. Wei • H. Yu • J. Xue • H. Zhou • B. W. Schuller
DOTA-ME-CS: Daily Oriented Text Audio-Mandarin English-Code Switching Dataset.
Preprint (Nov. 2025). arXiv
DOTA-ME-CS: Daily Oriented Text Audio-Mandarin English-Code Switching Dataset.
Preprint (Nov. 2025). arXiv
[87]
A. Gebhard • A. Triantafyllopoulos • I. Tsangko • B. W. Schuller
Towards Audio-based Zero-Shot Action Recognition in Kitchen Environments.
DCASE 2025 - Workshop on Detection and Classification of Acoustic Scenes and Events. Barcelona, Spain, Oct 30-31, 2025. DOI
Towards Audio-based Zero-Shot Action Recognition in Kitchen Environments.
DCASE 2025 - Workshop on Detection and Classification of Acoustic Scenes and Events. Barcelona, Spain, Oct 30-31, 2025. DOI
[86]
J. Löchner • M. Bolivar • L. Booth • S. Canella • M. Calobro • J. Firth • A. Garcia-Palacios • A. Kyritsaka • L. B. Sander • C. Seiferth • L. Teesson • J. Tyrowicz • L. Vogel • E. Wheeler • J. Wolstein • B. W. Schuller
Multidisciplinary perspectives on personalised prevention in youth mental health.
Frontiers in Digital Health 7. Oct. 2025. DOI
Multidisciplinary perspectives on personalised prevention in youth mental health.
Frontiers in Digital Health 7. Oct. 2025. DOI
[85]
M. Schlicher • Y. Li • S. M. K. Murthy • Q. Sun • B. W. Schuller
Emotionally Adaptive Support: A Narrative Review of Affective Computing for Mental Health.
Frontiers in Digital Health 7. Oct. 2025. DOI
Emotionally Adaptive Support: A Narrative Review of Affective Computing for Mental Health.
Frontiers in Digital Health 7. Oct. 2025. DOI
[84]
J. Cheng • R. Liang • Y. Ni • C. Xu • J. Li • W. Zhou • R. Liu • B. W. Schuller • X. Hao
I2RF-TFCKD: Intra-Inter Representation Fusion with Time-Frequency Calibration Knowledge Distillation for Speech Enhancement.
Preprint (Oct. 2025). arXiv
I2RF-TFCKD: Intra-Inter Representation Fusion with Time-Frequency Calibration Knowledge Distillation for Speech Enhancement.
Preprint (Oct. 2025). arXiv
[83]
A. Triantafyllopoulos • I. Tsangko • M. Müller • H. Schröter • B. W. Schuller
Speaker vs Noise Conditioning for Adaptive Speech Enhancement.
ITG-SC 2025 - 16th ITG Conference on Speech Communication. Berlin, Germany, Sep 24-26, 2025. URL
Speaker vs Noise Conditioning for Adaptive Speech Enhancement.
ITG-SC 2025 - 16th ITG Conference on Speech Communication. Berlin, Germany, Sep 24-26, 2025. URL
[82]
F. B. Pokorny • K. D. Bartl-Pokorny
Editorial: Artificial intelligence for child health and wellbeing.
Frontiers in Digital Health 7. Sep. 2025. DOI
Editorial: Artificial intelligence for child health and wellbeing.
Frontiers in Digital Health 7. Sep. 2025. DOI
[81]
X. Xu • B. W. Schuller • E. André • E. Cambria
Guest Editorial Extremely Low-Resource Autonomous Affective Learning.
IEEE Transactions on Affective Computing 16.3. Sep. 2025. DOI
Guest Editorial Extremely Low-Resource Autonomous Affective Learning.
IEEE Transactions on Affective Computing 16.3. Sep. 2025. DOI
[80]
J. Ding • Q. Sun • A. Akman • B. W. Schuller
Cross-Dialect Bird Species Recognition with Dialect-Calibrated Augmentation.
Preprint (Sep. 2025). arXiv
Cross-Dialect Bird Species Recognition with Dialect-Calibrated Augmentation.
Preprint (Sep. 2025). arXiv
[79]
S. M. K. Murthy • U. Airsang • K. Rajamani • B. W. Schuller
From Code to Carbon: Being Energy Efficient while Coding.
ICSRF 2025 - International Conference on Sustainable & Resilient Futures: Bridging Science, Policy, & Practice . Kollam, India, Aug 29-Sep 01, 2025. DOI
From Code to Carbon: Being Energy Efficient while Coding.
ICSRF 2025 - International Conference on Sustainable & Resilient Futures: Bridging Science, Policy, & Practice . Kollam, India, Aug 29-Sep 01, 2025. DOI
[78]
M. Gonzalez-Machorro • U. Reichel • P. Hecker • H. Hammer • H. Sagha • F. Eyben • R. Hoepner • B. W. Schuller
Speech-Based Depressive Mood Detection in the Presence of Multiple Sclerosis: A Cross-Corpus and Cross-Lingual Study.
ICNLSP 2025 - 8th International Conference on Natural Language and Speech Processing. Odense, Denmark, Aug 25-27, 2025. URL
Speech-Based Depressive Mood Detection in the Presence of Multiple Sclerosis: A Cross-Corpus and Cross-Lingual Study.
ICNLSP 2025 - 8th International Conference on Natural Language and Speech Processing. Odense, Denmark, Aug 25-27, 2025. URL
[77]
U. Reichel • M. Gonzalez Machorro • L. Ehlen • P. Hecker • D. Peitz • C. Werner • F. Burkhardt • C. Kohlschein • F. Eyben • B. W. Schuller
Domain adaptation and question-answer pooling for Aphasia modelling.
ICNLSP 2025 - 8th International Conference on Natural Language and Speech Processing. Odense, Denmark, Aug 25-27, 2025. URL
Domain adaptation and question-answer pooling for Aphasia modelling.
ICNLSP 2025 - 8th International Conference on Natural Language and Speech Processing. Odense, Denmark, Aug 25-27, 2025. URL
[76]
Y. Li • S. Shao • M. Milling • B. W. Schuller
Large language models for depression recognition in spoken language integrating psychological knowledge.
Frontiers in Computer Science. Aug. 2025. DOI GitHub
Large language models for depression recognition in spoken language integrating psychological knowledge.
Frontiers in Computer Science. Aug. 2025. DOI GitHub
[75]
H. Zhang • F. Tian • Y. Tan • L. Shen • E. Li • J. Ma • J. Liu • K. Qian • J. Li • B. Hu • Y. Yamamoto • B. W. Schuller
Towards Practical Colorectal Cancer Diagnosis: A Bowel Sound-Based System with Portable Sensor and On-Board Lightweight AI Model.
IEEE Internet of Things Journal Early Access. Aug. 2025. DOI
Towards Practical Colorectal Cancer Diagnosis: A Bowel Sound-Based System with Portable Sensor and On-Board Lightweight AI Model.
IEEE Internet of Things Journal Early Access. Aug. 2025. DOI
[74]
A. Triantafyllopoulos • I. Tsangko • A. Gebhard • A. Mesaros • T. Virtanen • B. W. Schuller
Computer Audition: From Task-Specific Machine Learning to Foundation Models.
Proceedings of the IEEE Early Access. Aug. 2025. DOI
Computer Audition: From Task-Specific Machine Learning to Foundation Models.
Proceedings of the IEEE Early Access. Aug. 2025. DOI
[73]
Y. Li • Q. Sun • M. Schlicher • Y. W. Lim • B. W. Schuller
Artificial Emotion: A Survey of Theories and Debates on Realising Emotion in Artificial Intelligence.
Preprint (Aug. 2025). arXiv
Artificial Emotion: A Survey of Theories and Debates on Realising Emotion in Artificial Intelligence.
Preprint (Aug. 2025). arXiv
[72]
Z. Ren • S. Pistrosch • B. Coşkun • K. Scheck • A. Batliner • B. W. Schuller • T. Schultz
An Introduction to Silent Paralinguistics.
Preprint (Aug. 2025). arXiv
An Introduction to Silent Paralinguistics.
Preprint (Aug. 2025). arXiv
[71]
A. Gonzalez-Machorro • R. von Heynitz • K. Scherzer • I. Cordts • B. W. Schuller
Detection of Amyotrophic Lateral Sclerosis with Computer Audition: An Impact Analysis of Different Speech Tasks.
EMBC 2025 - 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Copenhagen, Denmark, Jul 14-18, 2025. DOI
Detection of Amyotrophic Lateral Sclerosis with Computer Audition: An Impact Analysis of Different Speech Tasks.
EMBC 2025 - 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Copenhagen, Denmark, Jul 14-18, 2025. DOI
[70]
Z. Ge • X. Xu • H. Guo • B. W. Schuller
Multi-Task Partially Spoofed Speech Detection Using a Dual-View Graph Neural Network Assisted Segment-Level Module.
IEEE Transactions on Audio, Speech and Language Processing 33. Jul. 2025. DOI
Multi-Task Partially Spoofed Speech Detection Using a Dual-View Graph Neural Network Assisted Segment-Level Module.
IEEE Transactions on Audio, Speech and Language Processing 33. Jul. 2025. DOI
[69]
Y. Yang • R. Liang • Y. Ni • Y. Xie • C. Zou • B. W. Schuller
A Non-intrusive Speech Quality Evaluation Framework for Hearing Aids Based on Speech Label Assistance and Multi-task Learning Strategy.
IEEE Transactions on Audio, Speech and Language Processing Early Access. Jul. 2025. DOI
A Non-intrusive Speech Quality Evaluation Framework for Hearing Aids Based on Speech Label Assistance and Multi-task Learning Strategy.
IEEE Transactions on Audio, Speech and Language Processing Early Access. Jul. 2025. DOI
[68]
M. Keinert • S. Pistrosch • A. Mallol-Ragolta • B. W. Schuller • M. Berking
Facial Emotion Recognition of 16 Distinct Emotions From Smartphone Videos: Comparative Study of Machine Learning and Human Performance.
Journal of Medical Internet Research 27. Jul. 2025. DOI
Facial Emotion Recognition of 16 Distinct Emotions From Smartphone Videos: Comparative Study of Machine Learning and Human Performance.
Journal of Medical Internet Research 27. Jul. 2025. DOI
[67]
S. M. K. Murthy • K. Rajamani • S. T. Rajamani • Y. Li • Q. Sun • B. W. Schuller
Automatic Contouring of Spinal Vertebrae on X-Ray using a Novel Sandwich U-Net Architecture.
Preprint (Jul. 2025). arXiv
Automatic Contouring of Spinal Vertebrae on X-Ray using a Novel Sandwich U-Net Architecture.
Preprint (Jul. 2025). arXiv
[66]
A. Mallol-Ragolta • M. Gonzalez-Machorro • R. von Heynitz • K. Scherzer • I. Cordts • B. W. Schuller
Early Detection of ALS in Absence of Speech Impairments with Computer Audition.
AIME 2025 - 23rd International Conference on Artificial Intelligence in Medicine. Pavia, Italy, Jun 23-26, 2025. DOI
Early Detection of ALS in Absence of Speech Impairments with Computer Audition.
AIME 2025 - 23rd International Conference on Artificial Intelligence in Medicine. Pavia, Italy, Jun 23-26, 2025. DOI
[65]
I. Tsangko • A. Triantafyllopoulos • E. Kyriakidis • G. Margetis • B. W. Schuller
Large Language Models for the Analysis of Project Proposals.
AI-HCI 2025 - 6th International Conference on Artificial Intelligence in Human Computer Interaction. Gothenburg, Sweden, Jun 22-27, 2025. DOI
Large Language Models for the Analysis of Project Proposals.
AI-HCI 2025 - 6th International Conference on Artificial Intelligence in Human Computer Interaction. Gothenburg, Sweden, Jun 22-27, 2025. DOI
[64]
K. D. Bartl-Pokorny • A. Mallol-Ragolta • A. Spiesberger • A. Semertzidou • J. Löchner • F. B. Pokorny • B. W. Schuller
'Hey Smartphone, Am I Ill?' Detecting Diseases From The Voice.
Frontiers Frontiers for Young Minds. Jun. 2025. URL
'Hey Smartphone, Am I Ill?' Detecting Diseases From The Voice.
Frontiers Frontiers for Young Minds. Jun. 2025. URL
[63]
L. Mamede • R. C. Sabàb • S. Van Coillie • J. Prevot • S. Sánchez-Ramón • C. Poli • A. Barasa • B. W. Schuller • A. Hendel • N. Garcelon • C. Boersma • P. Lee • C. Booth • L. D. Notarangelo • J. Drabwell • N. L. Rider • F. Staal • S. O. Burns • M. van Hagen • M. Pergrnt • J. G. Rivière • N. Mahlaoui
Navigating disruption in the PID landscape: embracing opportunities and anticipating threats in the next ten years.
Frontiers in Immunology 16. May. 2025. DOI
Navigating disruption in the PID landscape: embracing opportunities and anticipating threats in the next ten years.
Frontiers in Immunology 16. May. 2025. DOI
[62]
X. Jing • J. Wang • I. Tsangko • A. Triantafyllopoulos • B. W. Schuller
MELT: Towards Automated Multimodal Emotion Data Annotation by Leveraging LLM Embedded Knowledge.
Preprint (May. 2025). arXiv
MELT: Towards Automated Multimodal Emotion Data Annotation by Leveraging LLM Embedded Knowledge.
Preprint (May. 2025). arXiv
[61]
X. Jing • K. Zhou • A. Triantafyllopoulos • B. W. Schuller
Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models.
ICASSP 2025 - IEEE International Conference on Acoustics, Speech and Signal Processing. Hyderabad, India, Apr 06-11, 2025. DOI
Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models.
ICASSP 2025 - IEEE International Conference on Acoustics, Speech and Signal Processing. Hyderabad, India, Apr 06-11, 2025. DOI
[60]
I. Tsangko • A. Triantafyllopoulos • M. Müller • H. Schröter • B. W. Schuller
DFingerNet: Noise-Adaptive Speech Enhancement for Hearing Aids.
ICASSP 2025 - IEEE International Conference on Acoustics, Speech and Signal Processing. Hyderabad, India, Apr 06-11, 2025. DOI
DFingerNet: Noise-Adaptive Speech Enhancement for Hearing Aids.
ICASSP 2025 - IEEE International Conference on Acoustics, Speech and Signal Processing. Hyderabad, India, Apr 06-11, 2025. DOI
[59]
W. Qi • X. Xu • K. Qian • B. W. Schuller • G. Fortino • A. Aliverti
A Review of AIoT-Based Human Activity Recognition: From Application to Technique.
IEEE Journal of Biomedical and Health Informatics 29.4. Apr. 2025. DOI
A Review of AIoT-Based Human Activity Recognition: From Application to Technique.
IEEE Journal of Biomedical and Health Informatics 29.4. Apr. 2025. DOI
[58]
X. Qiu • W. Qiu • Y. Zhang • K. Qian • C. Li • B. Hu • B. W. Schuller • Y. Yamamoto
FedKDC: Consensus-Driven Knowledge Distillation for Personalized Federated Learning in EEG-Based Emotion Recognition.
IEEE Journal of Biomedical and Health Informatics Early Access. Apr. 2025. DOI GitHub
FedKDC: Consensus-Driven Knowledge Distillation for Personalized Federated Learning in EEG-Based Emotion Recognition.
IEEE Journal of Biomedical and Health Informatics Early Access. Apr. 2025. DOI GitHub
[57]
L. Christ • S. Amiriparian • A. Kathan • N. Müller • A. König • B. W. Schuller
Towards Multimodal Prediction of Spontaneous Humor: A Novel Dataset and First Results.
IEEE Transactions on Affective Computing 16.2. Apr. 2025. DOI
Towards Multimodal Prediction of Spontaneous Humor: A Novel Dataset and First Results.
IEEE Transactions on Affective Computing 16.2. Apr. 2025. DOI
[56]
L. Shen • H. Zhang • C. Zhu • R. Li • K. Qian • F. Tian • B. Hu • B. W. Schuller • Y. Yamamoto
Enhancing Emotion Regulation in Mental Disorder Treatment: An AIGC-based Closed-Loop Music Intervention System.
IEEE Transactions on Affective Computing Early Access. Apr. 2025. DOI
Enhancing Emotion Regulation in Mental Disorder Treatment: An AIGC-based Closed-Loop Music Intervention System.
IEEE Transactions on Affective Computing Early Access. Apr. 2025. DOI
[55]
Z. Li • Z. Wang • X. Xu • Y. Chen • B. W. Schuller
Unsupervised Domain-Adaptive Semantic Segmentation for Surgical Instruments Leveraging Dropout-Enhanced Dual Heads and Coarse-Grained Classification Branch.
IEEE Transactions on Medical Robotics and Bionics Early Access. Apr. 2025. DOI
Unsupervised Domain-Adaptive Semantic Segmentation for Surgical Instruments Leveraging Dropout-Enhanced Dual Heads and Coarse-Grained Classification Branch.
IEEE Transactions on Medical Robotics and Bionics Early Access. Apr. 2025. DOI
[54]
L. Zhu • R. Wang • X. Jin • Y. Li • F. Tian • R. Cai • K. Qian • X. Hu • B. Hu • Y. Yamamoto • B. W. Schuller
Explainable Depression Classification Based on EEG Feature Selection from Audio Stimuli.
IEEE Transactions on Neural Systems and Rehabilitation Engineering 33. Apr. 2025. DOI
Explainable Depression Classification Based on EEG Feature Selection from Audio Stimuli.
IEEE Transactions on Neural Systems and Rehabilitation Engineering 33. Apr. 2025. DOI
[53]
J. Xie • Y. Wang • X. Qian • J. Zhang • B. W. Schuller
Improving Bird Vocalization Recognition in Open-Set Cross-Corpus Scenarios with Semantic Feature Reconstruction and Dual Strategy Scoring.
IEEE Signal Processing Letters 32. Mar. 2025. DOI
Improving Bird Vocalization Recognition in Open-Set Cross-Corpus Scenarios with Semantic Feature Reconstruction and Dual Strategy Scoring.
IEEE Signal Processing Letters 32. Mar. 2025. DOI
[52]
Y. Li • M. Milling • B. W. Schuller
Neuroplasticity in Artificial Intelligence -- An Overview and Inspirations on Drop In & Out Learning.
Preprint (Mar. 2025). arXiv
Neuroplasticity in Artificial Intelligence -- An Overview and Inspirations on Drop In & Out Learning.
Preprint (Mar. 2025). arXiv
[51]
Y. Li • Q. Sun • S. M. K. Murthy • E. Alturki • B. W. Schuller
GatedxLSTM: A Multimodal Affective Computing Approach for Emotion Recognition in Conversations.
Preprint (Mar. 2025). arXiv
GatedxLSTM: A Multimodal Affective Computing Approach for Emotion Recognition in Conversations.
Preprint (Mar. 2025). arXiv
[50]
Z. Sun • J. Kang • K. Qian • B. W. Schuller • B. Hu
Creating Healthier Living Environments: The Role of Soundscapes in Promoting Mental Health and Well-Being.
IEEE Transactions on Computational Social Systems 12.1. Feb. 2025. DOI
Creating Healthier Living Environments: The Role of Soundscapes in Promoting Mental Health and Well-Being.
IEEE Transactions on Computational Social Systems 12.1. Feb. 2025. DOI
[49]
M. Milling • S. D. Rampp • A. Triantafyllopoulos • M. P. Plaza • J. O. Brunner • C. Traidl-Hoffmann • B. W. Schuller • A. Damialis
Automating airborne pollen classification: Identifying and interpreting hard samples for classifiers.
Heliyon 11.2. Jan. 2025. DOI GitHub
Automating airborne pollen classification: Identifying and interpreting hard samples for classifiers.
Heliyon 11.2. Jan. 2025. DOI GitHub
[48]
F. Tian • H. Zhang • Y. Tan • L. Zhu • L. Shen • K. Qian • B. Hu • B. W. Schuller • Y. Yamamoto
An On-Board Executable Multi-Feature Transfer-Enhanced Fusion Model for Three-Lead EEG Sensor-Assisted Depression Diagnosis.
IEEE Journal of Biomedical and Health Informatics 29.1. Jan. 2025. DOI
An On-Board Executable Multi-Feature Transfer-Enhanced Fusion Model for Three-Lead EEG Sensor-Assisted Depression Diagnosis.
IEEE Journal of Biomedical and Health Informatics 29.1. Jan. 2025. DOI
[47]
A. Akman • Q. Sun • B. W. Schuller
Improving Audio Explanations using Audio Language Models.
IEEE Signal Processing Letters 32. Jan. 2025. DOI
Improving Audio Explanations using Audio Language Models.
IEEE Signal Processing Letters 32. Jan. 2025. DOI
[46]
Y. Sun • Y. Zhou • X. Xu • J. Qi • F. Xu • Z. Ren • B. W. Schuller
Weakly-Supervised Depression Detection in Speech Through Self-Learning Based Label Correction.
IEEE Transactions on Audio, Speech and Language Processing 33. Jan. 2025. DOI
Weakly-Supervised Depression Detection in Speech Through Self-Learning Based Label Correction.
IEEE Transactions on Audio, Speech and Language Processing 33. Jan. 2025. DOI
[45]
W. Mayr • A. Triantafyllopoulos • A. Batliner • B. W. Schuller • T. M. Berghaus
Assessing the Clinical and Functional Status of COPD Patients Using Speech Analysis During and After Exacerbation.
International Journal of Chronic Obstructive Pulmonary Disease 20. Jan. 2025. DOI
Assessing the Clinical and Functional Status of COPD Patients Using Speech Analysis During and After Exacerbation.
International Journal of Chronic Obstructive Pulmonary Disease 20. Jan. 2025. DOI
[44]
S. Iqbal • X. Zhong • M. A. Khan • Z. Wu • N. A. Almujally • W. Liu • B. W. Schuller • A.
Cross-modal invariant learning with latent diffusion for reliable medical diagnosis under dynamic shifts.
Neurocomputing Early Access.132111. Jan. 2025. DOI
Cross-modal invariant learning with latent diffusion for reliable medical diagnosis under dynamic shifts.
Neurocomputing Early Access.132111. Jan. 2025. DOI
[43]
Z. Yang • M. Song • X. Jing • H. Zhang • K. Qian • B. Hu • K. Tamada • T. Takumi • B. W. Schuller • Y. Yamamoto
MADUV: The 1st INTERSPEECH Mice Autism Detection via Ultrasound Vocalization Challenge.
Preprint (Jan. 2025). arXiv
MADUV: The 1st INTERSPEECH Mice Autism Detection via Ultrasound Vocalization Challenge.
Preprint (Jan. 2025). arXiv
2024
[42]
A. Triantafyllopoulos • B. W. Schuller
Hearing aids in the era of foundation models.
GMS Zeitschrift für Audiologie 6.28. Dec. 2024. DOI
Hearing aids in the era of foundation models.
GMS Zeitschrift für Audiologie 6.28. Dec. 2024. DOI
[41]
Q. Sun • A. Akman • X. Jing • M. Milling • B. W. Schuller
Audio-based Kinship Verification Using Age Domain Conversion.
IEEE Signal Processing Letters 32. Dec. 2024. DOI
Audio-based Kinship Verification Using Age Domain Conversion.
IEEE Signal Processing Letters 32. Dec. 2024. DOI
[40]
L. Shen • H. Zhang • C. Zhu • R. Li • K. Qian • W. Meng • F. Tian • B. Hu • B. W. Schuller • Y. Yamamoto
A First Look at Generative Artificial Intelligence Based Music Therapy for Mental Disorders.
IEEE Transactions on Consumer Electronics Early Access. Dec. 2024. DOI
A First Look at Generative Artificial Intelligence Based Music Therapy for Mental Disorders.
IEEE Transactions on Consumer Electronics Early Access. Dec. 2024. DOI
[39]
Y. Li • M. Milling • L. Specia • B. W. Schuller
From Audio Deepfake Detection to AI-Generated Music Detection -- A Pathway and Overview.
Preprint (Dec. 2024). arXiv
From Audio Deepfake Detection to AI-Generated Music Detection -- A Pathway and Overview.
Preprint (Dec. 2024). arXiv
[38]
Q. Sun • Y. Li • E. Alturki • S. M. K. Murthy • B. W. Schuller
Towards Friendly AI: A Comprehensive Review and New Perspectives on Human-AI Alignment.
Preprint (Dec. 2024). arXiv
Towards Friendly AI: A Comprehensive Review and New Perspectives on Human-AI Alignment.
Preprint (Dec. 2024). arXiv
[37]
A. Kathan • S. Amiriparian • L. Christ • S. Eulitz • B. W. Schuller
Automatic Speech-Based Charisma Recognition and the Impact of Integrating Auxiliary Characteristics.
TELEPRESENCE 2024 - IEEE Conference on Telepresence. Pasadena, CA, USA, Nov 16-17, 2024. DOI
Automatic Speech-Based Charisma Recognition and the Impact of Integrating Auxiliary Characteristics.
TELEPRESENCE 2024 - IEEE Conference on Telepresence. Pasadena, CA, USA, Nov 16-17, 2024. DOI
[36]
S. Amiriparian • M. Gerczuk • J. Lutz • W. Strube • I. Papazova • A. Hasan • A. Kathan • B. W. Schuller
Non-Invasive Suicide Risk Prediction Through Speech Analysis.
EHB 2024 - 12th E-Health and Bioengineering Conference. IASI, Romania, Nov 14-15, 2024. DOI
Non-Invasive Suicide Risk Prediction Through Speech Analysis.
EHB 2024 - 12th E-Health and Bioengineering Conference. IASI, Romania, Nov 14-15, 2024. DOI
[35]
A. Mallol-Ragolta • M. Milling • B. W. Schuller
Multi-Triplet Loss-Based Models for Categorical Depression Recognition from Speech.
IberSPEECH 2024 - 7th Conference IberSPEECH 2024. Aveiro, Portugal, Nov 11-13, 2024. PDF
Multi-Triplet Loss-Based Models for Categorical Depression Recognition from Speech.
IberSPEECH 2024 - 7th Conference IberSPEECH 2024. Aveiro, Portugal, Nov 11-13, 2024. PDF
[34]
A. Mallol-Ragolta • A. Spiesberger • A. B. Salvador • B. W. Schuller
Prototypical Networks for Speech Emotion Recognition in Spanish.
IberSPEECH 2024 - 7th Conference IberSPEECH 2024. Aveiro, Portugal, Nov 11-13, 2024. PDF
Prototypical Networks for Speech Emotion Recognition in Spanish.
IberSPEECH 2024 - 7th Conference IberSPEECH 2024. Aveiro, Portugal, Nov 11-13, 2024. PDF
[33]
A. Mallol-Ragolta • A. Spiesberger • B. W. Schuller
Face Mask Type and Coverage Area Recognition from Speech with Prototypical Networks.
IberSPEECH 2024 - 7th Conference IberSPEECH 2024. Aveiro, Portugal, Nov 11-13, 2024. PDF
Face Mask Type and Coverage Area Recognition from Speech with Prototypical Networks.
IberSPEECH 2024 - 7th Conference IberSPEECH 2024. Aveiro, Portugal, Nov 11-13, 2024. PDF
[32]
K. D. Bartl-Pokorny • C. Zitta • M. Beirit • G. Vogrinec • B. W. Schuller • F. B. Pokorny
Focused review on artificial intelligence for disease detection in infants.
Frontiers in Digital Health 6. Nov. 2024. DOI
Focused review on artificial intelligence for disease detection in infants.
Frontiers in Digital Health 6. Nov. 2024. DOI
[31]
S. Rampp • M. Milling • A. Triantafyllopoulos • B. W. Schuller
Does the Definition of Difficulty Matter? Scoring Functions and their Role for Curriculum Learning.
Preprint (Nov. 2024). arXiv
Does the Definition of Difficulty Matter? Scoring Functions and their Role for Curriculum Learning.
Preprint (Nov. 2024). arXiv
[30]
S. Rampp • A. Triantafyllopoulos • M. Milling • B. W. Schuller
autrainer: A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks.
Preprint (Nov. 2024). arXiv
autrainer: A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks.
Preprint (Nov. 2024). arXiv
[29]
K. R. Scherer • F. Burkhardt • U. D. Reichel • F. Eyben • B. W. Schuller
Using voice analysis as an early indicator of risk for depression in young adults.
Preprint (Nov. 2024). arXiv
Using voice analysis as an early indicator of risk for depression in young adults.
Preprint (Nov. 2024). arXiv
[28]
Q. Sun • A. Akman • B. W. Schuller
Explainable Artificial Intelligence for Medical Applications: A Review.
Preprint (Nov. 2024). arXiv
Explainable Artificial Intelligence for Medical Applications: A Review.
Preprint (Nov. 2024). arXiv
[27]
A. Triantafyllopoulos • Y. Terhorst • I. Tsangko • F. B. Pokorny • K. D. Bartl-Pokorny • L. Seizer • A. Klein • J. Chim • D. Atzil-Slonim • M. Liakata • M. Bühner • J. Löchner • B. W. Schuller
Large language models for mental health.
Preprint (Nov. 2024). arXiv
Large language models for mental health.
Preprint (Nov. 2024). arXiv
[26]
S. Amiriparian • L. Christ • A. Kathan • M. Gerczuk • N. Müller • S. Klug • L. Stappen • A. König • E. Cambria • B. W. Schuller • S. Eulitz
The MuSe 2024 Multimodal Sentiment Analysis Challenge: Social Perception and Humor Recognition.
MuSe @MM 2024 - 5th on Multimodal Sentiment Analysis Challenge and Workshop: Social Perception and Humor at the 32nd ACM International Conference on Multimedia. Melbourne, Australia , Oct 28-Nov 01, 2024. DOI
The MuSe 2024 Multimodal Sentiment Analysis Challenge: Social Perception and Humor Recognition.
MuSe @MM 2024 - 5th on Multimodal Sentiment Analysis Challenge and Workshop: Social Perception and Humor at the 32nd ACM International Conference on Multimedia. Melbourne, Australia , Oct 28-Nov 01, 2024. DOI
[25]
M. M. Amin • R. Mao • E. Cambria • B. W. Schuller
A Wide Evaluation of ChatGPT on Affective Computing Tasks.
IEEE Transactions on Affective Computing 15.4. Oct. 2024. DOI
A Wide Evaluation of ChatGPT on Affective Computing Tasks.
IEEE Transactions on Affective Computing 15.4. Oct. 2024. DOI
[24]
M. M. Amin • B. W. Schuller
On Prompt Sensitivity of ChatGPT in Affective Computing.
ACII 2024 - 12th International Conference on Affective Computing and Intelligent Interaction. Glasgow, UK, Sep 15-18, 2024. DOI
On Prompt Sensitivity of ChatGPT in Affective Computing.
ACII 2024 - 12th International Conference on Affective Computing and Intelligent Interaction. Glasgow, UK, Sep 15-18, 2024. DOI
[23]
S. Amiriparian • F. Packań • M. Gerczuk • B. W. Schuller
ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. DOI
ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. DOI
[22]
L. Christ • S. Amiriparian • F. Hawighorst • A.-K. Schill • A. Boutalikakis • L. Graf-Vlachy • A. König • B. W. Schuller
This Paper Had the Smartest Reviewers -- Flattery Detection Utilising an Audio-Textual Transformer-Based Approach.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. DOI
This Paper Had the Smartest Reviewers -- Flattery Detection Utilising an Audio-Textual Transformer-Based Approach.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. DOI
[21]
M. Gerczuk • S. Amiriparian • J. Lutz • W. Strube • I. Papazova • A. Hasan • B. W. Schuller
Exploring Gender-Specific Speech Patterns in Automatic Suicide Risk Assessment.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. DOI
Exploring Gender-Specific Speech Patterns in Automatic Suicide Risk Assessment.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. DOI
[20]
S. Kalabakov • M. Gonzalez-Machorro • F. Eyben • B. W. Schuller • B. Arnrich
A Comparative Analysis of Federated Learning for Speech-Based Cognitive Decline Detection.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. PDF
A Comparative Analysis of Federated Learning for Speech-Based Cognitive Decline Detection.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. PDF
[19]
A. Kathan • M. Bürger • A. Triantafyllopoulos • S. Milkus • R. Musil • B. W. Schuller • S. Amiriparian
Real-world PTSD Recognition: A Cross-corpus and Cross-linguistic Evaluation.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. DOI
Real-world PTSD Recognition: A Cross-corpus and Cross-linguistic Evaluation.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. DOI
[18]
O. Schrüfer • M. Milling • F. Burkhardt • F. Eyben • B. W. Schuller
Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. PDF
Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. PDF
[17]
A. Spiesberger • A. Triantafyllopoulos • A. Kathan • A. Semertzidou • C. Gawrilow • T. Reinelt • W. Rauch • B. W. Schuller
'So... my child...' -- How Child ADHD Influences the Way Parents Talk.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. PDF
'So... my child...' -- How Child ADHD Influences the Way Parents Talk.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. PDF
[16]
A. Triantafyllopoulos • A. Batliner • S. Rampp • M. Milling • B. W. Schuller
INTERSPEECH 2009 Emotion Challenge Revisited: Benchmarking 15 Years of Progress in Speech Emotion Recognition.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. DOI
INTERSPEECH 2009 Emotion Challenge Revisited: Benchmarking 15 Years of Progress in Speech Emotion Recognition.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. DOI
[15]
A. Triantafyllopoulos • B. W. Schuller
Enrolment-based personalisation for improving individual-level fairness in speech emotion recognition.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. PDF
Enrolment-based personalisation for improving individual-level fairness in speech emotion recognition.
INTERSPEECH 2024 - 25th Annual Conference of the International Speech Communication Association. Kos Island, Greece, Sep 01-05, 2024. PDF
[14]
A. Triantafyllopoulos • L. Christ • A. Gebhard • X. Jing • A. Kathan • M. Milling • I. Tsangko • S. Amiriparian • B. W. Schuller
Beyond deep learning: Charting the next frontiers of affective computing.
Intelligent Computing 3.0089. Sep. 2024. DOI
Beyond deep learning: Charting the next frontiers of affective computing.
Intelligent Computing 3.0089. Sep. 2024. DOI
[13]
M. Milling • S. Liu • A. Triantafyllopoulos • I. Aslan • B. W. Schuller
Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance.
Journal of Computer Science and Technology 39. Sep. 2024. DOI
Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance.
Journal of Computer Science and Technology 39. Sep. 2024. DOI
[12]
A. Triantafyllopoulos • A. Gebhard • M. Milling • S. Rampp • B. W. Schuller
An Automatic Analysis of Ultrasound Vocalisations for the Prediction of Interaction Context in Captive Egyptian Fruit Bats.
EUSIPCO 2024 - 32nd European Signal Processing Conference. Lyon, France, Aug 26-30, 2024. DOI
An Automatic Analysis of Ultrasound Vocalisations for the Prediction of Interaction Context in Captive Egyptian Fruit Bats.
EUSIPCO 2024 - 32nd European Signal Processing Conference. Lyon, France, Aug 26-30, 2024. DOI
[11]
L. Christ • S. Amiriparian • M. Milling • I. Aslan • B. W. Schuller
Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning.
Findings @ACL 2024 - Findings of the 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. DOI
Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning.
Findings @ACL 2024 - Findings of the 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. DOI
[10]
Z. Ren • Y. Chang • T. T. Nguyen • Y. Tan • K. Qian • B. W. Schuller
A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era.
IEEE Computational Intelligence Magazine 19.3. Aug. 2024. DOI
A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era.
IEEE Computational Intelligence Magazine 19.3. Aug. 2024. DOI
[9]
A. Kathan • S. Amiriparian • A. Triantafyllopoulos • A. Gebhard • S. Milkus • J. Hohmann • P. Muderlak • J. Schottdorf • R. Musil • B. W. Schuller
Personalised Speech-Based PTSD Prediction Using Weighted-Instance Learning.
EMBC 2024 - 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Orlando, FL, USA, Jul 15-19, 2024. DOI
Personalised Speech-Based PTSD Prediction Using Weighted-Instance Learning.
EMBC 2024 - 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Orlando, FL, USA, Jul 15-19, 2024. DOI
[8]
S. T. Rajamani • K. Rajamani • A. J • K. R • B. W. Schuller
CBAM_SAUNet: A novel attention U-Net for effective segmentation of corner cases.
EMBC 2024 - 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Orlando, FL, USA, Jul 15-19, 2024. DOI
CBAM_SAUNet: A novel attention U-Net for effective segmentation of corner cases.
EMBC 2024 - 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Orlando, FL, USA, Jul 15-19, 2024. DOI
[7]
A. Spiesberger • A. Mallol-Ragolta • A. Triantafyllopoulos • B. W. Schuller
Towards Predicting Menstrual Cycle Phases Exploiting Paralinguistic Features.
EMBC 2024 - 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Orlando, FL, USA, Jul 15-19, 2024. DOI
Towards Predicting Menstrual Cycle Phases Exploiting Paralinguistic Features.
EMBC 2024 - 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Orlando, FL, USA, Jul 15-19, 2024. DOI
[6]
W. Qiu • Y. Feng • Y. Li • Y. Chang • K. Qian • B. Hu • Y. Yamamoto • B. W. Schuller
Fed-MStacking: Heterogeneous Federated Learning With Stacking Misaligned Labels for Abnormal Heart Sound Detection.
IEEE Journal of Biomedical and Health Informatics 28.9. Jul. 2024. DOI
Fed-MStacking: Heterogeneous Federated Learning With Stacking Misaligned Labels for Abnormal Heart Sound Detection.
IEEE Journal of Biomedical and Health Informatics 28.9. Jul. 2024. DOI
[5]
P. Purucker • C. Reil • A. Höß • B. W. Schuller
Deep Neural Quality of Service Prediction for Unmanned Aircraft System Communications.
IWCMC 2024 - 20th International Wireless Communications and Mobile Computing Conference. Cyprus, Greece, May 27-31, 2024. DOI
Deep Neural Quality of Service Prediction for Unmanned Aircraft System Communications.
IWCMC 2024 - 20th International Wireless Communications and Mobile Computing Conference. Cyprus, Greece, May 27-31, 2024. DOI
[4]
W. Qiu • C. Quan • L. Zhu • Y. Yu • Z. Wang • Y. Ma • M. Sun • Y. Chang • K. Qian • B. Hu • Y. Yamamoto • B. W. Schuller
Heart Sound Abnormality Detection From Multi-Institutional Collaboration: Introducing a Federated Learning Framework.
IEEE Transactions on Biomedical Engineering 71.10. May. 2024. DOI
Heart Sound Abnormality Detection From Multi-Institutional Collaboration: Introducing a Federated Learning Framework.
IEEE Transactions on Biomedical Engineering 71.10. May. 2024. DOI
[3]
A. Triantafyllopoulos • B. W. Schuller
Expressivity and Speech Synthesis.
Oxford Handbook of Expressivity in Language. Apr. 2024. arXiv URL
Expressivity and Speech Synthesis.
Oxford Handbook of Expressivity in Language. Apr. 2024. arXiv URL
[2]
A. Mallol-Ragolta • B. W. Schuller
Coupling Sentiment and Arousal Analysis Towards an Affective Dialogue Manager.
IEEE Access 12. Feb. 2024. DOI
Coupling Sentiment and Arousal Analysis Towards an Affective Dialogue Manager.
IEEE Access 12. Feb. 2024. DOI
[1]
J. Xie • Y. Shi • D. Ni • M. Milling • S. Liu • J. Zhang • K. Qian • B. W. Schuller
Automatic Bird Sound Source Separation Based on Passive Acoustic Devices in Wild Environment.
IEEE Internet of Things Journal 11.9. Jan. 2024. DOI
Automatic Bird Sound Source Separation Based on Passive Acoustic Devices in Wild Environment.
IEEE Internet of Things Journal 11.9. Jan. 2024. DOI
©all images: LMU | TUM
Back to Top
2024-12-27 - Last modified: 2026-07-03