Home  | Publications | LLS+26

M6: Multi-Generator, Multi-Domain, Multi-Lingual and Cultural, Multi-Genres, Multi-Instrument Machine-Generated Music Detection Databases

MCML Authors

Link to Profile Björn Schuller

Björn Schuller

Prof. Dr.

Principal Investigator

Abstract

Machine-generated music (MGM) has become a powerful tool with applications in music therapy, personalised editing, and creative inspiration. However, its unregulated use threatens the entertainment, education, and arts sectors by diminishing the value of high-quality human compositions. Effective detection of machine-generated music (MGMD) is essential, yet progress is hindered by the lack of comprehensive datasets. To address this gap, we introduce M6, a large-scope benchmark dataset designed for MGMD research. M6 is distinguished by its diversity, encompassing multiple generators, domains, languages, cultural contexts, genres, and instruments, all provided in WAV format. We detail the data collection methodology and analysis, alongside baseline performance scores from foundational binary classification models, highlighting the complexity of MGMD and the need for further advancements. M6 serves as a robust resource to support future research in developing effective detection methods. The dataset is available at https://huggingface.co/datasets/yl7622/M6 to promote collaboration and innovation.

article LLS+26


Scientific Reports

In press. Feb. 2026.
Top Journal

Authors

Y. Li • H. Li • L. Specia • B. W. Schuller

Links

DOI

Research Area

 B3 | Multimodal Perception

BibTeXKey: LLS+26

Back to Top