Home | Publications | XTP26

From Noise to Signal to Selbstzweck: Reframing Human Label Variation in the Era of Post-Training in NLP

MCML Authors

Link to Profile Barbara Plank

Barbara Plank

Prof. Dr.

Core PI

Abstract

Human Label Variation (HLV) refers to legitimate disagreement in annotation that reflects the diversity of human perspectives rather than mere error. Long treated in NLP as noise to be eliminated, HLV has only recently been reframed as a signal for improving model robustness. With the rise of large language models (LLMs) and post-training methods such as human feedback-based alignment, the role of HLV has become increasingly consequential. Yet current preference-learning datasets routinely collapse multiple annotations into a single label, flattening diverse perspectives into artificial consensus. Preserving HLV is necessary not only for pluralistic alignment but also for sociotechnical safety evaluation, where model behavior must be assessed in relation to human interaction and societal context.This position paper argues that preserving HLV as an embodiment of human pluralism must be treated as a Selbstzweck, an intrinsic value in itself. We analyze the limitations of existing preference datasets and propose actionable strategies for incorporating HLV into dataset construction to better preserve pluralistic human values.

inproceedings XTP26


Findings @ACL 2026

Findings at the 64th Annual Meeting of the Association for Computational Linguistics. San Diego, CA, USA, Jul 02-07, 2026. To be published. Preprint available.
Conference logo

Authors

S. Xu • S. T. Y. S. S • B. Plank

Links

arXiv

In Collaboration

 Amazon


Research Area

 B2 | Natural Language Processing

BibTeXKey: XTP26

Back to Top