Min-K Sampling: Decoupling Truncation From Temperature Scaling via Relative Logit Dynamics

MCML Authors

Esteban Garces Arias

→ Group Bernd Bischl
Statistical Learning and Data Science

Dr. Matthias Aßenmacher

→ Group Bernd Bischl
Statistical Learning and Data Science

Abstract

The quality of text generated by large language models depends critically on the decoding strategy. While mainstream methods such as Top-k, Top-p, and Min-p achieve a balance between diversity and accuracy through probability-space truncation, they share an inherent limitation: extreme sensitivity to the temperature parameter. Recent logit-space approaches like Top-nσ achieve temperature invariance but rely on global statistics that are susceptible to long-tail noise, failing to capture fine-grained confidence structures among top candidates. We propose Min-k Sampling, a novel dynamic truncation strategy that analyzes the local shape of the sorted logit distribution to identify 'semantic cliffs': sharp transitions from high-confidence core tokens to uncertain long-tail tokens. By computing a position-weighted relative decay rate, Min-k dynamically determines truncation boundaries at each generation step. We formally prove that Min-k achieves strict temperature invariance and empirically demonstrate its low sensitivity to hyperparameter choices. Experiments on multiple reasoning benchmarks, creative writing tasks, and human evaluation show that Min-k consistently improves text quality, maintaining robust performance even under extreme temperature settings where probability-based methods collapse. We make our code, models, and analysis tools publicly available.
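
The temperature-invariance claim in the abstract can be illustrated with a small sketch. The abstract does not give the Min-k formula, so the rule below is NOT the authors' method: `cliff_keep`, its range normalization, and its `1 / rank` position weight are hypothetical stand-ins chosen only to show why a rule built on relative logit gaps is unaffected by temperature, while a probability-space rule such as Top-p is not. The example logits are made up for illustration.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax, stabilized by subtracting the max."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def top_p_keep(logits, temperature, p=0.9):
    """Standard Top-p: keep the smallest prefix whose probability mass reaches p."""
    probs = softmax(logits, temperature)
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= p:
            break
    return sorted(kept)


def cliff_keep(logits, temperature):
    """Hypothetical relative-gap rule (an assumption, not the paper's formula).

    Each gap between neighbouring sorted logits is divided by the total logit
    range, then down-weighted by its rank. Dividing all logits by a temperature
    rescales every gap and the range by the same factor, so the scores, and
    therefore the cut position, do not change.
    """
    scaled = [x / temperature for x in logits]
    order = sorted(range(len(scaled)), key=lambda i: scaled[i], reverse=True)
    s = [scaled[i] for i in order]
    spread = s[0] - s[-1]
    if spread == 0.0:
        return sorted(order)  # flat distribution: no cliff, keep everything
    best_rank, best_score = 1, -1.0
    for rank in range(1, len(s)):
        relative_gap = (s[rank - 1] - s[rank]) / spread
        score = relative_gap / rank  # hypothetical position weight
        if score > best_score:
            best_rank, best_score = rank, score
    return sorted(order[:best_rank])


# Made-up logits: three confident candidates followed by a long tail.
logits = [5.0, 4.6, 4.5, 1.0, 0.8, 0.5, 0.1, -0.3]

for t in (0.5, 1.0, 5.0):
    print(f"T={t}: top-p keeps {top_p_keep(logits, t)}, "
          f"relative-gap rule keeps {cliff_keep(logits, t)}")
```

On these logits the relative-gap rule selects the same three leading tokens at every temperature, whereas Top-p admits more of the tail as the temperature rises. The actual position weighting and decay-rate definition used by Min-k should be taken from the paper and its released code.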

ACL 2026

64th Annual Meeting of the Association for Computational Linguistics. San Diego, CA, USA, Jul 02-07, 2026.

Authors

Y. Ding • M. Li • E. Garces Arias • M. Aßenmacher • C. Heumann • C. Zhang

Research Area

A1 | Statistical Foundations & Explainability

BibTeXKey: DLG+26