From Calculation to Adjudication: Examining LLM Judges on Mathematical Reasoning Tasks

inproceedings SZA+25


GEM2 @ACL 2025

4th Workshop on Generation, Evaluation and Metrics at the 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025.

Authors

A. Stephan • D. Zhu • M. Aßenmacher • X. Shen • B. Roth

Research Area

 A1 | Statistical Foundations & Explainability

BibTeXKey: SZA+25
