Home  | Publications | CNW+25

Human vs. Machine -- 1:3. Joint Analysis of Classical and ML-Based Summary Statistics of the Lyman-Α Forest

MCML Authors

Abstract

In order to compress and more easily interpret Lyman-α forest (LyαF) datasets, summary statistics, e.g. the power spectrum, are commonly used. However, such summaries unavoidably lose some information, weakening the constraining power on parameters of interest. Recently, machine learning (ML)-based summary approaches have been proposed as an alternative to human-defined statistical measures. This raises a question: can ML-based summaries contain the full information captured by traditional statistics, and vice versa? In this study, we apply three human-defined techniques and one ML-based approach to summarize mock LyαF data from hydrodynamical simulations and infer two thermal parameters of the intergalactic medium, assuming a power-law temperature-density relation. We introduce a metric for measuring the improvement in the figure of merit when combining two summaries. Consequently, we demonstrate that the ML-based summary approach not only contains almost all of the information from the human-defined statistics, but also that it provides significantly stronger constraints by a ratio of better than 1:3 in terms of the posterior volume on the temperature-density relation parameters.

article


Astronomy & Astrophysics

In press. Aug. 2025. Preprint.
Top Journal

Authors

S. Chang • P. Nayak • M. Walther • D. Grün

Links

arXiv

Research Area

 C3 | Physics and Geo Sciences

BibTeXKey: CNW+25

Back to Top