Home  | Events

04

Feb

Teaser image to Large Language Models for Statistical Inference: Context Augmentation with Applications to the Two-Sample Problem and Regression

Colloquium

Large Language Models for Statistical Inference: Context Augmentation With Applications to the Two-Sample Problem and Regression

Marc Ratkovic, University of Mannheim

   04.02.2026

   4:15 pm - 5:45 pm

   LMU Munich, Department of Statistics and via zoom

The colloquium introduces context expansion, an approach that uses large language models to generate additional contexts around strings to enable valid statistical inferences.

These contexts reduce uncertainty, provide additional information and improve the interpretability of the results. Using synthetic data and a two-sample test, it is shown that the method has correct null behavior, is powerful and can be easily replicated. In addition, a text-to-text regression is introduced in which generated contexts serve as mediating variables and semantic from syntactic effects can be separated.

Theoretical analyzes provide identification conditions, efficiency gains and error bounds and overall show that context extension meaningfully combines LLMs with classical statistics.

Marc Ratkovic is Professor and Chair of Social Data Science at the Department of Political Science, University of Mannheim. He is working to integrate machine learning and deep learning methods into practical social science methodology.


Related

Link to How Machines Explore, Conjecture, and Discover Mathematics

Munich AI Lectures  •  12.02.2026  •  LMU Munich, Main Building, Room D209

How Machines Explore, Conjecture, and Discover Mathematics

Munich AI Lecture on Feb 12 features Sebastian Pokutta from Zuse Institute Berlin (ZIB).


Back to Top