#p-kreuter

MCS+26

Too Open for Opinion? Embracing Open-Endedness in Large Language Models for Social Simulation

EACL 2026

#p-kreuter #p-plank

BGG+26

On the Impossibility of Separating Intelligence From Judgment: The Computational Intractability of Filtering for AI Alignment

ICLR 2026

#p-kreuter

LKD+26

Sensing What Surveys Miss: Understanding and Personalizing Proactive LLM Support by User Modeling

CHI 2026

#p-kreuter

BAK+26

Reading Between the Tokens: Improving Preference Predictions Through Mechanistic Forecasting

Preprint (Feb. 2026)

#p-kreuter

LGZ+26

Do Large Language Models Think Like the Brain? Sentence-Level Evidence From fMRI and Hierarchical Embeddings

AAAI 2026

#p-kreuter

BWH+26

Clustering Mouse Movement Behavior in Surveys Using ResNet Embeddings

NLDL 2026

#p-kreuter

WBL+26

Comparison of Neural Networks and Gradient Boosting Models on Ordinal Age Class Prediction Using Mouse Trajectories

NLDL 2026

#p-kreuter

GDM+26

A Survey on Mental Health Datasets and Resources

Preprint (Jan. 2026)

#p-kreuter

MM26

The Imperfective Paradox in Large Language Models

Preprint (Jan. 2026)

#p-kreuter

YMZ+26

Moral Lenses, Political Coordinates: Towards Ideological Positioning of Morally Conditioned LLMs

Preprint (Jan. 2026)

#p-kasneci-gjergji #p-kreuter

NYM+25

Decomposed Prompting: Probing Multilingual Linguistic Structure Knowledge in Large Language Models

Findings @IJCNLP 2025

#p-kreuter #p-schuetze

ZMB+25

Enhancing Multi-Epitope Vaccine Effectiveness Through State-of-the-Art Proteasomal Cleavage Prediction With Deep Learning

BIPM 2025

#p-bischl #p-kreuter

SKK25

Fairness in Machine Learning for National Statistical Organizations

Foundations and Advances of Machine Learning in Official Statistics. Dec. 2025

#p-kern #p-kreuter

FNW+25

Measuring Sexism in US Elections: A Comparative Analysis of X Discourse From 2020 to 2024

CODI @EMNLP 2025

#p-kreuter

WML+25

M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis

EMNLP 2025

#p-kreuter #p-plank #p-schuetze

WCL+25

Multimodal Emotion Recognition in Conversations: A Survey of Methods, Trends, Challenges and Prospects

Findings @EMNLP 2025

#p-kreuter

EMK+25

Aligning NLP Models With Target Population Perspectives Using PAIR: Population-Aligned Instance Replication

NLPerspectives @EMNLP 2025

#p-kern #p-kreuter #p-plank

Kre25b

AAPOR Presidential Address 'That Ain’t the Way I Heard It!' on the Role of Surveys in Shaping (And Being Shaped By) Generative AI

Public Opinion Quarterly 89.3. Nov. 2025

#p-kreuter

KSG+25

Unintended Impacts of Automation for Integration? Simulating Integration Outcomes of Algorithm-Based Refugee Allocation in Germany

AIES 2025

#p-kern #p-kreuter

Hey25

Who Counts? The Potentials and Pitfalls of Using LLMs in Survey Research

NLPOR @COLM 2025

#p-kreuter

HHW+25a

AIn't Nothing but a Survey? Using Large Language Models for Coding German Open-Ended Survey Responses on Survey Motivation

NLPOR @COLM 2025

#p-kreuter

Bec25

Improving Annotation Quality: Empirical Insights Into Bias, Human-AI Collaboration, and Workflow Design

Dissertation LMU München. Oct. 2025

#p-kreuter

BHR+25

Toward Understanding the Transferability of Adversarial Suffixes in Large Language Models

Preprint (Oct. 2025)

#p-kreuter

BH25

Don't Walk the Line: Boundary Guidance for Filtered Generation

Preprint (Oct. 2025)

#p-kreuter

HWN+25a

Systematic Evaluation of Uncertainty Estimation Methods in Large Language Models

Preprint (Oct. 2025)

#p-kreuter

MYH+25a

Capabilities and Evaluation Biases of Large Language Models in Classical Chinese Poetry Generation: A Case Study on Tang Poetry

Preprint (Oct. 2025)

#p-kreuter

SFF+25

Bias Begins With Data: The FairGround Corpus for Robust and Reproducible Research on Algorithmic Fairness

Preprint (Oct. 2025)

#p-kern #p-kreuter

ZMF+25

Table Question Answering in the Era of Large Language Models: A Comprehensive Survey of Tasks, Methods, and Evaluation

Preprint (Oct. 2025)

#p-kreuter

DKG+25

Problem Solving Through Human-AI Preference-Based Cooperation

Computational Linguistics 51.4. Sep. 2025

#p-huellermeier #p-kreuter #p-schuetze

Kre25a

Modernizing Data Collection

Journal of Official Statistics 41.3. Sep. 2025

#p-kreuter

BEK+25

Bias in the Loop: How Humans Evaluate AI-Generated Suggestions

Preprint (Sep. 2025)

#p-kern #p-kreuter

HHW+25

AIn't Nothing but a Survey? Using Large Language Models for Coding German Open-Ended Survey Responses on Survey Motivation

JSM 2025

#p-kreuter

BSD+25

Addressing Data Gaps in Sustainability Reporting: A Benchmark Dataset for Greenhouse Gas Emission Extraction

Scientific Data 12.1497. Aug. 2025

#p-kreuter

MLZ+25

Pragmatics in the Era of Large Language Models: A Survey on Datasets, Evaluation, Opportunities and Challenges

ACL 2025

#p-kreuter #p-plank

MYH+25

Algorithmic Fidelity of Large Language Models in Generating Synthetic German Public Opinions: A Case Study

ACL 2025

#p-bischl #p-kreuter #p-plank

KHK25

Mind the Gap: Gender-Based Differences in Occupational Embeddings

GeBNLP @ACL 2025

#p-kreuter

KS25

Can Large Language Models Advance Occupational Coding? Evidence and Methodological Insights

ESRA 2025

#p-kreuter

Kre25

Adaptive Alignment: Designing AI for a Changing World - Frauke Kreuter

ICML 2025

#p-kreuter

NWK+25

Measuring Public Opinion Towards Artificial Intelligence: Development and Validation of a General AI Attitude Short Scale

AI and Society. Jul. 2025

#p-kern #p-kreuter

Hey25a

Who Counts? Survey Data Quality in the Age of AI

Dissertation Universität Mannheim. Jul. 2025. Co-Supervised

#p-kreuter

FKK25

Adjusting Survey Estimates With Multi-Accuracy Post-Processing

ITACOSM 2025

#p-kern #p-kreuter

SKB25a

Fares on Fairness: Using a Total Error Framework to Examine the Role of Measurement and Representation in Training Data on Model Fairness and Bias

EWAF 2025

#p-kern #p-kreuter

YNM+25

Why Lift So Heavy? Slimming Large Language Models by Cutting Off the Layers

IJCNN 2025

#p-kreuter #p-schuetze

WMZ+25

Evaluating Zero-Shot Multilingual Aspect-Based Sentiment Analysis With Large Language Models

International Journal of Machine Learning and Cybernetics 16.10. Jun. 2025

#p-kreuter

BAK+25

Human Preferences in Large Language Model Latent Space: A Technical Analysis on the Reliability of Synthetic Data in Voting Outcome Prediction

AAPOR 2025

#p-kreuter

Kon25

How ML-Filtered Answer Options Shape Responses and Interactions in CATI Surveys

AAPOR 2025

#p-kreuter

KFS+25

Algorithms for Reliable Decision-Making Need Causal Reasoning

Nature Computational Science 5. May 2025

#p-feuerriegel #p-kern #p-kreuter

KBC+25

NLP for Social Good: A Survey of Challenges, Opportunities, and Responsible Deployment

Preprint (May 2025)

#p-fraser #p-kreuter

SMA+25

Connecting Natural Language Processing and Survey Methodology: Potentials, Challenges, and Open Questions

Preprint (May 2025)

#p-kreuter

WAK+25a

AI Conversational Interviewing: Transforming Surveys With LLMs as Adaptive Interviewers

LaTeCH-CLfL @NAACL 2025

#p-bischl #p-kreuter

MHH25

Can Large Language Models Advance Crosswalks? The Case of Danish Occupation Codes

SRW @NAACL 2025

#p-kreuter

HHW25

Vox Populi, Vox AI? Using Language Models to Estimate German Public Opinion

Social Science Computer Review Online First. Apr. 2025

#p-kreuter

WMD+25

Multi-Scale and Multi-Objective Optimization for Cross-Lingual Aspect-Based Sentiment Analysis

Preprint (Feb. 2025)

#p-kreuter

BKD+25

Toward Integrating ChatGPT Into Satellite Image Annotation Workflows: A Comparison of Label Quality and Costs of Human and Automated Annotators

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 18. Jan. 2025

#p-kreuter

KSK25

The Impact of Question Framing on the Precision of Automatic Occupation Coding

Preprint (Jan. 2025)

#p-kreuter

FKB+24

Bridging the Gap: Towards an Expanded Toolkit for AI-Driven Decision-Making in the Public Sector

Government Information Quarterly 41.4. Dec. 2024

#p-kern #p-kreuter

MWH+24

The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models

Findings @EMNLP 2024

#p-hedderich #p-kreuter #p-plank

KBM+24a

When Small Decisions Have Big Impact: Fairness Implications of Algorithmic Profiling Schemes

ACM Journal on Responsible Computing. Nov. 2024

#p-kern #p-kreuter

WHM24

Look at the Text: Instruction-Tuned Language Models Are More Robust Multiple Choice Selectors Than You Think

COLM 2024

#p-kreuter #p-plank

SK24

Connecting Algorithmic Fairness to Quality Dimensions in Machine Learning in Official Statistics and Survey Production

AStA Wirtschafts- und Sozialstatistisches Archiv 18. Oct. 2024

#p-kern #p-kreuter

HHW24a

United in Diversity? Contextual Biases in LLM-Based Predictions of the 2024 European Parliament Elections

Preprint (Sep. 2024)

#p-kreuter

Ma24

Evaluating Lexical Aspect With Large Language Models

CMCL @ACL 2024

#p-kreuter

DDS+24

Informing Climate Risk Analysis Using Textual Information - A Research Agenda

ClimateNLP @ACL 2024

#p-fraser #p-kreuter

WMH+24

My Answer Is C: First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models

Findings @ACL 2024

#p-kreuter #p-plank

EPK24

Position: Insights From Survey Methodology Can Improve Training Data

ICML 2024

#p-kreuter #p-plank

FKK24

The Missing Link: Allocation Performance in Causal Machine Learning

Workshop Humans, Algorithmic Decision-Making and Society @ICML 2024

#p-kern #p-kreuter

RMH+24

TOPCAT: Topic-Oriented Protocol for Content Analysis of Text – A Preliminary Study

NLP+CSS @NAACL 2024

#p-kreuter

BKP24

Understanding Jailbreak Success: A Study of Latent Space Dynamics in Large Language Models

Preprint (Jun. 2024)

#p-kreuter

MNY+24a

ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks

EACL 2024

#p-kreuter #p-schuetze

BEM+24

Order Effects in Annotation Tasks: Further Evidence of Annotation Sensitivity

UncertaiNLP @EACL 2024

#p-kreuter

ZYM+23a

Baby’s CoThought: Leveraging Large Language Models for Enhanced Reasoning in Compact Models

BabyLM Challenge @CoNLL 2023

#p-kreuter #p-ruegamer #p-schuetze

ZMB+23

Proteasomal Cleavage Prediction: State-of-the-Art and Future Directions

Preprint (Oct. 2023)

#p-bischl #p-kreuter

KBB+23

On the Challenges and Practices of Reinforcement Learning From Real Human Feedback

HLDM @ECML-PKDD 2023

#p-huellermeier #p-kreuter

MNS+23

Is Prompt-Based Finetuning Always Better Than Vanilla Finetuning? Insights From Cross-Lingual Language Understanding

KONVENS 2023

#p-kreuter #p-schuetze

THM+23

Augmenting Survey Data With Digital Trace Data: Is There a Threat to Panel Retention?

Journal of Survey Statistics and Methodology 11.3. Jun. 2023

#p-kreuter

ZMN+22

What Cleaves? Is Proteasomal Cleavage Prediction Reaching a Ceiling?

LMRL @NeurIPS 2022

#p-bischl #p-kreuter #p-ruegamer #p-schuetze

RBW+22

Association of Non-Pharmaceutical Interventions to Reduce the Spread of SARS-CoV-2 With Anxiety and Depressive Symptoms: A Multi-National Study of 43 Countries

International Journal of Public Health 67. Mar. 2022

#p-kreuter

VDK+22

Package ‘PracTools’

2022

#p-kreuter
