Do Large Language Models Think Like the Brain? Sentence-Level Evidence From FMRI and Hierarchical Embeddings
AAAI 2026
#p-kreuter
BWH+26
Clustering Mouse Movement Behavior in Surveys Using ResNet Embeddings
NLDL 2026
#p-kreuter
WBL+26
Comparison of Neural Networks and Gradient Boosting Models on Ordinal Age Class Prediction Using Mouse Trajectories
NLDL 2026
#p-kreuter
WML+25
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis
EMNLP 2025
#p-kreuter#p-plank#p-schuetze
WCL+25
Multimodal Emotion Recognition in Conversations: A Survey of Methods, Trends, Challenges and Prospects
Findings @EMNLP 2025
#p-kreuter
EMK+25
Aligning NLP Models With Target Population Perspectives Using PAIR: Population-Aligned Instance Replication
NLPerspectives @EMNLP 2025
#p-kern#p-kreuter#p-plank
KSG+25
Unintended Impacts of Automation for Integration? Simulating Integration Outcomes of Algorithm-Based Refugee Allocation in Germany
AIES 2025
#p-kern#p-kreuter
Hey25
Who Counts? the Potentials and Pitfalls of Using LLMs in Survey Research
NLPOR @COLM 2025
#p-kreuter
HHW+25a
AIn't Nothing but a Survey? Using Large Language Models for Coding German Open-Ended Survey Responses on Survey Motivation
NLPOR @COLM 2025
#p-kreuter
BHR+25
Toward Understanding the Transferability of Adversarial Suffixes in Large Language Models
Preprint (Oct. 2025)
#p-kreuter
BH25
Don't Walk the Line: Boundary Guidance for Filtered Generation
Preprint (Oct. 2025)
#p-kreuter
HWN+25a
Systematic Evaluation of Uncertainty Estimation Methods in Large Language Models
Preprint (Oct. 2025)
#p-kreuter
MCS+25
Too Open for Opinion? Embracing Open-Endedness in Large Language Models for Social Simulation
Preprint (Oct. 2025)
#p-kreuter#p-plank
MYH+25a
Capabilities and Evaluation Biases of Large Language Models in Classical Chinese Poetry Generation: A Case Study on Tang Poetry
Preprint (Oct. 2025)
#p-kreuter
SFF+25
Bias Begins With Data: The FairGround Corpus for Robust and Reproducible Research on Algorithmic Fairness
Preprint (Oct. 2025)
#p-kern#p-kreuter
ZMF+25
Table Question Answering in the Era of Large Language Models: A Comprehensive Survey of Tasks, Methods, and Evaluation
Preprint (Oct. 2025)
#p-kreuter
DKG+25
Problem Solving Through Human-AI Preference-Based Cooperation
Computational Linguistics. Sep. 2025
#p-huellermeier#p-kreuter#p-schuetze
Kre25a
Modernizing Data Collection
Journal of Official Statistics 41.3. Sep. 2025
#p-kreuter
BEK+25
Bias in the Loop: How Humans Evaluate AI-Generated Suggestions
Preprint (Sep. 2025)
#p-kern#p-kreuter
HHW+25
AIn't Nothing but a Survey? Using Large Language Models for Coding German Open-Ended Survey Responses on Survey Motivation
JSM 2025
#p-kreuter
BSD+25
Addressing Data Gaps in Sustainability Reporting: A Benchmark Dataset for Greenhouse Gas Emission Extraction
Scientific Data 12.1497. Aug. 2025
#p-kreuter
MLZ+25
Pragmatics in the Era of Large Language Models: A Survey on Datasets, Evaluation, Opportunities and Challenges
ACL 2025
#p-kreuter#p-plank
MYH+25
Algorithmic Fidelity of Large Language Models in Generating Synthetic German Public Opinions: A Case Study
ACL 2025
#p-bischl#p-kreuter#p-plank
KHK25
Mind the Gap: Gender-Based Differences in Occupational Embeddings
GeBNLP @ACL 2025
#p-kreuter
KS25
Can Large Language Models Advance Occupational Coding? Evidence and Methodological Insights
ESRA 2025
#p-kreuter
Kre25
Adaptive Alignment: Designing AI for a Changing World - Frauke Kreuter
ICML 2025
#p-kreuter
NWK+25
Measuring Public Opinion Towards Artificial Intelligence: Development and Validation of a General AI Attitude Short Scale
AI and Society. Jul. 2025
#p-kern#p-kreuter
FKK25
Adjusting Survey Estimates With Multi-Accuracy Post-Processing
ITACOSM 2025
#p-kern#p-kreuter
BGG+25
On the Impossibility of Separating Intelligence From Judgment: The Computational Intractability of Filtering for AI Alignment
Preprint (Jul. 2025)
#p-kreuter
SKB25a
Fares on Fairness: Using a Total Error Framework to Examine the Role of Measurement and Representation in Training Data on Model Fairness and Bias
EWAF 2025
#p-kern#p-kreuter
YNM+25
Why Lift So Heavy? Slimming Large Language Models by Cutting Off the Layers
IJCNN 2025
#p-kreuter#p-schuetze
WMZ+25
Evaluating Zero-Shot Multilingual Aspect-Based Sentiment Analysis With Large Language Models
International Journal of Machine Learning and Cybernetics. Jun. 2025
#p-kreuter
Kon25
How ML-Filtered Answer Options Shape Responses and Interactions in CATI Surveys
AAPOR 2025
#p-kreuter
BAK+25
Human Preferences in Large Language Model Latent Space: A Technical Analysis on the Reliability of Synthetic Data in Voting Outcome Prediction
AAPOR 2025
#p-kreuter
KFS+25
Algorithms for Reliable Decision-Making Need Causal Reasoning
Nature Computational Science 5. May. 2025
#p-feuerriegel#p-kern#p-kreuter
KBC+25
NLP for Social Good: A Survey of Challenges, Opportunities, and Responsible Deployment
Preprint (May. 2025)
#p-fraser#p-kreuter
SMA+25
Connecting Natural Language Processing and Survey Methodology: Potentials, Challenges, and Open Questions
Preprint (May. 2025)
#p-kreuter
WAK+25a
AI Conversational Interviewing: Transforming Surveys With LLMs as Adaptive Interviewers
LaTeCH-CLfL @NAACL 2025
#p-bischl#p-kreuter
MHH25
Can Large Language Models Advance Crosswalks? the Case of Danish Occupation Codes
SRW @NAACL 2025
#p-kreuter
HHW25
Vox Populi, Vox AI? Using Language Models to Estimate German Public Opinion
Social Science Computer Review Online First. Apr. 2025
#p-kreuter
WMD+25
Multi-Scale and Multi-Objective Optimization for Cross-Lingual Aspect-Based Sentiment Analysis
Preprint (Feb. 2025)
#p-kreuter
BKD+25
Toward Integrating ChatGPT Into Satellite Image Annotation Workflows: A Comparison of Label Quality and Costs of Human and Automated Annotators
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 18. Jan. 2025
#p-kreuter
KSK25
The Impact of Question Framing on the Precision of Automatic Occupation Coding
Preprint (Jan. 2025)
#p-kreuter
FKB+24
Bridging the Gap: Towards an Expanded Toolkit for AI-Driven Decision-Making in the Public Sector
Government Information Quarterly 41.4. Dec. 2024
#p-kern#p-kreuter
MWH+24
The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models
Findings @EMNLP 2024
#p-hedderich#p-kreuter#p-plank
KBM+24a
When Small Decisions Have Big Impact: Fairness Implications of Algorithmic Profiling Schemes
ACM Journal on Responsible Computing. Nov. 2024
#p-kern#p-kreuter
WHM24
Look at the Text: Instruction-Tuned Language Models Are More Robust Multiple Choice Selectors Than You Think
COLM 2024
#p-kreuter#p-plank
SK24
Connecting Algorithmic Fairness to Quality Dimensions in Machine Learning in Official Statistics and Survey Production
AStA Wirtschafts- Und Sozialstatistisches Archiv 18. Oct. 2024
#p-kern#p-kreuter
HHW24a
United in Diversity? Contextual Biases in LLM-Based Predictions of the 2024 European Parliament Elections
Preprint (Sep. 2024)
#p-kreuter
Ma24
Evaluating Lexical Aspect With Large Language Models
CMCL @ACL 2024
#p-kreuter
DDS+24
Informing Climate Risk Analysis Using Textual Information - A Research Agenda
ClimateNLP @ACL 2024
#p-fraser#p-kreuter
WMH+24
My Answer Is C: First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models
Findings @ACL 2024
#p-kreuter#p-plank
EPK24
Position: Insights From Survey Methodology Can Improve Training Data
ICML 2024
#p-kreuter#p-plank
FKK24
The Missing Link: Allocation Performance in Causal Machine Learning
Workshop Humans, Algorithmic Decision-Making and Society @ICML 2024
#p-kern#p-kreuter
RMH+24
TOPCAT: Topic-Oriented Protocol for Content Analysis of Text – A Preliminary Study
NLP+CSS @NAACL 2024
#p-kreuter
BKP24
Understanding Jailbreak Success: A Study of Latent Space Dynamics in Large Language Models
Preprint (Jun. 2024)
#p-kreuter
MNY+24a
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks
EACL 2024
#p-kreuter#p-schuetze
BEM+24
Order Effects in Annotation Tasks: Further Evidence of Annotation Sensitivity
UncertaiNLP @EACL 2024
#p-kreuter
NYM+24
Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge in English-Centric Large Language Models
Preprint (Feb. 2024)
#p-kreuter#p-schuetze
ZYM+23a
Baby's CoThought: Leveraging Large Language Models for Enhanced Reasoning in Compact Models
BabyLM Challenge @CoNLL 2023)
#p-kreuter#p-ruegamer#p-schuetze
ZMB+23
Proteasomal Cleavage Prediction: State-of-the-Art and Future Directions
Preprint (Oct. 2023)
#p-bischl#p-kreuter
KBB+23
On the Challenges and Practices of Reinforcement Learning From Real Human Feedback
HLDM @ECML-PKDD 2023
#p-huellermeier#p-kreuter
MNS+23
Is Prompt-Based Finetuning Always Better Than Vanilla Finetuning? Insights From Cross-Lingual Language Understanding
KONVENS 2023
#p-kreuter#p-schuetze
THM+23
Augmenting Survey Data With Digital Trace Data: Is There a Threat to Panel Retention?
Journal of Survey Statistics and Methodology 11.3. Jun. 2023
#p-kreuter
ZMN+22
What Cleaves? Is Proteasomal Cleavage Prediction Reaching a Ceiling?
LMRL @NeurIPS 2022
#p-bischl#p-kreuter#p-ruegamer#p-schuetze
RBW+22
Association of Non-Pharmaceutical Interventions to Reduce the Spread of SARS-CoV-2 With Anxiety and Depressive Symptoms: A Multi-National Study of 43 Countries
International Journal of Public Health 67. Mar. 2022