Home | Publications | WAL+25

Lost in Multilinguality: Dissecting Cross-Lingual Factual Inconsistency in Transformer Language Models

MCML Authors

Mingyang Wang

→ Group Hinrich Schütze
Computational Linguistics

Yihong Liu

Dr.

→ Group Hinrich Schütze
Computational Linguistics

Ercong Nie

Dr.

* Former Member

→ Group Hinrich Schütze
Computational Linguistics

Hinrich Schütze

Prof. Dr.

Core PI

Computational Linguistics

Abstract

Multilingual language models (MLMs) store factual knowledge across languages but often struggle to provide consistent responses to semantically equivalent prompts in different languages. While previous studies point out this cross-lingual inconsistency issue, the underlying causes remain unexplored. In this work, we use mechanistic interpretability methods to investigate cross-lingual inconsistencies in MLMs. We find that MLMs encode knowledge in a language-independent concept space through most layers, and only transition to language-specific spaces in the final layers. Failures during the language transition often result in incorrect predictions in the target language, even when the answers are correct in other languages. To mitigate this inconsistency issue, we propose a linear shortcut method that bypasses computations in the final layers, enhancing both prediction accuracy and cross-lingual consistency. Our findings shed light on the internal mechanisms of MLMs and provide a lightweight, effective strategy for producing more consistent factual outputs.

inproceedings WAL+25

ACL 2025

63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025.

Authors

M. Wang • H. Adel • L. Lange • Y. Liu • E. Nie • J. Strötgen • H. Schütze

Links

DOI

In Collaboration

Bosch

Research Area

B2 | Natural Language Processing

BibTeXKey: WAL+25

#p-schuetze