Home  | Research | Research Groups | Fraser

Research Group Alexander Fraser


Link to website at TUM PI Matchmaking

Alexander Fraser

Prof. Dr.

Principal Investigator

Alexander Fraser

holds the Chair for Data Analytics & Statistics at TU Munich.

He is renowned for his work in machine learning approaches to machine translation, language modeling, and multilingual natural language processing. He focuses on addressing data sparsity and integrating linguistic and world knowledge in AI systems. Additionally, he collaborates with language communities to develop technology for their languages. His contributions to natural language processing and machine learning emphasize both theoretical advancements and practical applications.

Team members @MCML

PostDocs

Link to website

Daryna Dementieva

Dr.

Link to website

Lukas Edman

Dr.

Link to website

Shu Okabe

Dr.

PhD Students

Link to website

Faeze Ghorbanpour

Link to website

Katharina Hämmerl

Link to website

Wen Lai

Link to website

Tsedeniya Kinfe Temesgen

Recent News @MCML

Link to MCML at ACL 2025

MCML at ACL 2025

Link to MCML at ICML 2025

MCML at ICML 2025

Link to MCML at NAACL 2025

MCML at NAACL 2025

Link to MCML at EMNLP 2024

MCML at EMNLP 2024

Link to Alexander Fraser Receives EU Funding for Research on LLMs

16.10.2024

Alexander Fraser Receives EU Funding for Research on LLMs

Publications @MCML

2025


[46] A* Conference
F. GhorbanpourD. DementievaA. Fraser
Data-Efficient Hate Speech Detection via Cross-Lingual Nearest Neighbor Retrieval with Limited Labeled Data.
EMNLP 2025 - Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. To be published. Preprint available.

[45] A* Conference
Y. Shen • W. Lai • S. Wang • G. Gao • K. Luo • A. Fraser • M. Sun
From Unaligned to Aligned: Scaling Multilingual LLMs with Multi-Way Parallel Corpora.
EMNLP 2025 - Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. To be published. Preprint available.

[44] A* Conference
D. Dementieva • N. Babakov • A. Fraser
EmoBench-UA: A Benchmark Dataset for Emotion Detection in Ukrainian.
Findings @EMNLP 2025 - Findings of the Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. To be published. Preprint available.

[43]
S. OkabeD. Dementieva • M. Di Marco • L. EdmanK. Hämmerl • M. Měškank • A. Hendrichowa • A. Fraser
Findings of the WMT 2025 Shared Task LLMs with Limited Resources for Slavic Languages: MT and QA.
WMT @EMNLP 2025 - 10th Conference on Machine Translation at the Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. PDF

[42]
F. GhorbanpourA. Fraser
Evaluating the Sensitivity of LLMs to Harmful Contents in Long Input.
Preprint (Oct. 2025).

[41] A* Conference
F. Friedrich • K. Hämmerl • P. Schramowski • M. Brack • J. Libovicky • K. Kersting • A. Fraser
Multilingual Text-to-Image Generation Magnifies Gender Stereotypes and Prompt Engineering May Not Help You.
ACL 2025 - 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. URL

[40] A* Conference
L. Kinder • L. EdmanA. Fraser • T. Käfer
Positional Overload: Positional Debiasing and Context Window Extension for Large Language Models using Set Encoding.
ACL 2025 - 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. URL

[39] A* Conference
S. OkabeK. HämmerlA. Fraser
Improving Parallel Sentence Mining for Low-Resource and Endangered Languages.
ACL 2025 - 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. URL

[38]
H. Asadpour • S. OkabeA. Fraser
A Practical Tool to Help Automate Interlinear Glossing: a Study on Mukrī Kurdish.
Field Matters @ACL 2025 - 4th Workshop on NLP Applications to Field Linguistics at the 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. URL

[37] A* Conference
L. Edman • H. Schmid • A. Fraser
EXECUTE: A Multilingual Benchmark for LLM Token Understanding.
Findings @ACL 2025 - Findings at the 63rd Annual Meeting of the Association for Computational Linguistics. Vienna, Austria, Jul 27-Aug 01, 2025. URL

[36] A* Conference
W. LaiA. Fraser • I. Titov
Joint Localization and Activation Editing for Low-Resource Fine-Tuning.
ICML 2025 - 42nd International Conference on Machine Learning. Vancouver, Canada, Jul 13-19, 2025. URL

[35]
F. Ghorbanpour • T. Z. Malaguth • A. Akbaritabar
Differentiating Emigration from Return Migration of Scholars Using Name-Based Nationality Detection Models.
ICWSM 2025 - 19th International AAAI Conference on Web and Social Media. Copenhagen, Denmark, Jun 23-26, 2025. DOI

[34]
F. GhorbanpourD. DementievaA. Fraser
Can Prompting LLMs Unlock Hate Speech Detection across Languages? A Zero-shot and Few-shot Study.
Preprint (May. 2025).

[33]
A. Karamolegkou • A. Borah • E. Cho • S. R. Choudhury • M. Galletti • R. Ghosh • P. Gupta • O. Ignat • P. Kargupta • N. Kotonya • H. Lamba • S.-J. Lee • A. Mangla • I. Mondal • D. Nazarova • P. Nemkova • D. Pisarevskaya • N. Rizwan • N. Sabri • D. Stammbach • A. Steinberg • D. Tomás • S. R. Wilson • B. Yi • J. H. Zhu • A. Zubiaga • A. Søgaard • A. Fraser • Z. Jin • R. Mihalcea • J. R. Tetreault • D. Dementieva
NLP for Social Good: A Survey of Challenges, Opportunities, and Responsible Deployment.
Preprint (May. 2025).

[32] A Conference
F. Ghorbanpour • V. Hangya • A. Fraser
Fine-Grained Transfer Learning for Harmful Content Detection through Label-Specific Soft Prompt Tuning.
NAACL 2025 - Annual Conference of the North American Chapter of the Association for Computational Linguistics. Albuquerque, NM, USA, Apr 29-May 04, 2025. DOI

[31] A Conference
K. Hämmerl • T. Limisiewicz • J. Libovický • A. Fraser
Beyond Literal Token Overlap: Token Alignability for Multilinguality.
NAACL 2025 - Annual Conference of the North American Chapter of the Association for Computational Linguistics. Albuquerque, NM, USA, Apr 29-May 04, 2025. DOI

[30]
S. OkabeA. Fraser
Bilingual Sentence Mining for Low-Resource Languages: a Case Study on Upper and Lower Sorbian.
Compute-EL @ICLDC 2025 - 8th Workshop on The Use of Computational Methods in the Study of Endangered Languages at the 9th International Conference on Language Documentation and Conservation. Honolulu, Hawaii, USA, Mar 06-06, 2025. URL

[29]
Y. Shen • W. Lai • S. Wang • X. Zhang • K. Luo • A. Fraser • M. Sun
DCAD-2000: A Multilingual Dataset across 2000+ Languages with Data Cleaning as Anomaly Detection.
Preprint (Feb. 2025).

[28]
Y. Zhang • V. Hangya • A. Fraser
LLM Sensitivity Challenges in Abusive Language Detection: Instruction-Tuned vs. Human Feedback.
COLING 2025 - The 31st International Conference on Computational Linguistics. Abu Dhabi, United Arab Emirates, Jan 19-24, 2025. URL

2024


[27] A* Conference
M. Di Marco • A. Fraser
Subword Segmentation in LLMs: Looking at Inflection and Consistency.
EMNLP 2024 - Conference on Empirical Methods in Natural Language Processing. Miami, FL, USA, Nov 12-16, 2024. DOI

[26] A* Conference
L. Edman • H. Schmid • A. Fraser
CUTE: Measuring LLMs’ Understanding of Their Tokens.
EMNLP 2024 - Conference on Empirical Methods in Natural Language Processing. Miami, FL, USA, Nov 12-16, 2024. DOI

[25] A* Conference
W. LaiV. HangyaA. Fraser
Style-Specific Neurons for Steering LLMs in Text Style Transfer.
EMNLP 2024 - Conference on Empirical Methods in Natural Language Processing. Miami, FL, USA, Nov 12-16, 2024. DOI

[24]
K. Hämmerl • A. Manea • G. Vico • J. Helcl • J. Libovický
CUNI and LMU Submission to the MRL 2024 Shared Task on Multi-lingual Multi-task Information Retrieval.
MRL @EMNLP 2024 - 4th Multilingual Representation Learning Workshop at the Conference on Empirical Methods in Natural Language Processing. Miami, FL, USA, Nov 12-16, 2024. DOI

[23]
L. Edman • L. Bylinina • F. GhorbanpourA. Fraser
Are BabyLMs Second Language Learners?
Preprint (Oct. 2024).

[22]
A. Dimmelmeier • H. Doll • M. Schierholz • E. Kormanyos • M. Fehr • B. MaJ. BeckA. FraserF. Kreuter
Informing climate risk analysis using textual information - A research agenda.
ClimateNLP @ACL 2024 - 1st Workshop on Natural Language Processing Meets Climate Change at the 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. DOI

[21] A* Conference
K. Hämmerl • J. Libovický • A. Fraser
Understanding Cross-Lingual Alignment—A Survey.
Findings @ACL 2024 - Findings of the 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. DOI

[20] A* Conference
W. Lai • M. Mesgar • A. Fraser
LLMs Beyond English: Scaling the Multilingual Capability of LLMs with Cross-Lingual Feedback.
Findings @ACL 2024 - Findings of the 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand, Aug 11-16, 2024. DOI

[19]
P. Piccirilli • A. Fraser • S. Schulte im Walde
VOLIMET: A Parallel Corpus of Literal and Metaphorical Verb-Object Pairs for English–German and English–French.
*SEM 2024 - 13th Joint Conference on Lexical and Computational Semantics co-located with NAACL 2024. Mexico City, Mexico, Jun 20-21, 2024. DOI

[18]
Y. Zhang • V. HangyaA. Fraser
A Study of the Class Imbalance Problem in Abusive Language Detection.
WOAH @NAACL 2024 - 8th Workshop on Online Abuse and Harms at the Annual Conference of the North American Chapter of the Association for Computational Linguistics. Mexico City, Mexico, Jun 16-21, 2024. DOI

[17]
V. HangyaA. Fraser
How to Solve Few-Shot Abusive Content Detection Using the Data We Actually Have.
LREC-COLING 2024 - Joint International Conference on Computational Linguistics, Language Resources and Evalutaion. Torino, Italy, May 20-25, 2024. URL

[16]
M. Marco • A. Fraser
Analyzing the Understanding of Morphologically Complex Words in Large Language Models.
LREC-COLING 2024 - Joint International Conference on Computational Linguistics, Language Resources and Evalutaion. Torino, Italy, May 20-25, 2024. URL

[15]

2023


[14] A* Conference
M. Weller-Di Marco • K. HämmerlA. Fraser
A Study on Accessing Linguistic Information in Pre-Trained Language Models by Using Prompts.
EMNLP 2023 - Conference on Empirical Methods in Natural Language Processing. Singapore, Dec 06-10, 2023. DOI

[13] A* Conference
W. LaiA. ChronopoulouA. Fraser
Mitigating Data Imbalance and Representation Degeneration in Multilingual Machine Translation.
Findings @EMNLP 2023 - Findings of the Conference on Empirical Methods in Natural Language Processing. Singapore, Dec 06-10, 2023. DOI

[12]
V. Hangya • S. Severini • R. Ralev • A. FraserH. Schütze
Multilingual Word Embeddings for Low-Resource Languages using Anchors and a Chain of Related Languages.
MRL @EMNLP 2023 - 3rd Workshop on Multi-lingual Representation Learning at the Conference on Empirical Methods in Natural Language Processing. Singapore, Dec 06-10, 2023. DOI

[11]
W. LaiV. HangyaA. Fraser
Extending Multilingual Machine Translation through Imitation Learning.
Preprint (Nov. 2023).

[10]
V. HangyaA. Fraser
LMU at HaSpeeDe3: Multi-Dataset Training for Cross-Domain Hate Speech Detection.
EVALITA 2023 - Final Workshop of the 8th evaluation campaign. Parma, Italy, Sep 07-08, 2023. PDF

[9] A* Conference
K. Hämmerl • B. Deiseroth • P. Schramowski • J. Libovický • C. Rothkopf • A. Fraser • K. Kersting
Speaking Multiple Languages Affects the Moral Bias of Language Models.
Findings @ACL 2023 - Findings of the 61th Annual Meeting of the Association for Computational Linguistics. Toronto, Canada, Jul 09-14, 2023. DOI

[8] A* Conference
K. Hämmerl • A. Fastowski • J. Libovický • A. Fraser
Exploring Anisotropy and Outliers in Multilingual Language Models for Cross-Lingual Semantic Sentence Similarity.
Findings @ACL 2023 - Findings of the 61th Annual Meeting of the Association for Computational Linguistics. Toronto, Canada, Jul 09-14, 2023. DOI

[7]
Y. LiuA. ChronopoulouH. SchützeA. Fraser
On the Copying Problem of Unsupervised NMT: A Training Schedule with a Language Discriminator Loss.
IWSLT 2023 - 20th International Conference on Spoken Language Translation. Toronto, Canada, Jul 09-14, 2023. DOI

[6] A Conference
A. Chronopoulou • M. Peters • A. Fraser • J. Dodge
AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models.
Findings @EACL 2023 - Findings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. Dubrovnik, Croatia, May 02-06, 2023. DOI

[5]
A. Chronopoulou • D. Stojanovski • A. Fraser
Language-Family Adapters for Low-Resource Multilingual Neural Machine Translation.
LoResMT @EACL 2023 - 6th Workshop on Technologies for Machine Translation of Low-Resource Languages at the 17th Conference of the European Chapter of the Association for Computational Linguistics. Dubrovnik, Croatia, May 02-06, 2023. DOI

2022


[4] A* Conference
V. Hangya • H. S. Saadi • A. Fraser
Improving Low-Resource Languages in Pre-Trained Multilingual Language Models.
EMNLP 2022 - Conference on Empirical Methods in Natural Language Processing. Abu Dhabi, United Arab Emirates, Nov 07-11, 2022. DOI

[3] A* Conference
W. LaiA. ChronopoulouA. Fraser
m4 Adapter: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter.
Findings @EMNLP 2022 - Findings of the Conference on Empirical Methods in Natural Language Processing. Abu Dhabi, United Arab Emirates, Nov 07-11, 2022. DOI

[2]
H. S. Saadi • V. Hangya • T. Eder • A. Fraser
Comparative Analysis of Cross-lingual Contextualized Word Embeddings.
MRL 2022 @EMNLP 2022 - 2nd Workshop on Multi-lingual Representation Learning at the Conference on Empirical Methods in Natural Language Processing. Abu Dhabi, United Arab Emirates, Nov 07-11, 2022. DOI

[1]
S. Severini • V. HangyaM. J. SabetA. FraserH. Schütze
Don't Forget Cheap Training Signals Before Building Unsupervised Bilingual Word Embeddings.
BUCC @LREC 2022 - 15th Workshop on Building and Using Comparable Corpora at the 13th International Conference on Language Resources and Evaluation. Marseille, France, Jun 21-23, 2022. URL