Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead

MCML Authors

Dr. Michael Hedderich

JRG Leader, Human-Centered NLP

Abstract

With over 2,000 languages and potentially millions of speakers, Africa represents one of the richest linguistic regions in the world. Yet this diversity is scarcely reflected in state-of-the-art natural language processing (NLP) systems and large language models (LLMs), which predominantly support a narrow set of high-resource languages. This exclusion not only limits the reach and utility of modern NLP technologies but also risks widening the digital divide across linguistic communities. Nevertheless, NLP research on African languages is active and growing. In recent years, there has been a surge of interest in this area, driven by several factors, including the creation of multilingual language resources, the rise of community-led initiatives, and increased support through funding programs. In this survey, we analyze 734 research papers on NLP for African languages published over the past five years, offering a comprehensive overview of recent progress across core tasks. We identify key trends shaping the field and conclude by outlining promising directions to foster more inclusive and sustainable NLP research for African languages.

inproceedings


EMNLP 2025

Conference on Empirical Methods in Natural Language Processing. Suzhou, China, Nov 04-09, 2025. To be published. Preprint available.
A* Conference

Authors

J. O. Alabi • M. A. Hedderich • D. I. Adelani • D. Klakow

Research Area

 B2 | Natural Language Processing

BibTeXKey: AHI+25
