
MaiBaam Annotation Guidelines


Abstract

This document provides the annotation guidelines for MaiBaam, a Bavarian corpus manually annotated with part-of-speech (POS) tags, syntactic dependencies, and German lemmas. MaiBaam belongs to the Universal Dependencies (UD) project, and our annotations elaborate on the general and German UD version 2 guidelines. In this document, we detail how to preprocess and tokenize Bavarian data, provide an overview of the POS tags and dependencies we use, explain annotation decisions that would also apply to closely related languages like German, and introduce and motivate decisions that are specific to Bavarian grammar.

misc


Preprint

Oct. 2024

Authors

V. Blaschke • B. Kovačić • S. Peng • B. Plank


Research Area

 B2 | Natural Language Processing

BibTeXKey: BKP+24
