|
Advanced search
Previous page
|
Title
Design and Development of Part-of-Speech-Tagging Resources for Wolof (Niger-Congo, spoken in Senegal) |
Full text
https://pub.uni-bielefeld.de/record/2955833 |
Date
2010 |
Author(s)
Dione, Cheikh M. Bamba; Kuhn, Jonas; Zarrieß, Sina |
Abstract
Dione CMB, Kuhn J, Zarrieß S. Design and Development of Part-of-Speech-Tagging Resources for Wolof (Niger-Congo, spoken in Senegal). In: <em>Proceedings of the Seventh International Conference on Language Resources and Evaluation ({LREC}'10)</em>. Valletta, Malta: European Language Resources Association (ELRA); 2010. - In this paper, we report on the design of a part-of-speech-tagset for Wolof and on the creation of a semi-automatically annotated gold standard. In order to achieve high-quality annotation relatively fast, we first generated an accurate lexicon that draws on existing word and name lists and takes into account inflectional and derivational morphology. The main motivation for the tagged corpus is to obtain data for training automatic taggers with machine learning approaches. Hence, we took machine learning considerations into account during tagset design and we present training experiments as part of this paper. The best automatic tagger achieves an accuracy of 95.2{\%} in cross-validation experiments. We also wanted to create a basis for experimenting with annotation projection techniques, which exploit parallel corpora. For this reason, it was useful to use a part of the Bible as the gold standard corpus, for which sentence-aligned parallel versions in many languages are easy to obtain. We also report on preliminary experiments exploiting a statistical word alignment of the parallel text. |
Language
eng |
Publisher
European Language Resources Association (ELRA) |
Type of publication
http://purl.org/coar/resource_type/c_5794; info:eu-repo/semantics/conferenceObject; doc-type:conferenceObject; text |
Rights
info:eu-repo/semantics/closedAccess |
Repository
Bielefeld - University of Bielefeld
|
Added to C-A: 2021-06-24;07:20:45 |
© Connecting-Africa 2004-2024 | Last update: Saturday, July 6, 2024 |
Webmaster
|