Combining Language Independent Part-of-Speech Tagging Tools

Authors György Orosz, László János Laki, Attila Novák, Borbála Siklósi




File

OASIcs.SLATE.2013.249.pdf
  • Filesize: 0.72 MB
  • 9 pages



Cite As

György Orosz, László János Laki, Attila Novák, and Borbála Siklósi. Combining Language Independent Part-of-Speech Tagging Tools. In 2nd Symposium on Languages, Applications and Technologies. Open Access Series in Informatics (OASIcs), Volume 29, pp. 249-257, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2013)
https://doi.org/10.4230/OASIcs.SLATE.2013.249

Abstract

Part-of-speech tagging is a fundamental task of natural language processing. For languages with a very rich agglutinating morphology, generic PoS tagging algorithms do not yield very high accuracy due to data sparseness issues. Although integrating a morphological analyzer can solve this problem efficiently, it is a resource-intensive solution. In this paper we show a method of combining language-independent statistical PoS tagging solutions, including a statistical machine translation tool, to effectively boost tagging accuracy. Our experiments show that, using the same training set, our combination of language-independent tools yields an accuracy that approaches that of a language-dependent system with an integrated morphological analyzer.
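
The abstract does not state how the individual taggers' outputs are combined, so the following is only an illustrative sketch of one common combination scheme, per-token majority voting, and should not be read as the authors' method. The function name `combine_by_majority`, the tie-breaking rule (prefer the earliest tagger in the list), and the tag labels are all assumptions made for this example.

```python
from collections import Counter
from typing import List, Sequence


def combine_by_majority(predictions: Sequence[Sequence[str]]) -> List[str]:
    """Combine aligned tag sequences from several taggers by per-token voting.

    `predictions[i][j]` is the tag that tagger i assigned to token j.
    Ties are broken in favour of the tagger listed first (an assumption:
    callers are expected to order taggers from most to least trusted).
    """
    if not predictions:
        return []

    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all taggers must label the same number of tokens")

    combined = []
    for token_tags in zip(*predictions):
        counts = Counter(token_tags)
        best = max(counts.values())
        # Pick the first tag, in tagger order, that reaches the top vote count.
        winner = next(tag for tag in token_tags if counts[tag] == best)
        combined.append(winner)
    return combined
```

With three hypothetical taggers that disagree on some tokens, the tag chosen for each token is the one most taggers agree on; where every tagger proposes a different tag, the first tagger's choice is kept. Voting of this kind needs no language-specific resources, which is in keeping with the language-independent setting the abstract describes.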
Keywords
  • part-of-speech tagging
  • combination
  • agglutinative languages
  • machine learning
  • machine translation
