Investigating the Possibilities of Using SMT for Text Annotation

Author László Laki


Cite As

László Laki. Investigating the Possibilities of Using SMT for Text Annotation. In 1st Symposium on Languages, Applications and Technologies. Open Access Series in Informatics (OASIcs), Volume 21, pp. 267-283, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2012)


In this paper I examine the applicability of statistical machine translation (SMT) methodology to part-of-speech disambiguation and lemmatization in Hungarian. After creating a baseline system, I tried several methods to improve its efficiency. I also applied methods to reduce the size of the target dictionary and to handle out-of-vocabulary (OOV) words properly. The results show that such a lightweight system achieves results comparable to those of other state-of-the-art systems.

Keywords

  • SMT
  • POS-tagging
  • Lemmatization
  • Target language set
  • OOV

