Investigating the Possibilities of Using SMT for Text Annotation

Author László Laki




File

OASIcs.SLATE.2012.267.pdf
  • Filesize: 0.54 MB
  • 17 pages

Cite As

László Laki. Investigating the Possibilities of Using SMT for Text Annotation. In 1st Symposium on Languages, Applications and Technologies. Open Access Series in Informatics (OASIcs), Volume 21, pp. 267-283, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2012) https://doi.org/10.4230/OASIcs.SLATE.2012.267

Abstract

In this paper I examine the applicability of statistical machine translation (SMT) methodology to part-of-speech disambiguation and lemmatization in Hungarian. After creating a baseline system, I tried several methods to improve its efficiency. I also applied methods to reduce the size of the target dictionary and to handle out-of-vocabulary (OOV) words. The results show that such a lightweight system performs comparably to other state-of-the-art systems.

Subject Classification

Keywords
  • SMT
  • POS-tagging
  • Lemmatization
  • Target language set
  • OOV
