Investigating the Possibilities of Using SMT for Text Annotation

Laki, László

doi:10.4230/OASIcs.SLATE.2012.267

Document

Investigating the Possibilities of Using SMT for Text Annotation

Author László Laki

Part of: Volume: 1st Symposium on Languages, Applications and Technologies (SLATE 2012)
Part of: Series: Open Access Series in Informatics (OASIcs)
Part of: Conference: Symposium on Languages, Applications and Technologies (SLATE)
License: Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported license
Publication Date: 2012-06-21

PDF

File

PDF

OASIcs.SLATE.2012.267.pdf

Filesize: 0.54 MB
17 pages

Document Identifiers

DOI: 10.4230/OASIcs.SLATE.2012.267
URN: urn:nbn:de:0030-drops-35285

Subject Classification

Keywords

SMT
POS-tagging
Lemmatization
Target language set
OOV

Metrics

Access Statistics
Total Accesses (updated on a weekly basis)

0

Document

0

Metadata

Abstract

In this paper I examine the applicability of SMT methodology for part-of-speech disambiguation and lemmatization in Hungarian. After the baseline system was created, different methods and possibilities were used to improve the efficiency of the system. I also applied some methods to decrease the size of the target dictionary and to find a proper solution to handle out-of-vocabulary words. The results show that such a light-weight system performs comparable results to other state-of-the-art systems.

Cite As Get BibTex

László Laki. Investigating the Possibilities of Using SMT for Text Annotation. In 1st Symposium on Languages, Applications and Technologies. Open Access Series in Informatics (OASIcs), Volume 21, pp. 267-283, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2012) https://doi.org/10.4230/OASIcs.SLATE.2012.267

Author Details

László Laki

Any Issues?

Feedback on the Current Page

Thanks for your feedback!

Feedback submitted to Dagstuhl Publishing

Could not send message

Please try again later or send an E-mail