2 Search Results for "Laki, László"


Document
Combining Language Independent Part-of-Speech Tagging Tools

Authors: György Orosz, László János Laki, Attila Novák, and Borbála Siklósi

Published in: OASIcs, Volume 29, 2nd Symposium on Languages, Applications and Technologies (2013)


Abstract
Part-of-speech tagging is a fundamental task of natural language processing. For languages with a very rich agglutinating morphology, generic PoS tagging algorithms do not yield very high accuracy due to data sparseness issues. Though integrating a morphological analyzer can efficiently solve this problem, this is a resource-intensive solution. In this paper we show a method of combining language independent statistical solutions -- including a statistical machine translation tool -- of PoS-tagging to effectively boost tagging accuracy. Our experiments show that, using the same training set, our combination of language independent tools yield an accuracy that approaches that of a language dependent system with an integrated morphological analyzer.

Cite as

György Orosz, László János Laki, Attila Novák, and Borbála Siklósi. Combining Language Independent Part-of-Speech Tagging Tools. In 2nd Symposium on Languages, Applications and Technologies. Open Access Series in Informatics (OASIcs), Volume 29, pp. 249-257, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2013)


Copy BibTex To Clipboard

@InProceedings{orosz_et_al:OASIcs.SLATE.2013.249,
  author =	{Orosz, Gy\"{o}rgy and Laki, L\'{a}szl\'{o} J\'{a}nos and Nov\'{a}k, Attila and Sikl\'{o}si, Borb\'{a}la},
  title =	{{Combining Language Independent Part-of-Speech Tagging Tools}},
  booktitle =	{2nd Symposium on Languages, Applications and Technologies},
  pages =	{249--257},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-939897-52-1},
  ISSN =	{2190-6807},
  year =	{2013},
  volume =	{29},
  editor =	{Leal, Jos\'{e} Paulo and Rocha, Ricardo and Sim\~{o}es, Alberto},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2013.249},
  URN =		{urn:nbn:de:0030-drops-40441},
  doi =		{10.4230/OASIcs.SLATE.2013.249},
  annote =	{Keywords: part-of-speech tagging, combination, agglutinative languages, machine learning, machine translation}
}
Document
Investigating the Possibilities of Using SMT for Text Annotation

Authors: László Laki

Published in: OASIcs, Volume 21, 1st Symposium on Languages, Applications and Technologies (2012)


Abstract
In this paper I examine the applicability of SMT methodology for part-of-speech disambiguation and lemmatization in Hungarian. After the baseline system was created, different methods and possibilities were used to improve the efficiency of the system. I also applied some methods to decrease the size of the target dictionary and to find a proper solution to handle out-of-vocabulary words. The results show that such a light-weight system performs comparable results to other state-of-the-art systems.

Cite as

László Laki. Investigating the Possibilities of Using SMT for Text Annotation. In 1st Symposium on Languages, Applications and Technologies. Open Access Series in Informatics (OASIcs), Volume 21, pp. 267-283, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2012)


Copy BibTex To Clipboard

@InProceedings{laki:OASIcs.SLATE.2012.267,
  author =	{Laki, L\'{a}szl\'{o}},
  title =	{{Investigating the Possibilities of Using SMT for Text Annotation}},
  booktitle =	{1st Symposium on Languages, Applications and Technologies},
  pages =	{267--283},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-939897-40-8},
  ISSN =	{2190-6807},
  year =	{2012},
  volume =	{21},
  editor =	{Sim\~{o}es, Alberto and Queir\'{o}s, Ricardo and da Cruz, Daniela},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2012.267},
  URN =		{urn:nbn:de:0030-drops-35285},
  doi =		{10.4230/OASIcs.SLATE.2012.267},
  annote =	{Keywords: SMT, POS-tagging, Lemmatization, Target language set, OOV}
}
  • Refine by Author
  • 1 Laki, László
  • 1 Laki, László János
  • 1 Novák, Attila
  • 1 Orosz, György
  • 1 Siklósi, Borbála

  • Refine by Classification

  • Refine by Keyword
  • 1 Lemmatization
  • 1 OOV
  • 1 POS-tagging
  • 1 SMT
  • 1 Target language set
  • Show More...

  • Refine by Type
  • 2 document

  • Refine by Publication Year
  • 1 2012
  • 1 2013

Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail