Retreading Dictionaries for the 21st Century

Authors Xavier Gómez Guinovart, Alberto Simões




File

OASIcs.SLATE.2013.115.pdf
  • Filesize: 476 kB
  • 12 pages


Cite As

Xavier Gómez Guinovart and Alberto Simões. Retreading Dictionaries for the 21st Century. In 2nd Symposium on Languages, Applications and Technologies. Open Access Series in Informatics (OASIcs), Volume 29, pp. 115-126, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2013) https://doi.org/10.4230/OASIcs.SLATE.2013.115

Abstract

Even in the 21st century, paper dictionaries are still compiled and developed using standard word processors. Many publishing companies are now converting their dictionaries into computer-readable documents so that they can support new features, such as online availability. Luckily, most of these publishers can pay review teams to fix and even enhance these dictionaries. Unfortunately, research institutions cannot hire that many workers.

In this article we present the process of retreading a Galician dictionary that was first developed and compiled using Microsoft Word. This dictionary was converted, through automatic rewriting, into a subset of the Text Encoding Initiative schema. We detail this process and discuss the problems found. Because a recent orthographic standard changed Galician spelling, the dictionary has also undergone a semi-automatic modernization process. Finally, we show two applications of the resulting dictionaries.

Subject Classification

Keywords
  • dictionary
  • markup language
  • language processing
  • lexical information retrieval
  • Galician language
