License
when quoting this document, please refer to the following
URN: urn:nbn:de:0030-drops-10573
URL: http://drops.dagstuhl.de/opus/volltexte/2007/1057/

Hauser, Andreas ; Heller, Markus ; Leiss, Elisabeth ; Schulz, Klaus U. ; Wanzeck, Christiane

Information Access to Historical Documents from the Early New High German Period

pdf-format:
Dokument 1.pdf (91 KB)


Abstract

With the new interest in historical documents insight grew that electronic access to these texts causes many specific problems. In the first part of the paper we survey the present role of digital historical documents. After collecting central facts and observations on historical language change we comment on the difficulties that result for retrieval and data mining on historical texts. In the second part of the paper we report on our own work in the area with a focus on special matching strategies that help to relate modern language keywords with old variants. The basis of our studies is a collection of documents from the Early New High German period. These texts come with a very rich spectrum on word variants and spelling variations.

BibTeX - Entry

@InProceedings{hauser_et_al:DSP:2007:1057,
  author =	{Andreas Hauser and Markus Heller and Elisabeth Leiss and Klaus U. Schulz and Christiane Wanzeck},
  title =	{Information Access to Historical Documents from the Early New High German Period},
  booktitle =	{Digital Historical Corpora- Architecture, Annotation, and Retrieval},
  year =	{2007},
  editor =	{Lou Burnard and Milena Dobreva and Norbert Fuhr and Anke L{\"u}deling },
  number =	{06491},
  series =	{Dagstuhl Seminar Proceedings},
  ISSN =	{1862-4405},
  publisher =	{Internationales Begegnungs- und Forschungszentrum f{\"u}r Informatik (IBFI), Schloss Dagstuhl, Germany},
  address =	{Dagstuhl, Germany},
  URL =		{http://drops.dagstuhl.de/opus/volltexte/2007/1057},
  annote =	{Keywords: Historical documents, information access, Early New High German,  historical language, information retrieval, word similarity, approximate matching}
}

Keywords: Historical documents, information access, Early New High German, historical language, information retrieval, word similarity, approximate matching
Seminar: 06491 - Digital Historical Corpora- Architecture, Annotation, and Retrieval
Issue date: 2007
Date of publication: 19.06.2007


DROPS-Home | Fulltext Search | Imprint Published by LZI