Computational Approaches to Digitised Historical Newspapers (Dagstuhl Seminar 22292)

Authors Maud Ehrmann, Marten Düring, Clemens Neudecker, Antoine Doucet and all authors of the abstracts in this report

Thumbnail PDF


  • Filesize: 12.18 MB
  • 68 pages

Document Identifiers

Author Details

Maud Ehrmann
  • EPFL - Lausanne, CH
Marten Düring
  • University of Luxembourg, LU
Clemens Neudecker
  • Staatsbibliothek zu Berlin, DE
Antoine Doucet
  • University of La Rochelle, FR
and all authors of the abstracts in this report

Cite AsGet BibTex

Maud Ehrmann, Marten Düring, Clemens Neudecker, and Antoine Doucet. Computational Approaches to Digitised Historical Newspapers (Dagstuhl Seminar 22292). In Dagstuhl Reports, Volume 12, Issue 7, pp. 112-179, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2023)


Historical newspapers are mirrors of past societies, keeping track of the small and great history and reflecting the political, moral, and economic environments in which they were produced. Highly valued as primary sources by historians and humanities scholars, newspaper archives have been massively digitised in libraries, resulting in large collections of machine-readable documents and, over the past half-decade, in numerous academic research initiatives on their automatic processing. The Dagstuhl Seminar 22292 "Computational Approaches to Digitised Historical Newspaper" gathered researchers and practitioners with backgrounds in natural language processing, computer vision, digital history and digital library involved in computational approaches to historical newspapers with the objectives to share experiences, analyse successes and shortcomings, deepen our understanding of the interplay between computational aspects and digital scholarship, and discuss future challenges. This report documents the program and the outcomes of the seminar.

Subject Classification

ACM Subject Classification
  • Computing methodologies → Information extraction
  • Computing methodologies → Machine learning
  • Information systems → Digital libraries and archives
  • Applied computing → Arts and humanities
  • Applied computing → Document management and text processing
  • Information systems → Information retrieval
  • Information systems → Data mining
  • Information systems → Document representation
  • Information systems → Document structure
  • Information systems → Structure and multilingual text search
  • Information systems → Users and interactive retrieval
  • historical document processing
  • document structure and layout analysis
  • natural language processing
  • information extraction
  • natural language processing
  • digital history
  • digital scholarship


  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    PDF Downloads