License
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.ICDT.2016.20
URN: urn:nbn:de:0030-drops-57897
URL: http://drops.dagstuhl.de/opus/volltexte/2016/5789/
Go to the corresponding LIPIcs Volume Portal


Czerwinski, Wojciech ; David, Claire ; Murlak, Filip ; Parys, Pawel

Reasoning About Integrity Constraints for Tree-Structured Data

pdf-format:
19.pdf (0.6 MB)


Abstract

We study a class of integrity constraints for tree-structured data modelled as data trees, whose nodes have a label from a finite alphabet and store a data value from an infinite data domain. The constraints require each tuple of nodes selected by a conjunctive query (using navigational axes and labels) to satisfy a positive combination of equalities and a positive combination of inequalities over the stored data values. Such constraints are instances of the general framework of XML-to-relational constraints proposed recently by Niewerth and Schwentick. They cover some common classes of constraints, including W3C XML Schema key and unique constraints, as well as domain restrictions and denial constraints, but cannot express inclusion constraints, such as reference keys. Our main result is that consistency of such integrity constraints with respect to a given schema (modelled as a tree automaton) is decidable. An easy extension gives decidability for the entailment problem. Equivalently, we show that validity and containment of unions of conjunctive queries using navigational axes, labels, data equalities and inequalities is decidable, as long as none of the conjunctive queries uses both equalities and inequalities; without this restriction, both problems are known to be undecidable. In the context of XML data exchange, our result can be used to establish decidability for a consistency problem for XML schema mappings. All the decision procedures are doubly exponential, with matching lower bounds. The complexity may be lowered to singly exponential, when conjunctive queries are replaced by tree patterns, and the number of data comparisons is bounded.

BibTeX - Entry

@InProceedings{czerwinski_et_al:LIPIcs:2016:5789,
  author =	{Wojciech Czerwinski and Claire David and Filip Murlak and Pawel Parys},
  title =	{{Reasoning About Integrity Constraints for Tree-Structured Data}},
  booktitle =	{19th International Conference on Database Theory (ICDT 2016)},
  pages =	{20:1--20:18},
  series =	{Leibniz International Proceedings in Informatics (LIPIcs)},
  ISBN =	{978-3-95977-002-6},
  ISSN =	{1868-8969},
  year =	{2016},
  volume =	{48},
  editor =	{Wim Martens and Thomas Zeume},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{http://drops.dagstuhl.de/opus/volltexte/2016/5789},
  URN =		{urn:nbn:de:0030-drops-57897},
  doi =		{10.4230/LIPIcs.ICDT.2016.20},
  annote =	{Keywords: data trees, integrity constraints, unions of conjunctive queries, schema mappings, entailment,  containment,  consistency}
}

Keywords: data trees, integrity constraints, unions of conjunctive queries, schema mappings, entailment, containment, consistency
Seminar: 19th International Conference on Database Theory (ICDT 2016)
Issue Date: 2016
Date of publication: 29.02.2016


DROPS-Home | Fulltext Search | Imprint Published by LZI