On Deciding the Data Complexity of Answering Linear Monadic Datalog Queries with LTL Operators

Artale, Alessandro; Gnatenko, Anton; Ryzhikov, Vladislav; Zakharyaschev, Michael

doi:10.4230/LIPIcs.ICDT.2025.31

On Deciding the Data Complexity of Answering Linear Monadic Datalog Queries with LTL Operators

Alessandro Artale

Faculty of Engineering, Free University of Bozen-Bolzano, Italy Anton Gnatenko

Faculty of Engineering, Free University of Bozen-Bolzano, Italy Vladislav Ryzhikov

Birkbeck, University of London, UK Michael Zakharyaschev

Birkbeck, University of London, UK

Abstract

Our concern is the data complexity of answering linear monadic datalog queries whose atoms in the rule bodies can be prefixed by operators of linear temporal logic LTL. We first observe that, for data complexity, answering any connected query with operators $\bigcirc$ / $\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}$ (at the next/previous moment) is either in $\textsc{AC}^{0}$ , or in ${\textsc{ACC}^{0}}\!\setminus\!{\textsc{AC}^{0}}$ , or $\textsc{NC}^{1}$ -complete, or L-hard and in NL. Then we show that the problem of deciding L-hardness of answering such queries is PSpace-complete, while checking membership in the classes $\textsc{AC}^{0}$ and $\textsc{ACC}^{0}$ as well as $\textsc{NC}^{1}$ -completeness can be done in ExpSpace. Finally, we prove that membership in $\textsc{AC}^{0}$ or in $\textsc{ACC}^{0}$ , $\textsc{NC}^{1}$ -completeness, and L-hardness are undecidable for queries with operators $\Diamond$ / $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ (sometime in the future/past) provided that ${\textsc{NC}^{1}}\neq\textsc{NL}$ and $\textsc{L}\neq\textsc{NL}$ .

Keywords and phrases:

Linear monadic datalog, linear temporal logic, data complexity

Copyright and License:

2012 ACM Subject Classification:

Theory of computation

\rightarrow

Database query processing and optimization (theory)

Related Version:

Extended version incl. full proofs: https://arxiv.org/abs/2501.13762 [4]

DOI:

10.4230/LIPIcs.ICDT.2025.31

Event:

28th International Conference on Database Theory (ICDT 2025)

Editors:

Sudeepa Roy and Ahmet Kara

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

We consider monadic datalog queries, in which atoms in the rule bodies can be prefixed by the temporal operators $\bigcirc$ / ${\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}}$ (at the next/previous moment) and $\Diamond$ / $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ (sometime in the future/past) of linear temporal logic LTL [17]. This query language, denoted $\textsl{datalog}_{m}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ , is intended for querying temporal graph databases and knowledge graphs in scenarios such as virus transmission [25, 45], transport networks [28], social media [38], supply chains [46], and power grids [37]. In this setting, data instances are finite sets of ground atoms that are timestamped by the moments of time they happen at. The rules in $\textsl{datalog}_{m}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ queries are assumed to hold at all times, with time being implicit in the rules and only accessible via temporal operators. We choose LTL for our formalism rather than, say, more expressive metric temporal logic MTL [30, 1] because LTL has been a well established query language in temporal databases since the 1990s (see [39, 41, 14, 32] and the discussion therein on point versus interval-based query languages), also suitable in the context of temporal knowledge graphs as recently argued in [19].

Example 1.

Imagine a PhD student working on a paper while hostel hopping. However, finishing the paper requires staying at the same hostel for at least two consecutive nights. Bus services between hostels, which vary from one day to the next, and hostel vacancies are given by a temporal data instance with atoms of the form $\textit{busService}(a,b,n)$ and $\textit{Vacant}(a,n)$ , where $a$ , $b$ are hostels and $n\in\mathbb{N}$ a timestamp (see Fig. 1 for an illustration). The following $\textsl{datalog}_{m}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ query $(\pi_{1},\textit{Success})$ finds pairs $(x,t)$ such that having started hopping at hostel $x$ on day $t$ , the student will eventually submit the paper:

\displaystyle\pi_{1}\colon\quad\begin{split}&\textit{Success}(X)\leftarrow% \textit{Vacant}(X)\land{\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}% \textit{busService}(X,Y)\wedge{\raisebox{1.07639pt}{\text{\scriptsize{$% \bigcirc$}}}}\textit{Success}(Y),\\ &\textit{Success}(X)\leftarrow\textit{Vacant}(X)\land{\raisebox{1.07639pt}{% \text{\scriptsize{$\bigcirc$}}}}\textit{Vacant}(X).\end{split}

It is readily seen that answering this query is NL-complete for data complexity. If, however, we drop the next-time operator $\bigcirc$ from $\pi_{1}$ , it will become equivalent to $\textit{Vacant}(X)$ . The next query $(\pi_{2},\textit{Promising})$ simply looks for pairs $(x,t)$ with $x$ having vacancies for two consecutive nights some time later than $t$ :

\displaystyle\pi_{2}\colon\quad\begin{split}&\textit{Promising}(X)\leftarrow% \raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\textit{Vacant42Nights}(X),\\ &\textit{Vacant42Nights}(X)\leftarrow\textit{Vacant}(X)\land{\raisebox{1.07639% pt}{\text{\scriptsize{$\bigcirc$}}}}\textit{Vacant}(X).\end{split}

This $\textsl{datalog}_{m}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ query can be equivalently expressed as the two-sorted first-order formula

\exists t^{\prime}\,\big{(}(t<t^{\prime})\wedge\textit{Vacant}(x,t^{\prime})% \wedge\textit{Vacant}(x,t^{\prime}+1)\big{)},

where $x$ ranges over objects (hostels) and $t,t^{\prime}$ over time points ordered by $<$ . Formulas in such a two-sorted first-order logic, denoted $\textup{FO}(<)$ , can be evaluated over finite data instances in $\textsc{AC}^{0}$ for data complexity [26]. $\lrcorner$

Refer to caption — (a) Hostels $h_{1},h_{2},h_{3},h_{4}$ on days $1,2,3,4$ .

Our main concern is the classical problem of deciding whether a given temporal monadic datalog query is equivalent to a first-order query (over any data instance). In the standard, atemporal database theory, this problem, known as predicate boundedness, has been investigated since the mid 1980s with the aim of optimising and parallelising datalog programs [27, 16]. Thus, predicate boundedness was shown to be undecidable for binary datalog queries [24] and 2ExpTime-complete for monadic ones (even with a single recursive rule) [15, 8, 29].

Datalog boundedness is closely related to the more general rewritability problem in the ontology-based data access paradigm [11, 3], which brought to light wider classes of ontology-mediated queries (OMQs) and ultimately aimed to decide the data complexity of answering any given OMQ and, thereby, the optimal database engine needed to execute that OMQ. Answering OMQs given in propositional linear temporal logic LTL is either in ${\textsc{AC}^{0}}$ , or in ${\textsc{ACC}^{0}}\setminus{\textsc{AC}^{0}}$ , or ${\textsc{NC}^{1}}$ -complete for data complexity [40], the classes well known from the circuit complexity theory for regular languages. For each of these three cases, deciding whether a given LTL-query falls into it is ExpSpace-complete, even if we restrict the language to temporal Horn formulas and atomic queries [31]. The data complexity of answering atemporal monadic datalog queries comes from four complexity classes ${\textsc{AC}^{0}}\subsetneqq\textsc{L}\subseteq\textsc{NL}\subseteq\textsc{P}$ [16, 15].

Our 2D query language $\textsl{datalog}_{m}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ is a combination of datalog and LTL. It can be seen as the monadic fragment, without negation and aggregate functions, of $\textsc{Dedalus}_{0}$ , a language for reasoning about distributed systems that evolve in time [2]. It is also close in spirit to temporal deductive databases, TDDs [13, 12], extending their monadic fragment with the eventuality operator $\Diamond$ . The main intriguing question we would like to answer in this paper is whether deciding membership of 2D queries in, say, ${\textsc{AC}^{0}}$ can be substantially harder than deciding membership in ${\textsc{AC}^{0}}$ of the corresponding 1D monadic datalog and LTL queries.

With this in mind, we focus on the sublanguage $\textsl{datalog}_{\it lm}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3% pt}{$\diamond$}}}}$ of $\textsl{datalog}_{m}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ that consists of linear queries, that is, those that have at most one IDB (intensional, recursively definable) predicate in each rule body. While the full language inherits from TDDs a PSpace-complete query answering problem (for data complexity), we prove that linear queries can be answered in NL. By $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ and $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{-1.0pt}{$\diamond$}}}$ we denote the fragments of $\textsl{datalog}_{\it lm}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3% pt}{$\diamond$}}}}$ that only admit the $\bigcirc$ / $\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}$ and $\Diamond$ / $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ operators, respectively. All of our queries are assumed to be connected in the sense that the graph induced by the body of each rule is connected. These fragments retain practical interest: as argued in [15], atemporal datalog programs used in practice tend to be linear and connected. For example, SQL:1999 explicitly supports linear recursion [20], which together with connectedness is a common constraint in the context of querying graph databases and knowledge graphs [35, 42], where the focus is on path queries [21, 23].

It is known that deciding whether a linear monadic datalog query can be answered in ${\textsc{AC}^{0}}$ is PSpace-complete [15, 43] (without the monadicity restriction, the problem is undecidable [24]). The same problem for the propositional LTL fragments of $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ and $\textsl{datalog}_{\it lm}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3% pt}{$\diamond$}}}}$ is also PSpace-complete [31].

Our main results in this paper are as follows:

$\blacksquare$

Answering $\textsl{datalog}_{\it lm}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3% pt}{$\diamond$}}}}$ queries is NL-complete for data complexity.
$\blacksquare$

It is undecidable whether a given $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{-1.0pt}{$\diamond$}}}$ -query can be answered in $\textsc{AC}^{0}$ , $\textsc{ACC}^{0}$ , or $\textsc{NC}^{1}$ (if ${\textsc{NC}^{1}}\neq\textsc{NL}$ ); it is undecidable whether such a query is L-hard (if $\textsc{L}\neq\textsc{NL}$ ).
$\blacksquare$

Answering any connected $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ -query is either in ${\textsc{AC}^{0}}$ or in ${\textsc{ACC}^{0}}\setminus{\textsc{AC}^{0}}$ , or ${\textsc{NC}^{1}}$ -complete, or L-hard – and anyway it is in NL.
$\blacksquare$

It is PSpace-complete to decide whether a connected $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ -query is L-hard; checking whether it is in $\textsc{AC}^{0}$ , or in ${\textsc{ACC}^{0}}\setminus{\textsc{AC}^{0}}$ , or is $\textsc{NC}^{1}$ -complete can be done in ExpSpace.

(Note that dropping the past-time operators $\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}$ and $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ from the languages has no impact on these complexity results.) Thus, the temporal operators $\bigcirc$ / $\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}$ and $\Diamond$ / $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ exhibit drastically different types of interaction between the object and temporal dimensions. To illustrate the reason for this phenomenon, consider first the following $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ -program:

	$\displaystyle G(X)\leftarrow A(X)\wedge{\raisebox{1.07639pt}{\text{\scriptsize% {$\bigcirc$}}}}R(X,Y)\wedge{\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}% }}}D(Y),$		(1)
	$\displaystyle D(X)\leftarrow{\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$% }}}}D(X),$		(2)
	$\displaystyle D(X)\leftarrow B(X).$		(3)

Suppose a data instance consists of timestamped atoms $A(a,0)$ , $R(a,b,1)$ , $B(b,5)$ . We obtain $G(a,0)$ by first applying rule (3) to infer $D(b,5)$ , then rule (2) to infer $D(b,4),D(b,3),D(b,2)$ , and $D(b,1)$ , and finally rule (1) to obtain $G(a,0)$ . Rules (2) and (3) are applied along the timeline of a single object, $b$ , while the final application passes from one object, $b$ , to another, $a$ . To do so, we check whether a certain condition holds for the joint timeline of $a$ and $b$ , namely, that they are connected by $R$ at time 1. However, if we are limited to the operators $\bigcirc$ / $\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}$ , the number of steps that we can investigate along such a joint timeline is bounded by the maximum number of nested $\bigcirc$ / $\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}$ in the program. Therefore, there is little interaction between the two phases of inference that explore the object and the temporal domains. In contrast, rules with $\Diamond$ / $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ can inspect both dimensions simultaneously as, for example, the rule

\displaystyle G(X)\leftarrow\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}R(X% ,Y)\wedge D(Y).

(4)

In this case, inferring $G(a,0)$ requires checking the existence of an object $b$ satisfying $D(b,0)$ and $R(a,b,\ell)$ at some arbitrarily distant moment $\ell$ in the future. Given that our programs are monadic, the predicate $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}R(X,Y)$ cannot be expressed using operators $\bigcirc$ / $\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}$ only.

Our positive results are proved by generalising the automata-theoretic approach of [15]. As a by-product, we obtain a method to decompose every connected $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ -query $(\pi,G)$ into a plain datalog part $(\pi_{d},G)$ and a plain LTL part $(\pi_{t},G)$ , which are, however, substantially larger than $(\pi,G)$ , so that the data complexity of answering $(\pi,G)$ equals the maximum of the respective data complexities of $(\pi_{d},G)$ and $(\pi_{t},G)$ . This reinforces the “weakness of interaction” between the relational and the temporal parts of the query when the latter is limited to operators $\bigcirc$ / $\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}$ . We also provide some evidence that, in contrast to the atemporal case, the automata-theoretic approach cannot be generalised to the case of disconnected queries. The undecidability of the decision problem for $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{-1.0pt}{$\diamond$}}}$ -queries is proved by a reduction of the halting problem for Minsky machines with two counters [33].

The paper is organised as follows. In Section 2, we give formal definitions of data instances and queries, and prove that every temporal monadic datalog query can be answered in P, and in NL if it is linear. In Section 3, we show that checking whether a given query with operator $\Diamond$ or $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ has data complexity lower than NL is undecidable. Section 4 considers $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ -queries by presenting a generalisation of the automata-theoretic approach of [15], which is then used in Section 5 to provide the decidability results. We conclude with a discussion of future work and our final remarks in Section 6. Detailed proofs can be found in the full version of this paper [4].

2 Preliminaries

A relational schema $\Sigma$ is a finite set of relation symbols $R$ with associated arities $m\geq 0$ . A database $D$ over a schema $\Sigma$ is a set of ground atoms $R(d_{1},\dots,d_{m})$ , $R\in\Sigma$ , $m$ is the arity of $R$ . We call $d_{i}$ , $1\leq i\leq m$ , the domain objects or simply objects. We denote by $\Delta_{D}$ the set of objects occurring in $D$ . We denote by $|D|$ the number of atoms in $D$ . We denote by $[a,b]$ the set of integers $\{m\mid a\leqslant m\leqslant b\}$ , where $a,b\in\mathbb{Z}$ . A temporal database $\mathcal{D}$ over a schema $\Sigma$ is a finite sequence $\langle D_{l},D_{l+1}\dots,D_{r-1},D_{r}\rangle$ of databases over this schema for some $l<r$ , $l,r\in\mathbb{Z}$ . Each database $D_{i},l\leqslant i\leqslant r,$ is called the $i$ ’th slice of $\mathcal{D}$ and $i$ is called a timestamp. We denote $[l,r]$ by $\mathrm{tem}(\mathcal{D})$ . The size of $\mathcal{D}$ , denoted by $|\mathcal{D}|$ , is the maximum between $|\mathrm{tem}(\mathcal{D})|$ and $\max\{|D_{l}|,\dots,|D_{r}|\}$ . The domain of the temporal database $\mathcal{D}$ is $\bigcup_{l\leq i\leq r}\Delta_{D_{i}}$ and is denoted by $\Delta_{\mathcal{D}}$ . A homomorphism from $\mathcal{D}$ as above to $\mathcal{D}^{\prime}=\langle D_{l^{\prime}},D_{l^{\prime}+1}\dots,D_{r^{\prime% }-1},D_{r^{\prime}}\rangle$ is a function $h$ that maps $\Delta_{\mathcal{D}}\cup[l,r]$ to $\Delta_{\mathcal{D}^{\prime}}\cup[l^{\prime},r^{\prime}]$ so that $R(d_{1},\dots,d_{m})\in D_{\ell}$ if and only if $R(h(d_{1}),\dots,h(d_{m}))\in D^{\prime}_{h(\ell)}$ .

We will deal with temporal conjunctive queries (temporal CQs) that are formulas of the form $Q(\overline{X})=\exists\overline{U}\varphi(\overline{X},\overline{U})$ , where $\overline{X},\overline{U}$ are tuples of variables and $\varphi$ , the body of the query, is defined by the following BNF:

\varphi\Coloneqq R(Z_{1},\dots,Z_{m})\mid(\varphi\wedge\varphi)\mid\mathcal{O}\varphi

(5)

where $R\in\Sigma$ and $Z_{1},\dots,Z_{m}$ are variables of $\overline{X}\cup\overline{U}$ , and $\mathcal{O}$ is any of ${\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}},{\raisebox{1.07639pt}{% \text{\scriptsize{$\bigcirc$}}}^{-}},\raisebox{0.43057pt}{\text{\small{$% \Diamond$}}}$ and $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ (the operators $\bigcirc$ / $\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}$ mean “at the next/previous moment” and $\Diamond$ / $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ “sometime in the future/past”). For brevity, we will use the notation $\mathcal{O}^{n}$ , $\mathcal{O}\in\{{\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}},% \raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\}$ , for a sequence of $n$ symbols $\mathcal{O}$ if $n>0$ , of $|n|$ symbols $\mathcal{O}^{-}\in\{{\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}}% ,\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}\}$ if $n<0$ , and an empty sequence if $n=0$ . We call $\overline{X}$ the answer variables of $Q$ . To provide the semantics, we define $\mathcal{D},\ell\models\varphi(d_{1},\dots,d_{m})$ for $\ell\in\mathbb{Z}$ and $d_{1},\dots,d_{m}\in\Delta_{\mathcal{D}}$ as follows:

	$\displaystyle\mathcal{D},\ell\models R(d_{1},\dots,d_{m})\iff\ell\in[l,r]\text% { and }R(d_{1},\dots,d_{m})\in D_{\ell}$		(6)
	$\displaystyle\mathcal{D},\ell\models\varphi_{1}\wedge\varphi_{2}\iff\mathcal{D% },\ell\models\varphi_{1}\text{ and }\mathcal{D},\ell\models\varphi_{2}$		(7)
	$\displaystyle\mathcal{D},\ell\models{\raisebox{1.07639pt}{\text{\scriptsize{$% \bigcirc$}}}}\varphi\iff\mathcal{D},{\ell+1}\models\varphi$		(8)
	$\displaystyle\mathcal{D},\ell\models\raisebox{0.43057pt}{\text{\small{$% \Diamond$}}}\varphi\iff\mathcal{D},{\ell^{\prime}}\models\varphi\text{ for % some }\ell^{\prime}>\ell$		(9)

and symmetrically for ${\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}}$ and $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ . Given a temporal database $\mathcal{D}$ , a timestamp $\ell\in\mathbb{Z}$ , and a query $Q(\overline{X})=\exists\overline{U}\varphi(\overline{X},\overline{U})$ , we say that $\mathcal{D},\ell\models Q(d_{1},\dots,d_{k})$ if there exist $\delta_{1},\dots,\delta_{s}\in\Delta_{\mathcal{D}}$ such that $\mathcal{D},\ell\models\varphi(d_{1},\dots,d_{k},\delta_{1},\dots,\delta_{s})$ , where $k=|\overline{X}|,s=|\overline{U}|$ .

The problem of answering a temporal CQ $Q$ is to check, given $\mathcal{D}$ , $\ell\in\mathrm{tem}(\mathcal{D})$ , and $\bar{d}=\langle d_{1},\dots,d_{k}\rangle$ , whether $\mathcal{D},\ell\models Q(\bar{d})$ . Answering temporal CQs is not harder than that for non-temporal CQs. Indeed, we show that any $Q$ is $\textup{FO}(<)$ -rewritable in the sense that there exists an $\textup{FO}(<)$ -formula $\psi(\overline{X},t)$ such that for all $\ell$ and $\bar{d}$ as above $\mathcal{D},\ell\models Q(\bar{d})$ whenever $\psi(\bar{d},\ell)$ is true in the two-sorted first-order structure $\mathfrak{S}_{\mathcal{D}}$ , whose domain is $\Delta_{\mathcal{D}}\cup\mathrm{tem}(\mathcal{D})$ , and where $R(\bar{d},\ell)$ is true whenever $\mathcal{D},\ell\models R(\bar{d})$ and $(\ell<\ell^{\prime})$ is true whenever $\ell<\ell^{\prime}$ (see [5] for details). It follows by [26] that the problem of answering a temporal CQ $Q$ is in $\textsc{AC}^{0}$ .

We outline the construction of the rewriting. Let $Q(\overline{X})=\exists\overline{U}\varphi(\overline{X},\overline{U})$ . The main issue with the construction is that temporal subformulas of $\varphi$ , say $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\varkappa$ , may be true for $\bar{d}$ at $\ell\in\mathrm{tem}(\mathcal{D})$ , because $\mathcal{D},\ell^{\prime}\models\varkappa(\bar{d})$ for $\ell^{\prime}\not\in\mathrm{tem}(\mathcal{D})$ . Had that not been the case, we could construct the rewriting for $Q$ straightforwardly by induction of $\varphi$ . To overcome this, let $N$ be the number of temporal operators in $\varphi$ . We use a property that for all tuples $\bar{d}$ of objects from $\Delta_{\mathcal{D}}$ and subformulas $\varkappa$ of $\varphi$ , we have

\begin{split}&\mathcal{D},r+N+1\models\varkappa(\bar{d})\iff\mathcal{D},\ell% \models\varkappa(\bar{d})\text{ for all }\ell>r+N\\ &\mathcal{D},l-N-1\models\varkappa(\bar{d})\iff\mathcal{D},\ell\models% \varkappa(\bar{d})\text{ for all }\ell<l-N\end{split}

(10)

Thus, for any subformula $\varkappa(\overline{Z})$ of $\varphi$ , we construct, by induction, the formulas $\psi_{\varkappa}(\overline{Z},t)$ and $\psi_{\varkappa}^{i}(\overline{Z})$ for $i\in[-N-1,\dots,-1]\cup[1,\dots,N+1]$ , so that for any $\mathcal{D}$ , $\mathrm{tem}(\mathcal{D})=[l,r]$ , and any objects $\bar{d}\in\Delta_{\mathcal{D}}^{|\overline{Z}|}$ ,

$\displaystyle\mathcal{D},\ell\models\varkappa(\bar{d})$	$\displaystyle\iff$	$\displaystyle\mathfrak{S}_{\mathcal{D}}\models\psi_{\varkappa}(\bar{d},\ell)$	$\displaystyle\text{ for }\ell\in[l,r],$
$\displaystyle\mathcal{D},(r+i)\models\varkappa(\bar{d})$	$\displaystyle\iff$	$\displaystyle\mathfrak{S}_{\mathcal{D}}\models\psi_{\varkappa}^{i}(\bar{d})$	$\displaystyle\text{ for }1\leqslant i\leqslant N+1,$
$\displaystyle\mathcal{D},(l+i)\models\varkappa(\bar{d})$	$\displaystyle\iff$	$\displaystyle\mathfrak{S}_{\mathcal{D}}\models\psi_{\varkappa}^{i}(\bar{d})$	$\displaystyle\text{ for }-N-1\leqslant i\leqslant-1.$

For the base case, we set $\psi_{R}(\overline{Z},t)=R(\overline{Z},t)$ and $\psi_{R}^{i}(\overline{Z})=\bot$ . For an induction step, e.g., $\psi_{{\raisebox{0.75346pt}{\text{\scriptsize{$\bigcirc$}}}}\varkappa}^{-1}(% \overline{Z})=\psi_{\varkappa}(\overline{Z},\min)$ , $\psi_{{\raisebox{0.75346pt}{\text{\scriptsize{$\bigcirc$}}}}\varkappa}^{N+1}(% \overline{Z})=\psi^{N+1}_{\varkappa}(\overline{Z})$ , $\psi_{{\raisebox{0.75346pt}{\text{\scriptsize{$\bigcirc$}}}}\varkappa}^{i}(% \overline{Z})=\psi_{\varkappa}^{i+1}(\overline{Z})$ for all other $i$ , and finally $\psi_{{\raisebox{0.75346pt}{\text{\scriptsize{$\bigcirc$}}}}\varkappa}(% \overline{Z},t)=\exists t^{\prime}((t^{\prime}=t+1)\land\psi_{\varkappa}(% \overline{Z},t^{\prime}))\lor((t=\max)\land\psi_{\varkappa}^{1}(\overline{Z}))$ . Here, $\min$ and $\max$ are defined in $\textup{FO}(<)$ as, respectively, $<$ -minimal and $<$ -maximal elements in $\mathrm{tem}(\mathcal{D})$ ( $(t^{\prime}=t+1)$ is $\textup{FO}(<)$ -definable as well). The required rewriting of $Q$ is then the formula $\exists\overline{U}\psi_{\varphi}(\overline{X},\overline{U},t)$ .

Proposition 2.

For a temporal CQ $Q(X_{1},\dots,X_{k})$ , checking $\mathcal{D},\ell\models Q(d_{1},\dots,d_{k})$ is in $\textsc{AC}^{0}$ for data complexity.

In the non-temporal setting, a body of a CQ (a conjunction of atoms), can be seen as a database (a set of atoms). We have a similar correspondence in the temporal setting, for queries without operators $\Diamond$ and $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ . Indeed, observe that ${\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}^{a}(\varphi_{1}\wedge% \varphi_{2})\equiv{\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}^{a}% \varphi_{1}\wedge{\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}^{a}% \varphi_{2}$ and ${\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}{\raisebox{1.07639pt}{% \text{\scriptsize{$\bigcirc$}}}^{-}}\varphi\equiv{\raisebox{1.07639pt}{\text{% \scriptsize{$\bigcirc$}}}^{-}}{\raisebox{1.07639pt}{\text{\scriptsize{$% \bigcirc$}}}}\varphi\equiv\varphi$ . Hence we can assume that any temporal CQ body is a conjunction of temporalised atoms of the form ${\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}^{k}R(Z_{1},\dots,Z_{m})$ . Given a temporal CQ $Q$ of this form, let $l$ be the least and $r$ the greatest number such that ${\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}^{l}R(\overline{Z})$ and ${\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}^{r}R^{\prime}(\overline% {Z}^{\prime})$ appear in $Q$ , for some $R,R^{\prime},\overline{Z}$ and $\overline{Z}^{\prime}$ . Let $\mathcal{D}_{Q}$ be a temporal database whose objects are the variables of $Q$ , and $\mathrm{tem}(\mathcal{D})$ equals $[l,r]$ if $0\in[l,r]$ , $[0,r]$ if $0<l$ , and $[l,0]$ if $r<0$ . Furthermore, let $R(\overline{Z})\in D_{\ell}$ whenever ${\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}^{\ell}R(\overline{Z})$ is in $Q$ , $\ell\in\mathrm{tem}(\mathcal{D})$ . Then we can, just as in the non-temporal case, characterise the relation $\models$ in terms of homomorphisms:

Lemma 3.

For any temporal CQ $Q(X_{1},\dots,X_{k})$ without $\Diamond$ and $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ , $\mathcal{D},\ell\models Q(d_{1},\dots,d_{k})$ if and only if there is a homomorphism $h$ from $\mathcal{D}_{Q}$ to $\mathcal{D}$ such that $h(X_{i})=d_{i},1\leqslant i\leqslant k$ , and $h(0)=\ell$ .

2.1 Temporal Datalog

We define a temporalised version of datalog to be able to use recursion in querying temporal databases. We call this language temporal datalog, or $\textsl{datalog}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ . A rule of $\textsl{datalog}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ over a schema $\Sigma$ has the form:

C(\overline{X})\leftarrow\mathcal{O}^{*}R_{1}(\overline{U}_{1})\wedge\dots% \wedge\mathcal{O}^{*}R_{s}(\overline{U}_{s})

(11)

where $R_{i}$ and $C$ are relation symbols over $\Sigma$ and $\mathcal{O}^{*}$ is an arbitrary sequence of temporal operators ${\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}},{\raisebox{1.07639pt}{% \text{\scriptsize{$\bigcirc$}}}^{-}},\raisebox{0.43057pt}{\text{\small{$% \Diamond$}}}$ and $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ . The part of the rule to the left of the arrow is called its head and the right-hand side – its body. All variables from the head must appear in the body.

A program is a finite set of rules. The relations that appear in rule heads constitute its IDB schema, $\textit{IDB}(\pi)$ , while the rest form the EDB schema, $\textit{EDB}(\pi)$ . A rule is linear if its body contains at most one IDB atom and monadic if the arity of its head is 1. A program $\pi$ is linear (monadic) if so are all its rules. We say that the program is in plain datalog if it does not use the temporal operators and in plain LTL if all its relations have arity 1 and every rule uses just one variable. Recursive rules are those that contain IDB atoms in their bodies, other rules are called initialisation rules. The arity of a program is the maximal arity of its IDB atoms.

Our results are all about connected programs. Namely, define the Gaifman graph of a temporal CQ to be a graph whose nodes are the variables and where two variables are connected by an edge if they appear in the same atom. A rule body is connected if so is its Gaifman graph, and a program is connected when all rules are connected. The size of a program $\pi$ , denoted $|\pi|$ , is the number of symbols needed to write it down, where every relation symbol $R\in\textit{EDB}(\pi)\cup\textit{IDB}(\pi)$ is counted as one symbol, and a sequence of operators the form $\mathcal{O}^{k}$ is counted as $|k|$ symbols.

When a program $\pi$ is fixed, we assume that all temporal databases that we work with are defined over $\textit{EDB}(\pi)$ . So let $\pi$ be a program and $\mathcal{D}$ a temporal database. An enrichment of $\mathcal{D}$ is an (infinite) temporal database $\mathcal{E}=\langle E_{\ell}\rangle_{\ell\in\mathbb{Z}}$ over the schema $\textit{EDB}(\pi)\cup\textit{IDB}(\pi)$ such that $\Delta_{\mathcal{E}}=\Delta_{\mathcal{D}}$ and for any $R\in\textit{EDB}(\pi)$ and any $\ell\in\mathbb{Z}$ , $R(d_{1},\dots,d_{m})\in E_{\ell}$ if and only if $R(d_{1},\dots,d_{m})\in D_{\ell}$ . Thus, the only EDB atoms in $\mathcal{E}$ are those of $\mathcal{D}$ , but $\mathcal{E}$ “enriches” $\mathcal{D}$ with various IDB atoms at different points of time. We say that $\mathcal{E}$ is a model of $\pi$ and $\mathcal{D}$ if $(i)$ $\mathcal{E}$ is an enrichment of $\mathcal{D}$ ; and $(ii)$ for any rule $C(\overline{X})\leftarrow\psi(\overline{X},\overline{U})$ of $\pi$ , $\mathcal{E}\models C(\overline{X})\leftarrow\psi(\overline{X},\overline{U})$ , i.e., for all $\ell\in\mathbb{Z}$ and any tuples $\bar{d}\in\Delta_{\mathcal{E}}^{|\overline{X}|},\bar{\delta}\in\Delta_{% \mathcal{E}}^{|\overline{U}|}$ , $\mathcal{E},\ell\models\psi(\bar{d},\bar{\delta})$ implies $\mathcal{E},\ell\models C(\bar{d})$ . We write $\mathcal{D},\pi,\ell\models C(\bar{d})$ if for every model $\mathcal{E}$ of $\pi$ and $\mathcal{D}$ it follows that $\mathcal{E},\ell\models C(\bar{d})$ .

A $\textsl{datalog}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ query is a pair $(\pi,G)$ , where $\pi$ is a $\textsl{datalog}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ program and $G$ an IDB atom, called the goal predicate. The arity of a query is the arity of $G$ . Given a temporal database $\mathcal{D}$ , a timestamp $\ell\in\mathrm{tem}(\mathcal{D})$ , a tuple $(d_{1},\ldots,d_{k})\in\Delta_{\mathcal{D}}^{k}$ and a $\textsl{datalog}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ query $(\pi,G)$ of arity $k$ , the pair $\langle(d_{1},\ldots,d_{k}),\ell\rangle$ is a certain answer to $(\pi,G)$ over $\mathcal{D}$ if $\mathcal{D},\pi,\ell\models G(d_{1},\ldots,d_{k})$ . The answering problem for a $\textsl{datalog}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ query $(\pi,G)$ over a temporal database $\mathcal{D}$ is that of checking, given a tuple $(d_{1},\dots,d_{k})\in\Delta_{\mathcal{D}}^{k}$ , and $\ell\in\mathrm{tem}(\mathcal{D})$ , if $\langle(d_{1},\ldots,d_{k}),\ell\rangle$ is a certain answer to $(\pi,G)$ over $\mathcal{D}$ . We use the term “complexity of the query $(\pi,G)$ ” to refer to the data complexity of the associated answering problem, and say, e.g., that $(\pi,G)$ is complete for polynomial time (for P) or for nondeterministic logarithmic space (NL) if the answering problem for $(\pi,G)$ is such.

Our main concern is how the data complexity of the query answering problem is affected by the features of $\pi$ . The following theorem relies on similar results obtained for temporal deductive databases [13, 12] and temporal description logics [6, 7, 22].

Theorem 4.

Answering (monadic) $\textsl{datalog}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ queries is PSpace-complete for data complexity; answering linear (monadic) $\textsl{datalog}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ queries is NL-complete.

Proof (Sketch).

For full $\textsl{datalog}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ , PSpace-completeness can be shown by reusing the techniques for temporal deductive databases [13, 12]. However, we prove that a linear query can be answered in NL. Indeed, fix a linear query $(\pi,G)$ . Without loss of generality, we assume that temporalised IDB atoms in rule bodies of $\pi$ have the form ${\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}^{k}C(\overline{Y})$ , where $|k|\leqslant 1$ (the cases of $\Diamond$ / $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ and consecutive $\bigcirc$ / $\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}$ can be expressed via recursion). Given a temporal database $\mathcal{D}$ , tuples of objects $\boldsymbol{c}$ and $\boldsymbol{d}$ from $\Delta_{\mathcal{D}}$ , and $\ell\in\mathbb{Z}$ , we write $C(\boldsymbol{c})\leftarrow_{\ell,k}D(\boldsymbol{d})$ if $\pi$ has a rule $C(\overline{X})\leftarrow\varphi(\overline{X},\overline{Y},\overline{U})\wedge% {\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}^{k}D(\overline{Y})$ such that $\mathcal{D},\ell\models\exists\overline{U}\varphi(\boldsymbol{c},\boldsymbol{d% },\overline{U})$ . Analogously, we write $C(\boldsymbol{c})\leftarrow_{\ell}$ if there is an initialisation rule $C(\overline{X})\leftarrow\varphi(\overline{X},\overline{U})$ and $\mathcal{D},\ell\models\exists\overline{U}\varphi(\boldsymbol{c},\overline{U})$ . Then, given a linear $\textsl{datalog}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ query $(\pi,G)$ , we have $\mathcal{D},\pi,\ell\models G(\boldsymbol{d})$ if and only if $G=C_{0}$ , $\boldsymbol{d}=\boldsymbol{c}_{0}$ , and there exists a sequence

\displaystyle C_{0}(\boldsymbol{c}_{0})\leftarrow_{k_{0}}C_{1}(\boldsymbol{c}_% {1})\leftarrow_{k_{1}}\dots\leftarrow_{k_{n-1}}C_{n}(\boldsymbol{c}_{n})

(12)

such that $C_{i}\in\textit{IDB}(\pi)$ , tuples $\boldsymbol{c}_{i}$ are from $\Delta_{\mathcal{D}}$ , $\ell_{0}=\ell$ , $\ell_{i+1}=\ell_{i}+k_{i}$ for $0\leq i<n$ , $C_{i}(\boldsymbol{c}_{i})\leftarrow_{\ell_{i},k_{i}}C_{i+1}(\boldsymbol{c}_{i+% 1})$ , and $C_{n}(\boldsymbol{c}_{n})\leftarrow_{\ell_{n}}$ . Let $\mathrm{tem}(\mathcal{D})=[l,r]$ . Using property (10), we observe that a rule $C(\boldsymbol{c})\leftarrow_{\ell,k}D(\boldsymbol{d})$ either holds or does not hold simultaneously for all $\ell>r+N$ , where $N$ is the number of temporal operators in $\pi$ , and, similarly, for all $\ell<l-N$ . Now, consider a sequence (12) and $\ell$ , where all $\ell_{i}>r+N$ (the case for $\ell<l-N$ is analogous). Any loop of the form $C_{i}(\boldsymbol{c}_{i})\leftarrow_{k_{i}}\dots\leftarrow_{k_{j-1}}C_{j}(% \boldsymbol{c}_{j})$ in it with $C_{i}(\boldsymbol{c}_{i})=C_{j}(\boldsymbol{c}_{j})$ can be removed as long as in the resulting sequence

C_{0}(\boldsymbol{c}_{0})\leftarrow_{k_{0}}C_{1}(\boldsymbol{c}_{1})\leftarrow% _{k_{1}}\dots\leftarrow_{k_{i-1}}C_{i}(\boldsymbol{c}_{i})\leftarrow_{k_{j}}C_% {j+1}(\boldsymbol{c}_{j+1})\dots\leftarrow_{k_{n-1}}C_{n}(\boldsymbol{c}_{n}),

the sum of all $k_{t}$ remains $\geq 0$ . This allows us to convert any sequence (12) to a sequence with the same $C_{0}$ in the beginning, the same $C_{n}$ in the end, and where all $\ell_{i}$ do not exceed $\ell+O(|\textit{IDB}(\pi)|\cdot|\Delta_{\mathcal{D}}|^{a})$ , for $a$ equal to the maximal arity of a relation in $\textit{IDB}(\pi)$ . This means that, for $\ell\in\mathrm{tem}(\mathcal{D})$ , we can check $\mathcal{D},\pi,\ell\models G(\boldsymbol{d})$ using timestamps in the range $[l-O(|\pi|\cdot|\Delta_{\mathcal{D}}|^{a}),r+O(|\pi|\cdot|\Delta_{\mathcal{D}}% |^{a})]$ . Clearly, the existence of such a derivation can be checked in NL. $\hfill\blacktriangleleft$

However, individual queries may be easier to answer than in the general case. Since $\textsl{datalog}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ combines features of plain datalog and of linear temporal logic, its queries can correspond to a variety of complexity classes. Recall, for example, queries $(\pi_{1},\textit{Good})$ and $(\pi_{2},\textit{Satisfactory})$ from Section 1, the first of which is hard for logarithmic space (L-hard) and the second lies in $\textsc{AC}^{0}$ , the class of problems decidable by unbounded fan-in, polynomial size and constant depth boolean circuits. Furthermore, by using unary relations and operator $\bigcirc$ , one can simulate any regular language, giving rise to queries that lie in ${\textsc{ACC}^{0}}$ , the class obtained from $\textsc{AC}^{0}$ by allowing “MOD $m$ ” gates, or are complete for ${\textsc{NC}^{1}}$ , the class defined similarly for bounded fan-in polynomial circuits of logarithmic depth. Intuitively, the problems in $\textsc{AC}^{0}$ , $\textsc{ACC}^{0}$ , and $\textsc{NC}^{1}$ are solvable in short (constant or logarithmic) time on a parallel architecture; see [40] for more details.

In the remainder of the paper we focus on deciding the data complexity for the linear monadic fragment of $\textsl{datalog}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3pt}{$% \diamond$}}}}$ , denoted $\textsl{datalog}_{\it lm}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3% pt}{$\diamond$}}}}$ . It is well-known that a plain datalog query can be characterised via an infinite set of conjunctive queries called its expansions [34]. We define expansions for $\textsl{datalog}_{\it lm}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3% pt}{$\diamond$}}}}$ and use them as the main tool in our (un)decidability proofs.

2.1.1 Expansions for Linear Monadic Queries

Let $\pi$ be a $\textsl{datalog}_{\it lm}^{\,\scriptscriptstyle\bigcirc{\smash{\raisebox{-1.3% pt}{$\diamond$}}}}$ program and $Q(X)$ be a unary temporal conjunctive query with a single answer variable and containing a unique IDB atom, say $D(Y)$ , from $\pi$ . Let $P(X)$ be another temporal conjunctive query with a single answer variable, and let $P^{\prime}(Y)$ be obtained from $P(X)$ by substituting $X$ by $Y$ and all other variables with fresh ones. A composition of $Q$ and $P$ , denoted $Q\circ P$ , has the form of $Q$ with $D(Y)$ substituted with $P^{\prime}(Y)$ . We note that the variables of $Q$ remain present in $Q\circ P$ and $X$ is an answer variable of $Q\circ P$ . If $P$ contains an IDB atom and $K(X)$ is another temporal conjunctive query, the composition can be extended in the same fashion to $(Q\circ P)\circ K$ , and so on. Note that, up to renaming of variables, $(Q\circ P)\circ K$ and $Q\circ(P\circ K)$ are the same queries, so we will omit the brackets and write $Q\circ P\circ K$ .

Expansions are compositions of rule bodies of a program $\pi$ . Let $B_{1},B_{2},\dots,B_{n-1}$ be such that $B_{i}$ is the body of the recursive rule:

C_{i}(X)\leftarrow A_{i}(X,Y,U_{1},\dots,U_{m_{i}})\ \wedge\ {\raisebox{1.0763% 9pt}{\text{\scriptsize{$\bigcirc$}}}}^{k_{i}}C_{i+1}(Y),

(13)

and $B_{n}$ is the body of an initialization rule

C_{n}(X)\leftarrow B(X,V_{1},\dots,V_{m_{n}}).

(14)

The composition $B_{1}\circ\dots\circ B_{n}$ is called an expansion of $(\pi,C_{1})$ , and $n$ is its length. The set of all expansions of $(\pi,C_{1})$ is denoted $\textit{expand}(\pi,C_{1})$ . Moreover, let $\Gamma^{r}_{\pi}$ be the set of all recursive rule bodies of $\pi$ and $\Gamma^{i}_{\pi}$ be the set of all initialization rule bodies. Then each expansion can be regarded (by omitting the symbol $\circ$ ) as a word in $(\Gamma^{r}_{\pi})^{*}\Gamma^{i}_{\pi}$ , and $\textit{expand}(\pi,C_{1})$ as a sublanguage of $(\Gamma^{r}_{\pi})^{*}\Gamma^{i}_{\pi}$ . Adopting a language-theoretic notation, we will use small Latin letters $w, u, v$ , etc. to denote expansions. To highlight the fact that each expansion is a temporal conjunctive query with the answer variable $X$ , we sometimes write $w(X)\in\textit{expand}(\pi,C)$ .

It is a direct generalization from the case of plain datalog [34] that $\mathcal{D},\pi,\ell\models C(d)$ if and only if there exists $w(X)\in\textit{expand}(\pi,C)$ such that $D_{\ell}\models w(d)$ .

3 Undecidability for Queries with $\Diamond$

Our first result about deciding the complexity of a given query is negative.

Theorem 5.

It is undecidable whether a given $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{-1.0pt}{$\diamond$}}}$ -query can be answered in $\textsc{AC}^{0}$ , $\textsc{ACC}^{0}$ , or $\textsc{NC}^{1}$ (if ${\textsc{NC}^{1}}\neq\textsc{NL}$ ). It is undecidable whether the query is L-hard (if $\textsc{L}\neq\textsc{NL}$ ).

The proof is by a reduction from the halting problem of 2-counter machines [33]. Namely, given a 2-counter machine $M$ we construct a query $(\pi_{M},G)$ that is in $\textsc{AC}^{0}$ if $M$ halts and NL-complete otherwise.

Recall, that a 2-counter machine is defined by a finite set of states $S=\{s_{0},\dots,s_{n}\}$ , with a distinguished initial state $s_{0}$ , two counters able to store non-negative integers, and a transition function $\Theta$ . On each step the machine performs a transition by changing its state and incrementing or decrementing each counter by 1 with a restriction that their values remain non-negative. The next transition is chosen according to the current state and the values of the counters. However, the machine is only allowed to perform zero-tests on counters and does not distinguish between two different positive values. Formally, transitions are given by a partial function $\Theta$ :

S\times\{0,+\}\times\{0,+\}\rightarrow S\times\{-1,0,1\}\times\{-1,0,1\}.

(15)

Let $\mathrm{sgn}(0)=0$ and $\mathrm{sgn}(k)=+$ for all $k>0$ . A computation of $M$ is a sequence of configurations:

(s_{0},a_{0},b_{0}),(s_{1},a_{1},b_{1}),(s_{2},a_{2},b_{2})\dots(s_{m},a_{m},b% _{m}),

(16)

such that for each $i,0\leqslant i<m$ , holds $\Theta(s_{i},\mathrm{sgn}(a_{i}),\mathrm{sgn}(b_{i}))=(s_{i+1},\varepsilon_{1}% ,\varepsilon_{2})$ and $a_{i+1}=a_{i}+\varepsilon_{1}$ , $b_{i+1}=b_{i}+\varepsilon_{2}$ . We assume that $a_{0},b_{0}=0$ and $\Theta$ is such that $a_{i},b_{i}\geqslant 0$ for all $i$ . We call $m$ the length of the computation. We say that $M$ halts if $\Theta(s_{m},\mathrm{sgn}(a_{m}),\mathrm{sgn}(b_{m}))$ is not defined in a computation. Thus, $M$ either halts after $m$ steps or goes through infinitely many configurations.

Let $M$ be a 2-counter machine. We construct a connected linear query $(\pi_{M},G)$ using the operator $\Diamond$ only, such that its evaluation is in $\textsc{AC}^{0}$ if $M$ halts, but becomes NL-complete otherwise. The construction with the operator $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ instead of $\Diamond$ is symmetric.

We set the EDB schema $\Sigma=\{T,U_{1},U_{2}\}$ of three relations which stand for “transition”, “first counter update”, and “second counter update”, respectively. Intuitively, domain elements of a temporal database represent configurations of $M$ , and role triples of $T$ , $U_{1}$ and $U_{2}$ , arranged according to certain rules described below, will play the role of transitions. A sequence of nodes connected by such triples will thus represent a computation of $M$ . Our program $\pi_{M}$ will generate an expansion along such a sequence, trying to assign to each configuration an IDB that represents a state of $M$ , which will be possible while the placement of the connecting roles on the temporal line follows the rules of $\Theta$ . If the machine halts, there is a maximum number of steps we can make, so the check can be done in $\textsc{AC}^{0}$ . If $M$ does not halt, however, it has arbitrarily long computations, and the query evaluation becomes NL-complete.

Here are the details. A configuration $(s,a,b)$ is represented by an object $d$ and three timestamps $\ell_{0},\ell_{1},\ell_{2}$ such that $\ell_{1}=\ell_{0}+a$ and $\ell_{2}=\ell_{0}+b$ . The values of the counters $a$ and $b$ are indicated, respectively, by existence of connections $U_{1}^{\ell_{1}}(d,d^{\prime})$ and $U_{2}^{\ell_{2}}(d,d^{\prime})$ to some object $d^{\prime}$ that is supposed to represent the next configuration in the computation. For the transition to happen, we also require $T^{\ell_{0}}(d,d^{\prime})$ . Given a computation of the form (16), the corresponding computation path is a pair $({\langle}d_{0},\dots,d_{n}{\rangle},\ell_{0})$ , where for each $i,0\leqslant i<n$ , there are $T^{\ell_{0}}(d_{i},d_{i+1})$ , $U_{1}^{\ell_{0}+a_{i}}(d_{i},d_{i+1})$ , and $U_{2}^{\ell_{0}+b_{i}}(d_{i},d_{i+1})$ in the database, with an additional requirement that $a_{0}=b_{0}=0$ . Two types of problems may be encountered on such a path. A representation violation occurs when an object has more than one outgoing edge of type $T$ , $U_{1}$ , or $U_{2}$ . A transition violation of type $(s_{i},\alpha,\beta)$ is detected when there are two consecutive nodes $d_{i},d_{i+1}$ , $\alpha=\mathrm{sgn}(a_{i}),\beta=\mathrm{sgn}(b_{i})$ , and $\Theta(s_{i},\alpha,\beta)\neq(s_{i+1},a_{i+1}-a_{i},b_{i+1}-b_{i})$ . The program $\pi_{M}$ will look for such violations. It will have an IDB relation symbol $S_{i}$ per each state $s_{i}$ of $M$ . The initialisation rules allow to infer any state IDB from a representation violation, and the IDB $S_{i}$ from a transition violation of type $(s_{i},\alpha,\beta)$ . The recursive rules then push the state IDB up a computation path following the rules of $\Theta$ and tracing “backwards” a computation of $M$ . Once $\pi_{M}$ infers the initial state IDB at a position with zero counter values, we are done. It remains to give the rules explicitly.

We use a shortcut $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}^{*}$ to mean a “reflexive” version of $\Diamond$ , i.e. $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}^{*}\varphi\equiv\raisebox{0.43% 057pt}{\text{\small{$\Diamond$}}}\varphi\vee\varphi$ . Clearly, every rule $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}^{*}$ can be rewritten to an equivalent set of rules without it. We need the rules:

	$\displaystyle S_{i}(X)\leftarrow\raisebox{0.43057pt}{\text{\small{$\Diamond$}}% }^{*}RV(X),\quad 0\leqslant i\leqslant n\$	$\displaystyle RV(X)\leftarrow T(X,Y)\wedge\raisebox{0.43057pt}{\text{\small{$% \Diamond$}}}T(X,Z)$		(17)
	$\displaystyle RV(X)\leftarrow U_{1}(X,Y)\wedge\raisebox{0.43057pt}{\text{% \small{$\Diamond$}}}U_{1}(X,Z),\$	$\displaystyle RV(X)\leftarrow U_{2}(X,Y)\wedge\raisebox{0.43057pt}{\text{% \small{$\Diamond$}}}U_{2}(X,Z)$		(18)

to detect representation violations. For transition violations, we first define IDBs $NE_{c}^{\varepsilon}$ , where $c\in{1,2}$ stand for the respective counter and $\epsilon\in\{-1,0,1\}$ stands for the change of that counter value in a transition. Each $NE_{c}^{\varepsilon}$ detects situations when a correct transition was not executed, e.g. having $\epsilon=-1$ , the timestamps $\ell$ and $\ell^{\prime}$ that are marked by an outgoing $U_{c}$ in consecutive configurations satisfy $\ell-1>\ell^{\prime}$ , $\ell=\ell^{\prime}$ , or $\ell<\ell^{\prime}$ , which they should not. The rules are the following:

$\displaystyle NE_{c}^{-1}(Y)\leftarrow U_{c}(Y,Z)\wedge U_{c}(X,Y)$	$\displaystyle NE_{c}^{-1}(Y)\leftarrow U_{c}(Y,Z)\wedge\raisebox{0.43057pt}{% \text{\small{$\Diamond$}}}\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}U_{c}% (X,Y)$	(19)
$\displaystyle NE_{c}^{-1}(Y)\leftarrow U_{c}(X,Y)\wedge\raisebox{0.43057pt}{% \text{\small{$\Diamond$}}}U_{c}(Y,Z)$	$\displaystyle NE_{c}^{1}(Y)\leftarrow U_{c}(Y,Z)\wedge U_{c}(X,Y)$	(20)
$\displaystyle NE_{c}^{1}(Y)\leftarrow U_{c}(Y,Z)\wedge\raisebox{0.43057pt}{% \text{\small{$\Diamond$}}}U_{c}(X,Y)$	$\displaystyle NE_{c}^{1}(Y)\leftarrow U_{c}(X,Y)\wedge\raisebox{0.43057pt}{% \text{\small{$\Diamond$}}}\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}U_{c}% (Y,Z)$	(21)
$\displaystyle NE_{c}^{0}(Y)\leftarrow U_{c}(Y,Z)\wedge\raisebox{0.43057pt}{% \text{\small{$\Diamond$}}}U_{c}(X,Y)$	$\displaystyle NE_{c}^{0}(Y)\leftarrow U_{c}(X,Y)\wedge\raisebox{0.43057pt}{% \text{\small{$\Diamond$}}}U_{c}(Y,Z)$	(22)

for $c\in\{1,2\}$ . Then, again, any state can be inferred from a violation:

\displaystyle S_{i}(X)\leftarrow\raisebox{0.43057pt}{\text{\small{$\Diamond$}}% }^{*}NE_{1}^{\varepsilon_{1}}(Y)\wedge\raisebox{0.43057pt}{\text{\small{$% \Diamond$}}}^{\alpha}U_{1}(X,Y)

\displaystyle S_{i}(X)\leftarrow\raisebox{0.43057pt}{\text{\small{$\Diamond$}}% }^{*}NE_{2}^{\varepsilon_{2}}(Y)\wedge\raisebox{0.43057pt}{\text{\small{$% \Diamond$}}}^{\,\beta}U_{2}(X,Y)

(23)

for each transition $\Theta(s_{i},\alpha,\beta)=(s_{j},\varepsilon_{1},\varepsilon_{2})$ , and for all $\varepsilon_{1},\varepsilon_{2}$ when $\Theta(s_{i},\alpha,\beta)$ is not defined. Finally, we require

		$\displaystyle S_{i}(X)\leftarrow T(X,Y)\wedge\raisebox{0.43057pt}{\text{\small% {$\Diamond$}}}^{\alpha}U_{1}(X,Y)\land\raisebox{0.43057pt}{\text{\small{$% \Diamond$}}}^{\,\beta}U_{2}(X,Y)\wedge S_{j}(Y),$		(24)
		$\displaystyle G(X)\leftarrow S_{0}(X)\wedge U_{1}(X,Y)\wedge U_{2}(X,Y).$		(25)

This finalises the construction of $(\pi_{M},G)$ . Any expansion of $(\pi_{M},G)$ starts with the body of the rule (25) and then continues by a sequence of bodies of the rules of the form (24) and ends with a detection either of a representation violation, defined by (17) – (18), or of a transition violation, defined by (23). If $M$ halts in $m$ steps, it is enough to consider expansions containing no more than $m$ bodies of (24). Thus, in this case $(\pi_{M},G)$ is in $\textsc{AC}^{0}$ . If, on the contrary, $M$ does not halt, we can use expansions representing arbitrarily long prefixes of its computation for a reduction from directed reachability problem, rendering the query NL-hard.

Lemma 6.

If $M$ halts, $(\pi_{M},G)$ is in $\textsc{AC}^{0}$ . Otherwise, it is NL-hard.

4 Automata-Theoretic Tools for Queries with $\bigcirc$

From now on, we focus on queries that use operators $\bigcirc$ and $\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}$ only. In this section, we develop a generalisation of the automata-theoretic approach to analysing query expansions proposed in [15]. In Section 5 we use this approach to study the data complexity of this kind of queries.

Recall from Section 2.1.1 the definitions of composition and expansion of rule bodies, alphabets $\Gamma_{\pi}^{r}$ and $\Gamma_{\pi}^{i}$ , and the language $\textit{expand}(\pi,G)\subseteq(\Gamma_{\pi}^{r})^{*}(\Gamma_{\pi}^{i})$ . We observe that for any sequence of rule bodies $B_{1},\dots,B_{n-1}\in\Gamma_{\pi}^{r}$ and $B_{n}\in\Gamma_{\pi}^{i}$ the compositions $B_{1}\circ\dots\circ B_{n-1}$ and $B_{1}\circ\dots\circ B_{n}$ are well-defined. They are words in the languages $(\Gamma_{\pi}^{r})^{*}$ and $(\Gamma_{\pi}^{r})^{*}(\Gamma_{\pi}^{i})$ , respectively. Note that $B_{1}\circ\dots\circ B_{n}$ is a temporal CQ over schema $\textit{EDB}(\pi)$ , while $B_{1}\circ\dots\circ B_{n-1}$ is that over $\textit{EDB}(\pi)\cup\textit{IDB}(\pi)$ , since it contains the IDB atom of $B_{n-1}$ . For $w\in(\Gamma_{\pi}^{r})^{*}(\Gamma_{\pi}^{i})$ , $\mathcal{D}_{w}$ is defined as the (temporal) database corresponding to (temporal) CQ $w$ , while for $w\in(\Gamma_{\pi}^{r})^{*}$ we define $\mathcal{D}_{w}$ as the database corresponding to the CQ obtained from $w$ by omitting the IDB atom. For either $w$ , $\mathcal{D}_{w}$ is over the schema $\textit{EDB}(\pi)$ . Having that, we define the language $\textit{accept}(\pi,G)\subseteq(\Gamma_{\pi}^{r})^{*}\cup(\Gamma_{\pi}^{r})^{*% }(\Gamma_{\pi}^{i})$ of all words $w\in(\Gamma_{\pi}^{r})^{*}\cup(\Gamma_{\pi}^{r})^{*}(\Gamma_{\pi}^{i})$ such that $\mathcal{D}_{w},\pi,0\models G(X)$ .

Plain datalog queries are either in $\textsc{AC}^{0}$ (called bounded), or L-hard (unbounded), and a criterion of unboundedness can be formulated in language-theoretic terms [15]: a connected linear monadic plain datalog query $(\pi,G)$ is unbounded if and only if for every $k$ there is $w\in\textit{expand}(\pi,G)$ , $|w|>k$ , such that its prefix of length $k$ is not in $\textit{accept}(\pi,G)$ .

Example 7.

The query $(\pi,G)$ , where $\pi$ is given by the following plain datalog rules, is unbounded.

	$\displaystyle G(X)\leftarrow R(X,Y)\wedge S(Y,X)\wedge G(Y)$		(26)
	$\displaystyle G(X)\leftarrow A(X)$		(27)

However, if we substitute the rule (27) with another initialisation rule

\displaystyle G(X)\leftarrow S(X,Y)\wedge S(Y,Z)

(28)

it becomes bounded because for every $w\in\textit{expand}(\pi,G)$ , $|w|>2$ its prefix of length $2$ is in $\textit{accept}(\pi,G)$ . To see this, consider the expansions of the query, i.e., $\textit{expand}(\{\eqref{rule:pure-datalog-recursive},\eqref{rule:pure-datalog% -initial-bounded}\},G)$ given in Figure 2.

The authors of [15] construct finite state automata for languages $\textit{expand}(\pi,G)$ and $\textit{notaccept}(\pi,G)$ , the complement of $\textit{accept}(\pi,G)$ , and use them to check their criterion in polynomial space. Our goal is to generalise this technique to the temporalised case. However, we can not use finite automata to work directly with sequences of rule bodies, since the language $\textit{accept}(\pi,G)$ , in the presence of time, may be non-regular, as demonstrated by the following example.

Example 8.

Consider a program

	$\displaystyle G(X)\leftarrow R(X,Y)\wedge G(Y)$	$\displaystyle G(X)\leftarrow A(X)\wedge{\raisebox{1.07639pt}{\text{\scriptsize% {$\bigcirc$}}}}G(X)$
	$\displaystyle G(X)\leftarrow P(X)\wedge{\raisebox{1.07639pt}{\text{\scriptsize% {$\bigcirc$}}}^{-}}G(X)$	$\displaystyle G(X)\leftarrow P(X)\wedge R(Y,X).$

Denote its rule bodies $B_{1}(X,Y)=R(X,Y)\wedge G(Y)$ , $B_{2}(X)=A(X)\wedge{\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}G(X)$ , and $B_{3}(X)=P(X)\wedge{\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}}G% (X)$ , and the composition $w=B_{1}B_{2}B_{2}B_{3}B_{3}B_{3}$ . The composition $w$ is in $\textit{accept}(\pi,G)$ , and the corresponding database $\mathcal{D}_{w}$ is given in Figure 3(a). In general, a composition of the form $B_{1}B_{2}^{n}B_{3}^{k}$ is in $\textit{accept}(\pi,G)$ if and only if $n<k$ , which, by a simple application of the pumping lemma, is a non-regular language.

To overcome this, we introduce a larger alphabet and define more general versions of the languages $\textit{expand}(\pi,G)$ and $\textit{accept}(\pi,G)$ to regain their regularity. Recall that every composition $w$ of rule bodies gives rise to a temporal database $\mathcal{D}_{w}$ . Instead of working with $w$ , we use an exponentially larger alphabet $\Omega=2^{\Gamma_{\pi}^{r}\cup\Gamma_{\pi}^{i}\cup\{\bot,\top\}}$ to describe $\mathcal{D}_{w}$ as a word and define analogues of $\textit{expand}(\pi,G)$ and $\textit{accept}(\pi,G)$ over that alphabet. Consider a recursive rule

D(X)\leftarrow{\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}^{k_{1}}R_% {1}(\overline{U}_{1})\wedge\dots\wedge{\raisebox{1.07639pt}{\text{\scriptsize{% $\bigcirc$}}}}^{k_{s}}R_{s}(\overline{U}_{s})\wedge{\raisebox{1.07639pt}{\text% {\scriptsize{$\bigcirc$}}}}^{k}E(Y),

(29)

where $E(Y)$ is the unique IDB atom in the rule body and $k_{i},k\in\mathbb{Z}$ . We call such a rule horizontal if $X=Y$ and vertical otherwise. In a composition $w\in(\Gamma^{r}_{\pi})^{*}\cup(\Gamma^{r}_{\pi})^{*}\Gamma_{\pi}^{i}$ , a vertical (horizontal) segment is a maximal subword that consists of vertical (respectively, horizontal) rule bodies. For every composition $w$ we define the description $[w]\in\Omega^{*}$ of the respective database $\mathcal{D}_{w}$ as follows. Let $w=x_{1}y_{1}x_{2}y_{2}\dots x_{n}y_{n}$ , where $x_{i}$ are vertical segments and $y_{i}$ are horizontal segments, with $x_{1}$ and $y_{n}$ possibly empty. For each $x_{i}=B_{1}\dots B_{n}$ we set $[x_{i}]$ to be the sequence $\{B_{1}\}\dots\{B_{n}\}$ of singleton sets, each of which contains a vertical rule body. For each $y_{i}=B_{1}\dots B_{n}$ , we construct $[y_{i}]$ in $n$ steps, as follows. Recall that $B_{j}$ , $1\leqslant j<n$ , has the form $A_{j}(X,\mathbf{U})\wedge{\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}% }^{m_{j}}D(X)$ , where $D(X)$ is the unique IDB atom in $B_{j}$ . Let $\ell_{1}=0$ and $\ell_{j}=\sum_{i=1}^{j-1}m_{i}$ for $j\leq n$ . Intuitively, $\ell_{j}$ is the moment of time where the body $B_{j}$ lands in the composition $B_{1}\circ\dots\circ B_{n}$ . Let $\ell^{\prime}_{1},\dots,\ell^{\prime}_{s}$ be the ordering of the the numbers in the set $\{\ell_{j}\}_{j=1}^{n}$ in the increasing order. We set, $\alpha_{\ell^{\prime}_{k}}$ to be $\{B_{j}\mid\ell_{j}=\ell^{\prime}_{k}\}$ for $\ell^{\prime}_{k}\in\{\ell_{1},\ell_{n}\}$ ; $\{B_{j}\mid\ell_{j}=\ell_{1}\}\cup\{\bot\}$ for $\ell^{\prime}_{k}=\ell_{1}\neq\ell_{n}$ ; $\{B_{j}\mid\ell_{j}=\ell_{n}\}\cup\{\top\}$ for $\ell^{\prime}_{k}=\ell_{n}\neq\ell_{1}$ ; and $\{B_{j}\mid\ell_{j}=\ell_{1}\}\cup\{\top,\bot\}$ for $\ell^{\prime}_{k}=\ell_{1}=\ell_{n}$ . Additionally, we set $\alpha_{k}=\emptyset$ for $k\in[\ell^{\prime}_{1},\ell^{\prime}_{s}]\setminus\{\ell^{\prime}_{1},\dots,% \ell^{\prime}_{s}\}$ . Now we take $[y_{i}]=\alpha$ .

Finally, $[w]=[x_{1}][y_{1}]\dots[x_{n}][y_{n}]$ . We use the symbol $\Lambda$ to refer to letters of $[w]$ . We call letters of the form $\{B\}$ , where $B$ is a vertical rule body, vertical letters, and the rest – horizontal letters. Consequently, we can speak of vertical and horizontal segments of $[w]$ , meaning maximal segments composed of vertical (respectively, horizontal) letters only.

Intuitively, $[w]$ describes $\mathcal{D}_{w}$ , which can be seen as composed of $\mathcal{D}_{x_{1}},\mathcal{D}_{y_{1}},\dots,\mathcal{D}_{x_{n}},\mathcal{D}_% {y_{n}}$ , described by $[x_{1}],[y_{1}],\dots,[x_{n}],[y_{n}]$ , respectively. The symbol $\bot$ , representing a vertical line meeting a horizontal one, marks the point in time where $\mathcal{D}_{x_{i}}$ is connected to $\mathcal{D}_{y_{i}}$ , while the symbol $\top$ , analogously, shows where $\mathcal{D}_{y_{i}}$ is connected to $\mathcal{D}_{x_{i+1}}$ .

Example 9.

Recall the rule bodies of Example 8 and consider the composition $w=B_{1}^{2}B_{2}^{2}B_{3}^{5}B_{2}B_{1}$ . The corresponding $\mathcal{D}_{w}$ is depicted in Figure 3(b). Then $x_{1}=B_{1}^{2}$ , $y_{1}=B_{2}^{2}B_{3}^{5}B_{2}$ , $x_{2}=B_{1}$ and $y_{2}$ is empty. Thus, $[w]$ is equal to

\{B_{1}\}\{B_{1}\}\{B_{2}\}\{\top,B_{3}\}\{B_{3}\}\{\bot,B_{2},B_{3}\}\{B_{2},% B_{3}\}\{B_{3}\}\{B_{1}\}

(30)

Not every word over $\Omega$ correctly describes a temporal database. This motivates the following definition: $\alpha\in\Omega^{*}$ is correct if (i) every symbol $\Lambda$ is either a singleton (standing for a vertical rule body) or a set of horizontal rule bodies, with a possible addition of $\bot,\top$ , and (ii) every horizontal segment of $\alpha$ preceded by a vertical segment contains exactly one $\bot$ , and every horizontal segment followed by a vertical one – exactly one $\top$ . A correct word $\alpha\in\Omega^{*}$ describes a temporal database $\mathcal{D}_{\alpha}$ similarly to how $[w]$ describes $\mathcal{D}_{w}$ . Formally, break $\alpha$ into vertical/horizontal segments $\chi_{1}\upsilon_{1}\dots\chi_{n}\upsilon_{n}$ . For a vertical segment $\chi_{i}=\{B_{1}\}\dots\{B_{n}\}$ , set $Q_{\chi_{i}}=B_{1}\circ\dots\circ B_{n}$ . For a horizontal segment $\upsilon_{i}=\Lambda_{1}\dots\Lambda_{s}$ , let $\Lambda_{j_{\bot}}$ be the one containing $\bot$ and $\Lambda_{j_{\top}}$ – containing $\top$ . Then $Q_{\upsilon_{i}}$ is the conjunction of rule bodies from $\upsilon_{i}$ , where each $B\in\Lambda_{j}$ is prefixed by ${\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}^{j-j_{\bot}}$ , plus, if $i\neq n$ , an IDB atom ${\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}^{j_{\top}-j_{\bot}}D(X)$ . Finally, set $\mathcal{D}_{\alpha}=\mathcal{D}_{Q_{\alpha}}$ for $Q_{\alpha}=Q_{\chi_{1}}\circ Q_{\upsilon_{1}}\circ\dots\circ Q_{\chi_{n}}\circ Q% _{\upsilon_{n}}$ . This $Q_{\alpha}$ will be also useful further.

We are now ready to define languages over $\Omega$ that will be useful to study the data complexity of our queries. Let ${\textit{Accept}}(\pi,G)$ be the language of correct words $\alpha$ such that $\mathcal{D}_{\alpha},\pi,0\models G(X)$ , with $X\in\Delta_{\mathcal{D}_{\alpha}}$ , and $\textit{NotAccept}(\pi,G)$ be its complement. We need to define the language of expansions over the alphabet $\Omega$ that we will use together with the language $\textit{NotAccept}(\pi,G)$ to formulate a criterion for L-hardness similar to the one of [15]. It would be natural to take the language of expansions as $\{[w]\mid w\in\textit{expand}(\pi,G)\}$ . However, it is harder to define an automaton recognising such a language than it is for the language of $[w]$ s as above where each horizontal letter $[w]_{i}$ may be extended (as a set) with arbitrary “redundant” rule bodies, and each horizontal segment may be extended, from the left and right, by “redundant” horizontal letters. For example, we will include into the language of expansions the word $\{B_{1}\}\{B_{1}\}\{B_{3}\}\{B_{2},B_{3}\}\{\top,B_{3}\}\{B_{2},B_{3}\}\{\bot,% B_{2},B_{3}\}\{B_{2},B_{3}\}\{B_{3}\}\{B_{1}\}$ alongside (30). It turns out that the latter language works for our required criteria as good as the former one. Formally, if $\alpha,\beta\in\Omega^{*}$ , we write $\alpha\preccurlyeq\beta$ if $\alpha=x_{1}y_{1}\dots x_{n}y_{n}$ and $\beta=x_{1}y^{\prime}_{1}\dots x_{n}y^{\prime}_{n}$ , where $x_{i}$ are vertical segments and $y_{i},y^{\prime}_{i}$ are horizontal segments, and if $y_{i}=a_{1}\dots a_{m}$ , $y^{\prime}_{i}=b_{1}\dots b_{s}$ , then there is $k\geqslant 0$ such that $m+k\leqslant s$ and $a_{j}\subseteq b_{j+k}$ , $1\leqslant j\leqslant m$ . We define $\textit{Expand}_{\preccurlyeq}(\pi,G)$ to be the set of correct words $\alpha\in\Omega^{*}$ such that $[w]\preccurlyeq\alpha$ for some $w\in\textit{expand}(\pi,G)$ .

It is important for our purposes that these languages are regular. For $\textit{Expand}_{\preccurlyeq}(\pi,G)$ , the rules of the program $\pi$ may be naturally seen as transition rules of a two-way automaton whose states are the IDBs of $\pi$ (plus a final state). The initial state is the one associated with $G$ , and the final state is reached by an application of an initialisation rule.

Lemma 10.

For any $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ -query $(\pi,G)$ , the language $\textit{Expand}_{\preccurlyeq}(\pi,G)$ is regular.

The case of $\textit{NotAccept}(\pi,G)$ is more involved. Generalising from [15], for an automaton to recognise if $\alpha\in\textit{NotAccept}(\pi,G)$ it suffices to guess (by nondeterminism) an enrichment $\mathcal{E}$ of $\mathcal{D}_{\alpha}$ , and check that $\mathcal{E}$ is a model of $\pi$ and that $\mathcal{E}$ does not contain the atom $G(X)$ . Moreover, since $\pi$ is connected, for $\mathcal{E}$ to be a model it is enough that every piece of $E$ of radius $|\pi|$ , in terms of Gaifman graph, satisfies all the rules. The idea is to precompute the answer for each such piece and encode them in the state-space of the automaton. The problem, specific to the temporalised case, is that enrichments are infinite in the temporal dimension. To resolve this, we observe that there are still finitely many EDB atoms in $\mathcal{E}$ . Since, once again, $\pi$ is connected, to check that rules with EDBs are satisfied it is enough to consider only those pieces of $\mathcal{E}$ that contain an EDB atom. For the rest of the rules, it suffices to check if such a finite piece can be extended into an infinitely in time to give a model of $\pi$ . The rules without EDBs can be seen as plain LTL rules, so we can employ satisfiability checking for LTL to perform this check.

Lemma 11.

For any $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ -query $(\pi,G)$ , the language $\textit{NotAccept}(\pi,G)$ is regular.

Corollary 12.

For any $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ -query $(\pi,G)$ , the language ${\textit{Accept}}(\pi,G)$ is regular.

5 Decidability for Connected Linear Queries with $\bigcirc$

We use the automata introduced in the previous section to prove a positive result.

Theorem 13.

(i) Every connected $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ query is either in $\textsc{AC}^{0}$ , or in ${\textsc{ACC}^{0}}\setminus{\textsc{AC}^{0}}$ , or $\textsc{NC}^{1}$ -complete, or L-hard. (ii) It is PSpace-complete to check whether such a query is L-hard; whether it belongs to $\textsc{AC}^{0}$ , ${\textsc{ACC}^{0}}$ , or is $\textsc{NC}^{1}$ -complete can be decided in ExpSpace.

We first deal with (i). Intuitively, L-hardness is a consequence of the growth of query expansions in the relational domain. If this growth is limited, the query essentially defines a certain temporal property, which can be checked in $\textsc{NC}^{1}$ . Formally, given a word $\alpha\in\Omega^{*}$ , we define the $0pt{\alpha}$ as the number of vertical letters in $\alpha$ . Then, we call a query vertically unbounded if for every $k$ there is a word $\alpha\in\textit{Accept}(\pi,G)$ , $0pt{\alpha}>k$ , such that every prefix of $\alpha$ of height $k$ is in $\textit{NotAccept}(\pi,G)$ . Otherwise, the query is called vertically bounded.

Vertically unbounded queries can be shown to be L-hard by a direct reduction from the undirected reachability problem. Namely, take the deterministic automata for $\textit{NotAccept}(\pi,G)$ and ${\textit{Accept}}(\pi,G)$ , supplied by Lemma 11 and Corollary 12, and apply the pumping lemma to obtain words $\xi,\upsilon,\zeta,\gamma$ such that $0pt{\upsilon}>0$ , $\xi\upsilon^{i}\zeta\in\textit{NotAccept}(\pi,G)$ and $\xi\upsilon^{i}\zeta\gamma\in{\textit{Accept}}(\pi,G)$ , for all $i\geqslant 0$ . Then, given a graph $\mathcal{G}$ and two nodes $s, t$ , use copies of $\mathcal{D}_{\upsilon}$ to simulate the edges of $\mathcal{G}$ , and attach $\mathcal{D}_{\xi}$ to $s$ and $\mathcal{D}_{\zeta\gamma}$ to $t$ . Thus, you will obtain a temporal database $\mathcal{D}_{\mathcal{G}}$ , where $\mathcal{D}_{\mathcal{G}},\pi,0\models G(s)$ if and only if there is a path from $s$ to $t$ .

Lemma 14.

If a query is vertically unbounded, then it is L-hard.

Observe that $\mathcal{D},\pi,\ell\models G(d)$ whenever $\mathcal{D},\ell\models Q_{\alpha}(d)$ for some $\alpha\in\textit{Accept}(\pi,G)$ . If $\mathcal{D},\ell\models Q_{\alpha}(d)$ , then $x_{1},y_{1},\dots,y_{n},x_{n}$ , the vertical and horizontal segments of $\alpha$ , appear in $\mathcal{D}$ , starting from $d$ at time $\ell$ . Expand $y_{i}$ to $y^{\prime}_{i}$ , the $\preccurlyeq$ -maximal horizontal segment fitting in $\mathcal{D}$ , and let $\beta=x_{1}y^{\prime}_{1}\dots x_{n}y^{\prime}_{n}$ . Then $\beta\in\textit{Accept}(\pi,G)$ and $\mathcal{D},\ell\models Q_{\beta}(d)$ . Thus, to check $\mathcal{D},\pi,\ell\models G(d)$ , it suffices to find vertical segments of some $\alpha\in\textit{Accept}(\pi,G)$ , taking $\preccurlyeq$ -maximal horizontal segments, and check that $\beta\in\textit{Accept}(\pi,G)$ . If $(\pi,G)$ is vertically bounded, there are finitely many vertical segments of interest. Finding vertical segments, as well as extracting $\preccurlyeq$ -maximal horizontal ones, can be done by an $\textsc{AC}^{0}$ circuit.

Lemma 15.

If $(\pi,G)$ is vertically bounded, then the data complexity of $(\pi,G)$ coincides with that of checking membership in ${\textit{Accept}}(\pi,G)$ , modulo reductions computable in $\textsc{AC}^{0}$ .

Recall that ${\textit{Accept}}(\pi,G)$ is a regular language, so checking membership in it is either in $\textsc{AC}^{0}$ , or ${\textsc{ACC}^{0}}\setminus{\textsc{AC}^{0}}$ , or $\textsc{NC}^{1}$ -complete [40]. This settles the part (i) of Theorem 13: a query is either vertically unbounded and thus L-hard, or vertically bounded and thus belongs to one of the three classes mentioned above.

It remains to address the part (ii) of Theorem 13. By a careful analysis of the proof of Lemma 11 one can show that $\textit{NotAccept}(\pi,G)$ is recognised by a nondeterministic automaton of size $2^{{\mathrm{poly}\left(|\pi|\right)}}$ . Further, checking whether its language belongs to ${\textsc{AC}^{0}}$ , ${\textsc{ACC}^{0}}\setminus{\textsc{AC}^{0}}$ , or is $\textsc{NC}^{1}$ -complete, can be done via the polynomial-space procedure developed in [31]. Thus, given a vertically bounded query, its data complexity can be established in exponential space.

For the vertical boundedness itself, note that substituting $\textit{Accept}(\pi,G)$ with $\textit{Expand}_{\preccurlyeq}(\pi,G)$ in the respective definition preserves all the results proven so far. For $\textit{Expand}_{\preccurlyeq}(\pi,G)$ , we can get from Lemma 10 an exponential-size one-way automaton. Checking that the query is vertically unbounded can be done in nondeterministic space logarithmic in the size of the automata for $\textit{Expand}_{\preccurlyeq}(\pi,G)$ and $\textit{NotAccept}(\pi,G)$ , as it boils down to checking reachability in their Cartesian product.

Lemma 16.

Checking if a connected $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ query is vertically bounded is in PSpace.

For the lower bound, we show that checking boundedness is already PSpace-hard for connected linear monadic queries in plain datalog. In fact, PSpace-hardness was proved in [15] for program boundedness of disconnected programs. A program $\pi$ is called bounded if $(\pi,G)$ is bounded for every IDB $G$ in $\pi$ . We were able to regain the connectedness of $\pi$ by focusing on query boundedness (also called predicate boundedness) instead. The idea combines that of [15] with that of Section 3: define an IDB $F$ that slides along a computation, this time of a space-bounded Turing machine, looking for an erroneous transition.

Lemma 17.

Deciding boundedness of connected linear monadic queries in plain datalog is PSpace-hard.

Lemmas 14 and 15 bring us to an interesting consideration: to understand the data complexity of a given query one should analyse its behaviour in the relational domain separately from that in the temporal domain. This can be given a precise sense as follows.

Proposition 18.

For every connected $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ query $(\pi,G)$ there exist a plain datalog query $(\pi_{d},G)$ and a plain LTL query $(\pi_{t},G)$ , such that:

1.

$(\pi,G)$ is vertically bounded if and only if $(\pi_{d},G)$ is bounded;
2.

if $(\pi,G)$ is vertically bounded then its data complexity coincides with that of $(\pi_{t},G)$ .

Technically, both $\pi_{d}$ and $\pi_{t}$ are obtained by simulating the deterministic automaton $\mathcal{A}_{(\pi,G)}$ for the language ${\textit{Accept}}(\pi,G)$ provided by Corollary 12. In both programs, the IDBs correspond to the states of $\mathcal{A}_{(\pi,G)}$ and EDBs to the letters of $\Omega$ . For $\pi_{t}$ these EDBs are unary and all the rules are horizontal, so that the expansions unwind fully in the temporal domain. In the case of $\pi_{d}$ , the EDBs are binary and every vertical transition of $\mathcal{A}_{(\pi,G)}$ is a step by a binary relation, while the horizontal transitions are skipped (thus, vertical boundedness becomes just boundedness). In both programs, initialisation rules correspond to $\mathcal{A}_{(\pi,G)}$ reaching an accepting state.

We conclude this section with a consideration on disconnected queries. In [15], deciding boundedness is based on the fact that $\textit{accept}(\pi,G)$ remains regular even when $\pi$ has disconnected rules. This is not the case for $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ , as can be seen from the following example.

Example 19.

Consider the program $\pi$ of four rules:

	$\displaystyle G(X)\leftarrow A(X)\wedge{\raisebox{1.07639pt}{\text{\scriptsize% {$\bigcirc$}}}}D(X)$	$\displaystyle D(X)\leftarrow{\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$% }}}}D(X)$
	$\displaystyle D(X)\leftarrow R(X,Y)\wedge{\raisebox{1.07639pt}{\text{% \scriptsize{$\bigcirc$}}}^{-}}B(Y)\wedge{\raisebox{1.07639pt}{\text{% \scriptsize{$\bigcirc$}}}^{-}}D(Y)$	$\displaystyle D(X)\leftarrow A(Y)\wedge B(X)$

Let $B_{1}=A(X)\wedge{\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}D(X)$ , $B_{2}={\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}}D(X)$ , and $B_{3}=R(X,Y)\wedge{\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}}B(% Y)\wedge{\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}}D(Y)$ . Then $B_{1}B_{2}^{n}B_{3}^{m}\in\textit{accept}(\pi,G)$ , and, consequently, $\{\bot,B_{1}\}\{B_{2}\}^{n}\{\top\}\{B_{3}\}^{m}\in\textit{Accept}(\pi,G)$ , whenever $m>n\geqslant 1$ . This property is not recognisable by any finite state automaton. For more general classes of automata suitable to recognise $\textit{accept}(\pi,G)$ or $\textit{Accept}(\pi,G)$ in the disconnected setting, the properties that are to be checked along the lines of [15], such as language emptiness or finiteness, become undecidable. Therefore, disconnected queries possibly require a different approach for analysing the data complexity.

6 Conclusions and Future Work

We have started investigating the complexity of determining the data complexity of answering monadic datalog queries with temporal operators. For linear connected queries with operators $\bigcirc$ / $\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}$ , we have generalised the automata-theoretic technique of [15], developed originally for plain datalog, to establish an ${\textsc{AC}^{0}}/{\textsc{ACC}^{0}}/{\textsc{NC}^{1}}/\textsc{L}/\textsc{NL}$ classification of temporal query answering and proved that deciding L-hardness of a given query is PSpace-complete, while checking its membership in $\textsc{AC}^{0}$ or $\textsc{ACC}^{0}$ can be done in ExpSpace. As a minor side product, we have established PSpace-hardness of deciding boundedness of atemporal connected linear monadic datalog queries. Rather surprisingly and in sharp contrast to the $\bigcirc$ / $\raisebox{1.07639pt}{\text{\scriptsize{$\bigcirc$}}}^{-}$ case, it turns out that checking (non-trivial) membership of queries with operators $\Diamond$ / $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}\!^{-}$ in the above complexity classes is undecidable. The results of this paper lead to a plethora of natural and intriguing open questions. Some of them are briefly discussed below.

1.

What happens if we disallow applications of $\raisebox{0.43057pt}{\text{\small{$\Diamond$}}}/\raisebox{0.43057pt}{\text{% \small{$\Diamond$}}}\!^{-}$ to binary EDB predicates in $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{-1.0pt}{$\diamond$}}}$ -queries? We conjecture that this restriction makes checking membership in the above complexity classes decidable. In fact, this conjecture follows from a positive answer to the next question.
2.

Can our decidability results for $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ be lifted to $\textsl{datalog}_{m}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle\bigcirc$}}}$ -queries? Dropping the linearity restriction in the atemporal case results in the extra data complexity class, P, and the higher complexity, 2ExpTime-completeness, of deciding boundedness. The upper bound was obtained using tree automata in [15], and we believe that this approach can be generalised to connected $\textsl{datalog}_{m}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle\bigcirc$}}}$ -queries in a way similar to what we have done above.
3.

On the other hand, dropping the connectedness restriction might turn out to be trickier, if at all possible, as shown by Example 19. Finding a new automata-theoretic characterisation for disconnected $\textsl{datalog}_{\it lm}^{\,\smash{\raisebox{0.0pt}{$\scriptscriptstyle% \bigcirc$}}}$ -queries remains a challenging open problem.
4.

A decisive step in understanding the data complexity of answering queries mediated by a description logic ontology and monadic disjunctive datalog queries was made in [9, 18] by establishing a close connection with constraint satisfaction problems (CSPs). In our case, quantified CSPs (see, e.g., [47]) seem to be more appropriate. Connecting the two areas might be beneficial to both of them.
5.

In the context of streaming data, it would be interesting to investigate the data complexity classes and the complexity of recognising them for datalogMTL-queries [10, 36, 44].

References

[1] Rajeev Alur and Thomas A. Henzinger. Real-time logics: Complexity and expressiveness. Inf. Comput., 104(1):35–77, 1993. doi:10.1006/inco.1993.1025.
[2] Peter Alvaro, William R. Marczak, Neil Conway, Joseph M. Hellerstein, David Maier, and Russell Sears. Dedalus: Datalog in time and space. In Oege de Moor, Georg Gottlob, Tim Furche, and Andrew Jon Sellers, editors, Datalog Reloaded - First International Workshop, Datalog 2010, Oxford, UK, March 16-19, 2010. Revised Selected Papers, volume 6702 of Lecture Notes in Computer Science, pages 262–281. Springer, 2010. doi:10.1007/978-3-642-24206-9_16.
[3] Alessandro Artale, Diego Calvanese, Roman Kontchakov, and Michael Zakharyaschev. The dl-lite family and relations. J. Artif. Intell. Res., 36:1–69, 2009. doi:10.1613/jair.2820.
[4] Alessandro Artale, Anton Gnatenko, Vladislav Ryzhikov, and Michael Zakharyaschev. On deciding the data complexity of answering linear monadic datalog queries with LTL operators (extended version), 2025. URL: https://arxiv.org/abs/2501.13762.
[5] Alessandro Artale, Roman Kontchakov, Alisa Kovtunova, Vladislav Ryzhikov, Frank Wolter, and Michael Zakharyaschev. First-order rewritability of ontology-mediated queries in linear temporal logic. Artif. Intell., 299:103536, 2021. doi:10.1016/j.artint.2021.103536.
[6] Alessandro Artale, Roman Kontchakov, Vladislav Ryzhikov, and Michael Zakharyaschev. The complexity of clausal fragments of LTL. In Kenneth L. McMillan, Aart Middeldorp, and Andrei Voronkov, editors, Logic for Programming, Artificial Intelligence, and Reasoning - 19th International Conference, LPAR-19, Stellenbosch, South Africa, December 14-19, 2013. Proceedings, volume 8312 of Lecture Notes in Computer Science, pages 35–52. Springer, 2013. doi:10.1007/978-3-642-45221-5_3.
[7] Alessandro Artale, Roman Kontchakov, Vladislav Ryzhikov, and Michael Zakharyaschev. A cookbook for temporal conceptual data modelling with description logics. ACM Trans. Comput. Log., 15(3):25:1–25:50, 2014. doi:10.1145/2629565.
[8] Michael Benedikt, Balder ten Cate, Thomas Colcombet, and Michael Vanden Boom. The complexity of boundedness for guarded logics. In 30th Annual ACM/IEEE Symposium on Logic in Computer Science, LICS 2015, Kyoto, Japan, July 6-10, 2015, pages 293–304. IEEE Computer Society, 2015. doi:10.1109/LICS.2015.36.
[9] Meghyn Bienvenu, Balder ten Cate, Carsten Lutz, and Frank Wolter. Ontology-based data access: A study through disjunctive datalog, csp, and MMSNP. ACM Trans. Database Syst., 39(4):33:1–33:44, 2014. doi:10.1145/2661643.
[10] Sebastian Brandt, Elem Güzel Kalayci, Vladislav Ryzhikov, Guohui Xiao, and Michael Zakharyaschev. Querying log data with metric temporal logic. J. Artif. Intell. Res., 62:829–877, 2018. doi:10.1613/jair.1.11229.
[11] Diego Calvanese, Giuseppe De Giacomo, Domenico Lembo, Maurizio Lenzerini, and Riccardo Rosati. Tractable reasoning and efficient query answering in description logics: The DL-Lite family. J. Autom. Reason., 39(3):385–429, 2007. doi:10.1007/s10817-007-9078-x.
[12] Jan Chomicki. Polynomial time query processing in temporal deductive databases. In Daniel J. Rosenkrantz and Yehoshua Sagiv, editors, Proceedings of the Ninth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, April 2-4, 1990, Nashville, Tennessee, USA, pages 379–391. ACM Press, 1990. doi:10.1145/298514.298589.
[13] Jan Chomicki and Tomasz Imielinski. Temporal deductive databases and infinite objects. In Chris Edmondson-Yurkanan and Mihalis Yannakakis, editors, Proceedings of the Seventh ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, March 21-23, 1988, Austin, Texas, USA, pages 61–73. ACM, 1988. doi:10.1145/308386.308416.
[14] Jan Chomicki, David Toman, and Michael H. Böhlen. Querying ATSQL databases with temporal logic. ACM Trans. Database Syst., 26(2):145–178, 2001. doi:10.1145/383891.383892.
[15] Stavros S. Cosmadakis, Haim Gaifman, Paris C. Kanellakis, and Moshe Y. Vardi. Decidable optimization problems for database logic programs (preliminary report). In Janos Simon, editor, Proceedings of the 20th Annual ACM Symposium on Theory of Computing, May 2-4, 1988, Chicago, Illinois, USA, pages 477–490. ACM, 1988. doi:10.1145/62212.62259.
[16] Evgeny Dantsin, Thomas Eiter, Georg Gottlob, and Andrei Voronkov. Complexity and expressive power of logic programming. ACM Comput. Surv., 33(3):374–425, 2001. doi:10.1145/502807.502810.
[17] Stéphane Demri, Valentin Goranko, and Martin Lange. Temporal Logics in Computer Science: Finite-State Systems. Cambridge Tracts in Theoretical Computer Science. Cambridge University Press, 2016. doi:10.1017/CBO9781139236119.
[18] Cristina Feier, Antti Kuusisto, and Carsten Lutz. Rewritability in monadic disjunctive datalog, mmsnp, and expressive description logics. Log. Methods Comput. Sci., 15(2), 2019. doi:10.23638/LMCS-15(2:15)2019.
[19] Valeria Fionda and Giuseppe Pirrò. Characterizing evolutionary trends in temporal knowledge graphs with linear temporal logic. In Jingrui He, Themis Palpanas, Xiaohua Hu, Alfredo Cuzzocrea, Dejing Dou, Dominik Slezak, Wei Wang, Aleksandra Gruca, Jerry Chun-Wei Lin, and Rakesh Agrawal, editors, IEEE International Conference on Big Data, BigData 2023, Sorrento, Italy, December 15-18, 2023, pages 2907–2909. IEEE, 2023. doi:10.1109/BigData59044.2023.10386573.
[20] International Organization for Standardization. Information technology — database languages — sql — part 2: Foundation (sql/foundation), 1999. URL: https://www.iso.org/standard/23532.html.
[21] Nadime Francis, Alastair Green, Paolo Guagliardo, Leonid Libkin, Tobias Lindaaker, Victor Marsault, Stefan Plantikow, Mats Rydberg, Petra Selmer, and Andrés Taylor. Cypher: An evolving query language for property graphs. In Gautam Das, Christopher M. Jermaine, and Philip A. Bernstein, editors, Proceedings of the 2018 International Conference on Management of Data, SIGMOD Conference 2018, Houston, TX, USA, June 10-15, 2018, pages 1433–1445. ACM, 2018. doi:10.1145/3183713.3190657.
[22] Víctor Gutiérrez-Basulto, Jean Christoph Jung, and Roman Kontchakov. Temporalized EL ontologies for accessing temporal data: Complexity of atomic queries. In Subbarao Kambhampati, editor, Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, IJCAI 2016, New York, NY, USA, 9-15 July 2016, pages 1102–1108. IJCAI/AAAI Press, 2016. URL: http://www.ijcai.org/Abstract/16/160.
[23] Steve Harris and Andy Seaborne. Sparql 1.1 query language. https://www.w3.org/TR/sparql11-query/, 2013. W3C Recommendation, 21 March 2013. URL: https://www.w3.org/TR/sparql11-query/.
[24] Gerd G. Hillebrand, Paris C. Kanellakis, Harry G. Mairson, and Moshe Y. Vardi. Undecidable boundedness problems for datalog programs. J. Log. Program., 25(2):163–190, 1995. doi:10.1016/0743-1066(95)00051-K.
[25] Ismail Husein, Herman Mawengkang, Saib Suwilo, and Mardiningsih. Modeling the transmission of infectious disease in a dynamic network. Journal of Physics: Conference Series, 1255(1):012052, August 2019. URL: https://dx.doi.org/10.1088/1742-6596/1255/1/012052.
[26] Neil Immerman. Descriptive complexity. Graduate texts in computer science. Springer, 1999. doi:10.1007/978-1-4612-0539-5.
[27] Paris C. Kanellakis. Elements of relational database theory. In Jan van Leeuwen, editor, Handbook of Theoretical Computer Science, Volume B: Formal Models and Semantics, pages 1073–1156. Elsevier and MIT Press, 1990. doi:10.1016/b978-0-444-88074-1.50022-6.
[28] Kevin Cullinane. Modeling dynamic transportation networks: Bin ran and david boyce springer 1996 isbn 3540611398. Journal of Transport Geography, 6(1):76–78, 1998. doi:10.1016/S0966-6923(98)90041-2.
[29] Stanislav Kikot, Agi Kurucz, Vladimir V. Podolskii, and Michael Zakharyaschev. Deciding boundedness of monadic sirups. In Leonid Libkin, Reinhard Pichler, and Paolo Guagliardo, editors, PODS’21: Proceedings of the 40th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, Virtual Event, China, June 20-25, 2021, pages 370–387. ACM, 2021. doi:10.1145/3452021.3458332.
[30] Ron Koymans. Specifying real-time properties with metric temporal logic. Real Time Syst., 2(4):255–299, 1990. doi:10.1007/BF01995674.
[31] Agi Kurucz, Vladislav Ryzhikov, Yury Savateev, and Michael Zakharyaschev. Deciding fo-rewritability of regular languages and ontology-mediated queries in linear temporal logic. J. Artif. Intell. Res., 76:645–703, 2023. doi:10.1613/jair.1.14061.
[32] Ling Liu and M. Tamer Özsu, editors. Encyclopedia of Database Systems, Second Edition. Springer, 2018. doi:10.1007/978-1-4614-8265-9.
[33] Marvin L. Minsky. Computation: finite and infinite machines. Prentice-Hall, Inc., USA, 1967. URL: https://dl.acm.org/doi/book/10.5555/1095587.
[34] Jeffrey F. Naughton. Data independent recursion in deductive databases. J. Comput. Syst. Sci., 38(2):259–289, 1989. doi:10.1016/0022-0000(89)90003-2.
[35] Juan L. Reutter, Adrián Soto, and Domagoj Vrgoc. Recursion in SPARQL. Semantic Web, 12(5):711–740, 2021. doi:10.3233/SW-200401.
[36] Vladislav Ryzhikov, Przemyslaw Andrzej Walega, and Michael Zakharyaschev. Data complexity and rewritability of ontology-mediated queries in metric temporal logic under the event-based semantics. In Sarit Kraus, editor, Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI 2019, Macao, China, August 10-16, 2019, pages 1851–1857. ijcai.org, 2019. doi:10.24963/ijcai.2019/256.
[37] Benjamin Schäfer, Dirk Witthaut, Marc Timme, and Vito Latora. Dynamically induced cascading failures in power grids. Nature Communications, 9(1):1975, May 2018. doi:10.1038/s41467-018-04287-5.
[38] Brian Skyrms and Robin Pemantle. A dynamic model of social network formation. Proceedings of the National Academy of Sciences, 97(16):9340–9346, 2000. arXiv:https://www.pnas.org/doi/pdf/10.1073/pnas.97.16.9340.
[39] Richard T. Snodgrass, Ilsoo Ahn, Gad Ariav, Don S. Batory, James Clifford, Curtis E. Dyreson, Ramez Elmasri, Fabio Grandi, Christian S. Jensen, Wolfgang Käfer, Nick Kline, Krishna G. Kulkarni, T. Y. Cliff Leung, Nikos A. Lorentzos, John F. Roddick, Arie Segev, Michael D. Soo, and Suryanarayana M. Sripada. TSQL2 language specification. SIGMOD Rec., 23(1):65–86, 1994. doi:10.1145/181550.181562.
[40] Howard Straubing. Finite Automata, Formal Logic, and Circuit Complexity. Birkhäuser, Boston, MA, 1994. URL: http://link.springer.com/10.1007/978-1-4612-0289-9.
[41] David Toman. Point vs. interval-based query languages for temporal databases. In Richard Hull, editor, Proceedings of the Fifteenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, June 3-5, 1996, Montreal, Canada, pages 58–67. ACM Press, 1996. doi:10.1145/237661.237676.
[42] Valentina Urzua and Claudio Gutierrez. Linear recursion in G-CORE. In Aidan Hogan and Tova Milo, editors, Proceedings of the 13th Alberto Mendelzon International Workshop on Foundations of Data Management, Asunción, Paraguay, June 3-7, 2019, volume 2369 of CEUR Workshop Proceedings. CEUR-WS.org, 2019. URL: https://ceur-ws.org/Vol-2369/short07.pdf.
[43] Ron van der Meyden. Predicate boundedness of linear monadic datalog is in PSPACE. Int. J. Found. Comput. Sci., 11(4):591–612, 2000. doi:10.1142/S0129054100000351.
[44] Dingmin Wang, Pan Hu, Przemyslaw Andrzej Walega, and Bernardo Cuenca Grau. Meteor: Practical reasoning in datalog with metric temporal operators. In Thirty-Sixth AAAI Conference on Artificial Intelligence, AAAI 2022, Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence, IAAI 2022, The Twelveth Symposium on Educational Advances in Artificial Intelligence, EAAI 2022 Virtual Event, February 22 - March 1, 2022, pages 5906–5913. AAAI Press, 2022. doi:10.1609/aaai.v36i5.20535.
[45] Mincheng Wu, Chao Li, Zhangchong Shen, Shibo He, Lingling Tang, Jie Zheng, Yi Fang, Kehan Li, Yanggang Cheng, Zhiguo Shi, Guoping Sheng, Yu Liu, Jinxing Zhu, Xinjiang Ye, Jinlai Chen, Wenrong Chen, Lanjuan Li, Youxian Sun, and Jiming Chen. Use of temporal contact graphs to understand the evolution of covid-19 through contact tracing data. Communications Physics, 5(1):270, 2022. doi:10.1038/s42005-022-01045-4.
[46] Mengkai Xu, Srinivasan Radhakrishnan, Sagar Kamarthi, and Xiaoning Jin. Resiliency of mutualistic supplier-manufacturer networks. Scientific Reports, 9, September 2019. doi:10.1038/s41598-019-49932-1.
[47] Dmitriy Zhuk. $\prod$ ${}_{\mbox{2}}$ ${}^{\mbox{p}}$ vs pspace dichotomy for the quantified constraint satisfaction problem. In 65th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2024, Chicago, IL, USA, October 27-30, 2024, pages 560–572. IEEE, 2024. doi:10.1109/FOCS61266.2024.00043.

[bib.bib1] [1] Rajeev Alur and Thomas A. Henzinger. Real-time logics: Complexity and expressiveness. Inf. Comput., 104(1):35–77, 1993. doi:10.1006/inco.1993.1025.

[bib.bib2] [2] Peter Alvaro, William R. Marczak, Neil Conway, Joseph M. Hellerstein, David Maier, and Russell Sears. Dedalus: Datalog in time and space. In Oege de Moor, Georg Gottlob, Tim Furche, and Andrew Jon Sellers, editors, Datalog Reloaded - First International Workshop, Datalog 2010, Oxford, UK, March 16-19, 2010. Revised Selected Papers, volume 6702 of Lecture Notes in Computer Science, pages 262–281. Springer, 2010. doi:10.1007/978-3-642-24206-9_16.

[bib.bib3] [3] Alessandro Artale, Diego Calvanese, Roman Kontchakov, and Michael Zakharyaschev. The dl-lite family and relations. J. Artif. Intell. Res., 36:1–69, 2009. doi:10.1613/jair.2820.

[bib.bib4] [4] Alessandro Artale, Anton Gnatenko, Vladislav Ryzhikov, and Michael Zakharyaschev. On deciding the data complexity of answering linear monadic datalog queries with LTL operators (extended version), 2025. URL: https://arxiv.org/abs/2501.13762.

[bib.bib5] [5] Alessandro Artale, Roman Kontchakov, Alisa Kovtunova, Vladislav Ryzhikov, Frank Wolter, and Michael Zakharyaschev. First-order rewritability of ontology-mediated queries in linear temporal logic. Artif. Intell., 299:103536, 2021. doi:10.1016/j.artint.2021.103536.

[bib.bib6] [6] Alessandro Artale, Roman Kontchakov, Vladislav Ryzhikov, and Michael Zakharyaschev. The complexity of clausal fragments of LTL. In Kenneth L. McMillan, Aart Middeldorp, and Andrei Voronkov, editors, Logic for Programming, Artificial Intelligence, and Reasoning - 19th International Conference, LPAR-19, Stellenbosch, South Africa, December 14-19, 2013. Proceedings, volume 8312 of Lecture Notes in Computer Science, pages 35–52. Springer, 2013. doi:10.1007/978-3-642-45221-5_3.

[bib.bib7] [7] Alessandro Artale, Roman Kontchakov, Vladislav Ryzhikov, and Michael Zakharyaschev. A cookbook for temporal conceptual data modelling with description logics. ACM Trans. Comput. Log., 15(3):25:1–25:50, 2014. doi:10.1145/2629565.

[bib.bib8] [8] Michael Benedikt, Balder ten Cate, Thomas Colcombet, and Michael Vanden Boom. The complexity of boundedness for guarded logics. In 30th Annual ACM/IEEE Symposium on Logic in Computer Science, LICS 2015, Kyoto, Japan, July 6-10, 2015, pages 293–304. IEEE Computer Society, 2015. doi:10.1109/LICS.2015.36.

[bib.bib9] [9] Meghyn Bienvenu, Balder ten Cate, Carsten Lutz, and Frank Wolter. Ontology-based data access: A study through disjunctive datalog, csp, and MMSNP. ACM Trans. Database Syst., 39(4):33:1–33:44, 2014. doi:10.1145/2661643.

[bib.bib10] [10] Sebastian Brandt, Elem Güzel Kalayci, Vladislav Ryzhikov, Guohui Xiao, and Michael Zakharyaschev. Querying log data with metric temporal logic. J. Artif. Intell. Res., 62:829–877, 2018. doi:10.1613/jair.1.11229.

[bib.bib11] [11] Diego Calvanese, Giuseppe De Giacomo, Domenico Lembo, Maurizio Lenzerini, and Riccardo Rosati. Tractable reasoning and efficient query answering in description logics: The DL-Lite family. J. Autom. Reason., 39(3):385–429, 2007. doi:10.1007/s10817-007-9078-x.

[bib.bib12] [12] Jan Chomicki. Polynomial time query processing in temporal deductive databases. In Daniel J. Rosenkrantz and Yehoshua Sagiv, editors, Proceedings of the Ninth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, April 2-4, 1990, Nashville, Tennessee, USA, pages 379–391. ACM Press, 1990. doi:10.1145/298514.298589.

[bib.bib13] [13] Jan Chomicki and Tomasz Imielinski. Temporal deductive databases and infinite objects. In Chris Edmondson-Yurkanan and Mihalis Yannakakis, editors, Proceedings of the Seventh ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, March 21-23, 1988, Austin, Texas, USA, pages 61–73. ACM, 1988. doi:10.1145/308386.308416.

[bib.bib14] [14] Jan Chomicki, David Toman, and Michael H. Böhlen. Querying ATSQL databases with temporal logic. ACM Trans. Database Syst., 26(2):145–178, 2001. doi:10.1145/383891.383892.

[bib.bib15] [15] Stavros S. Cosmadakis, Haim Gaifman, Paris C. Kanellakis, and Moshe Y. Vardi. Decidable optimization problems for database logic programs (preliminary report). In Janos Simon, editor, Proceedings of the 20th Annual ACM Symposium on Theory of Computing, May 2-4, 1988, Chicago, Illinois, USA, pages 477–490. ACM, 1988. doi:10.1145/62212.62259.

[bib.bib16] [16] Evgeny Dantsin, Thomas Eiter, Georg Gottlob, and Andrei Voronkov. Complexity and expressive power of logic programming. ACM Comput. Surv., 33(3):374–425, 2001. doi:10.1145/502807.502810.

[bib.bib17] [17] Stéphane Demri, Valentin Goranko, and Martin Lange. Temporal Logics in Computer Science: Finite-State Systems. Cambridge Tracts in Theoretical Computer Science. Cambridge University Press, 2016. doi:10.1017/CBO9781139236119.

[bib.bib18] [18] Cristina Feier, Antti Kuusisto, and Carsten Lutz. Rewritability in monadic disjunctive datalog, mmsnp, and expressive description logics. Log. Methods Comput. Sci., 15(2), 2019. doi:10.23638/LMCS-15(2:15)2019.

[bib.bib19] [19] Valeria Fionda and Giuseppe Pirrò. Characterizing evolutionary trends in temporal knowledge graphs with linear temporal logic. In Jingrui He, Themis Palpanas, Xiaohua Hu, Alfredo Cuzzocrea, Dejing Dou, Dominik Slezak, Wei Wang, Aleksandra Gruca, Jerry Chun-Wei Lin, and Rakesh Agrawal, editors, IEEE International Conference on Big Data, BigData 2023, Sorrento, Italy, December 15-18, 2023, pages 2907–2909. IEEE, 2023. doi:10.1109/BigData59044.2023.10386573.

[bib.bib20] [20] International Organization for Standardization. Information technology — database languages — sql — part 2: Foundation (sql/foundation), 1999. URL: https://www.iso.org/standard/23532.html.

[bib.bib21] [21] Nadime Francis, Alastair Green, Paolo Guagliardo, Leonid Libkin, Tobias Lindaaker, Victor Marsault, Stefan Plantikow, Mats Rydberg, Petra Selmer, and Andrés Taylor. Cypher: An evolving query language for property graphs. In Gautam Das, Christopher M. Jermaine, and Philip A. Bernstein, editors, Proceedings of the 2018 International Conference on Management of Data, SIGMOD Conference 2018, Houston, TX, USA, June 10-15, 2018, pages 1433–1445. ACM, 2018. doi:10.1145/3183713.3190657.

[bib.bib22] [22] Víctor Gutiérrez-Basulto, Jean Christoph Jung, and Roman Kontchakov. Temporalized EL ontologies for accessing temporal data: Complexity of atomic queries. In Subbarao Kambhampati, editor, Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, IJCAI 2016, New York, NY, USA, 9-15 July 2016, pages 1102–1108. IJCAI/AAAI Press, 2016. URL: http://www.ijcai.org/Abstract/16/160.

[bib.bib23] [23] Steve Harris and Andy Seaborne. Sparql 1.1 query language. https://www.w3.org/TR/sparql11-query/, 2013. W3C Recommendation, 21 March 2013. URL: https://www.w3.org/TR/sparql11-query/.

[bib.bib24] [24] Gerd G. Hillebrand, Paris C. Kanellakis, Harry G. Mairson, and Moshe Y. Vardi. Undecidable boundedness problems for datalog programs. J. Log. Program., 25(2):163–190, 1995. doi:10.1016/0743-1066(95)00051-K.

[bib.bib25] [25] Ismail Husein, Herman Mawengkang, Saib Suwilo, and Mardiningsih. Modeling the transmission of infectious disease in a dynamic network. Journal of Physics: Conference Series, 1255(1):012052, August 2019. URL: https://dx.doi.org/10.1088/1742-6596/1255/1/012052.

[bib.bib26] [26] Neil Immerman. Descriptive complexity. Graduate texts in computer science. Springer, 1999. doi:10.1007/978-1-4612-0539-5.

[bib.bib27] [27] Paris C. Kanellakis. Elements of relational database theory. In Jan van Leeuwen, editor, Handbook of Theoretical Computer Science, Volume B: Formal Models and Semantics, pages 1073–1156. Elsevier and MIT Press, 1990. doi:10.1016/b978-0-444-88074-1.50022-6.

[bib.bib28] [28] Kevin Cullinane. Modeling dynamic transportation networks: Bin ran and david boyce springer 1996 isbn 3540611398. Journal of Transport Geography, 6(1):76–78, 1998. doi:10.1016/S0966-6923(98)90041-2.

[bib.bib29] [29] Stanislav Kikot, Agi Kurucz, Vladimir V. Podolskii, and Michael Zakharyaschev. Deciding boundedness of monadic sirups. In Leonid Libkin, Reinhard Pichler, and Paolo Guagliardo, editors, PODS’21: Proceedings of the 40th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, Virtual Event, China, June 20-25, 2021, pages 370–387. ACM, 2021. doi:10.1145/3452021.3458332.

[bib.bib30] [30] Ron Koymans. Specifying real-time properties with metric temporal logic. Real Time Syst., 2(4):255–299, 1990. doi:10.1007/BF01995674.

[bib.bib31] [31] Agi Kurucz, Vladislav Ryzhikov, Yury Savateev, and Michael Zakharyaschev. Deciding fo-rewritability of regular languages and ontology-mediated queries in linear temporal logic. J. Artif. Intell. Res., 76:645–703, 2023. doi:10.1613/jair.1.14061.

[bib.bib32] [32] Ling Liu and M. Tamer Özsu, editors. Encyclopedia of Database Systems, Second Edition. Springer, 2018. doi:10.1007/978-1-4614-8265-9.

[bib.bib33] [33] Marvin L. Minsky. Computation: finite and infinite machines. Prentice-Hall, Inc., USA, 1967. URL: https://dl.acm.org/doi/book/10.5555/1095587.

[bib.bib34] [34] Jeffrey F. Naughton. Data independent recursion in deductive databases. J. Comput. Syst. Sci., 38(2):259–289, 1989. doi:10.1016/0022-0000(89)90003-2.

[bib.bib35] [35] Juan L. Reutter, Adrián Soto, and Domagoj Vrgoc. Recursion in SPARQL. Semantic Web, 12(5):711–740, 2021. doi:10.3233/SW-200401.

[bib.bib36] [36] Vladislav Ryzhikov, Przemyslaw Andrzej Walega, and Michael Zakharyaschev. Data complexity and rewritability of ontology-mediated queries in metric temporal logic under the event-based semantics. In Sarit Kraus, editor, Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI 2019, Macao, China, August 10-16, 2019, pages 1851–1857. ijcai.org, 2019. doi:10.24963/ijcai.2019/256.

[bib.bib37] [37] Benjamin Schäfer, Dirk Witthaut, Marc Timme, and Vito Latora. Dynamically induced cascading failures in power grids. Nature Communications, 9(1):1975, May 2018. doi:10.1038/s41467-018-04287-5.

[bib.bib38] [38] Brian Skyrms and Robin Pemantle. A dynamic model of social network formation. Proceedings of the National Academy of Sciences, 97(16):9340–9346, 2000. arXiv:https://www.pnas.org/doi/pdf/10.1073/pnas.97.16.9340.

[bib.bib39] [39] Richard T. Snodgrass, Ilsoo Ahn, Gad Ariav, Don S. Batory, James Clifford, Curtis E. Dyreson, Ramez Elmasri, Fabio Grandi, Christian S. Jensen, Wolfgang Käfer, Nick Kline, Krishna G. Kulkarni, T. Y. Cliff Leung, Nikos A. Lorentzos, John F. Roddick, Arie Segev, Michael D. Soo, and Suryanarayana M. Sripada. TSQL2 language specification. SIGMOD Rec., 23(1):65–86, 1994. doi:10.1145/181550.181562.

[bib.bib40] [40] Howard Straubing. Finite Automata, Formal Logic, and Circuit Complexity. Birkhäuser, Boston, MA, 1994. URL: http://link.springer.com/10.1007/978-1-4612-0289-9.

[bib.bib41] [41] David Toman. Point vs. interval-based query languages for temporal databases. In Richard Hull, editor, Proceedings of the Fifteenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, June 3-5, 1996, Montreal, Canada, pages 58–67. ACM Press, 1996. doi:10.1145/237661.237676.

[bib.bib42] [42] Valentina Urzua and Claudio Gutierrez. Linear recursion in G-CORE. In Aidan Hogan and Tova Milo, editors, Proceedings of the 13th Alberto Mendelzon International Workshop on Foundations of Data Management, Asunción, Paraguay, June 3-7, 2019, volume 2369 of CEUR Workshop Proceedings. CEUR-WS.org, 2019. URL: https://ceur-ws.org/Vol-2369/short07.pdf.

[bib.bib43] [43] Ron van der Meyden. Predicate boundedness of linear monadic datalog is in PSPACE. Int. J. Found. Comput. Sci., 11(4):591–612, 2000. doi:10.1142/S0129054100000351.

[bib.bib44] [44] Dingmin Wang, Pan Hu, Przemyslaw Andrzej Walega, and Bernardo Cuenca Grau. Meteor: Practical reasoning in datalog with metric temporal operators. In Thirty-Sixth AAAI Conference on Artificial Intelligence, AAAI 2022, Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence, IAAI 2022, The Twelveth Symposium on Educational Advances in Artificial Intelligence, EAAI 2022 Virtual Event, February 22 - March 1, 2022, pages 5906–5913. AAAI Press, 2022. doi:10.1609/aaai.v36i5.20535.

[bib.bib45] [45] Mincheng Wu, Chao Li, Zhangchong Shen, Shibo He, Lingling Tang, Jie Zheng, Yi Fang, Kehan Li, Yanggang Cheng, Zhiguo Shi, Guoping Sheng, Yu Liu, Jinxing Zhu, Xinjiang Ye, Jinlai Chen, Wenrong Chen, Lanjuan Li, Youxian Sun, and Jiming Chen. Use of temporal contact graphs to understand the evolution of covid-19 through contact tracing data. Communications Physics, 5(1):270, 2022. doi:10.1038/s42005-022-01045-4.

[bib.bib46] [46] Mengkai Xu, Srinivasan Radhakrishnan, Sagar Kamarthi, and Xiaoning Jin. Resiliency of mutualistic supplier-manufacturer networks. Scientific Reports, 9, September 2019. doi:10.1038/s41598-019-49932-1.

[bib.bib47] [47] Dmitriy Zhuk. $\prod$ ${}_{\mbox{2}}$ ${}^{\mbox{p}}$ vs pspace dichotomy for the quantified constraint satisfaction problem. In 65th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2024, Chicago, IL, USA, October 27-30, 2024, pages 560–572. IEEE, 2024. doi:10.1109/FOCS61266.2024.00043.

On Deciding the Data Complexity of Answering Linear Monadic Datalog Queries with LTL Operators

Abstract

Keywords and phrases:

Copyright and License:

2012 ACM Subject Classification:

Related Version:

DOI:

Event:

Editors:

Series and Publisher:

1 Introduction

Example 1.

2 Preliminaries

Proposition 2.

Lemma 3.

2.1 Temporal Datalog

Theorem 4.

Proof (Sketch).

2.1.1 Expansions for Linear Monadic Queries

3 Undecidability for Queries with ◇

Theorem 5.

Lemma 6.

4 Automata-Theoretic Tools for Queries with ○

Example 7.

Example 8.

Example 9.

Lemma 10.

Lemma 11.

Corollary 12.

5 Decidability for Connected Linear Queries with ○

Theorem 13.

Lemma 14.

Lemma 15.

Lemma 16.

Lemma 17.

Proposition 18.

Example 19.

6 Conclusions and Future Work

References

3 Undecidability for Queries with $\Diamond$

4 Automata-Theoretic Tools for Queries with $\bigcirc$

5 Decidability for Connected Linear Queries with $\bigcirc$