Abstract 1 Introduction 2 Preliminaries 3 The complexity of deciding characteristic formulae modulo preorders 4 Finding characteristic formulae: The gap between trace simulation and the other preorders 5 A note on deciding characteristic formulae modulo equivalence relations 6 Conclusions References

The Complexity of Deciding Characteristic Formulae in Van Glabbeek’s Branching-Time Spectrum

Luca Aceto ORCID Department of Computer Science, Reykjavik University, Iceland
Gran Sasso Science Institute, L’Aquila, Italy
Antonis Achilleos ORCID Department of Computer Science, Reykjavik University, Iceland Aggeliki Chalki ORCID Department of Computer Science, Reykjavik University, Iceland Anna Ingólfsdóttir ORCID Department of Computer Science, Reykjavik University, Iceland
Abstract

Characteristic formulae give a complete logical description of the behaviour of processes modulo some chosen notion of behavioural semantics. They allow one to reduce equivalence or preorder checking to model checking, and are exactly the formulae in the modal logics characterizing classic behavioural equivalences and preorders for which model checking can be reduced to equivalence or preorder checking.

This paper studies the complexity of determining whether a formula is characteristic for some process in each of the logics providing modal characterizations of the simulation-based semantics in van Glabbeek’s branching-time spectrum. Since characteristic formulae in each of those logics are exactly the satisfiable and prime ones, this article presents complexity results for the satisfiability and primality problems, and investigates the boundary between modal logics for which those problems can be solved in polynomial time and those for which they become computationally hard.

Amongst other contributions, this article also studies the complexity of constructing characteristic formulae in the modal logics characterizing simulation-based semantics, both when such formulae are presented in explicit form and via systems of equations.

Keywords and phrases:
Characteristic formulae, prime formulae, bisimulation, simulation relations, modal logics, complexity theory, satisfiability
Copyright and License:
[Uncaptioned image] © Luca Aceto, Antonis Achilleos, Aggeliki Chalki, and Anna Ingólfsdóttir; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Modal and temporal logics
; Theory of computation Complexity theory and logic
Related Version:
Full Version: https://doi.org/10.48550/arXiv.2405.13697 [1]
Acknowledgements:
The authors thank the anonymous reviewers for comments that led to improvements in the paper. This paper is dedicated to the memory of Rance Cleaveland (1961–2024), who used characteristic formulae to compute behavioural relations, logically and efficiently.
Funding:
This work has been funded by the projects “Open Problems in the Equational Logic of Processes (OPEL)” (grant no. 196050), “Mode(l)s of Verification and Monitorability” (MoVeMnt) (grant no. 217987), and “Learning and Applying Probabilistic Systems” (grant no. 206574-051) of the Icelandic Research Fund.
Editors:
Jörg Endrullis and Sylvain Schmitz

1 Introduction

Several notions of behavioural relations have been proposed in concurrency theory to describe when one process is a suitable implementation of another. Many such relations have been catalogued by van Glabbeek in his seminal linear-time/branching-time spectrum [22], together with a variety of alternative ways of describing them including testing scenarios and axiom systems. To our mind, modal characterizations of behavioural equivalences and preorders are some of the most classic and pleasing results in concurrency theory – see, for instance, [25] for the seminal Hennessy-Milner theorem and [12, 16, 17, 22] for similar results for other relations in van Glabbeek’s spectrum and other settings. By way of example, in their archetypal modal characterization of bisimilarity, Hennessy and Milner have shown in [25] that, under a mild finiteness condition, two processes are bisimilar if, and only if, they satisfy the same formulae in a multi-modal logic that is now often called Hennessy-Milner logic. Apart from its intrinsic theoretical interest, this seminal logical characterization of bisimilarity means that, when two processes are not bisimilar, there is always a formula that distinguishes between them. Such a formula describes a reason why the two processes are not bisimilar, provides useful debugging information and can be algorithmically constructed over finite processes – see, for instance, [8, 14] and [35], where Martens and Groote show that, in general, computing minimal distinguishing Hennessy-Milner formulae is 𝖭𝖯-hard.

On the other hand, the Hennessy-Milner theorem seems to be less useful to show that two processes are bisimilar, since that would involve verifying that they satisfy the same formulae, and there are infinitely many of those. However, as shown in works such as [3, 6, 12, 23, 39], the logics that underlie classic modal characterization theorems for equivalences and preorders over processes allow one to express characteristic formulae. Intuitively, a characteristic formula χ(p) for a process p gives a complete logical characterization of the behaviour of p modulo the behavioural semantics of interest , in the sense that any process is related to p with respect to if, and only if, it satisfies χ(p).111Formulae akin to characteristic ones first occurred in the study of equivalence of structures using first-order formulae up to some quantifier rank. See, for example, the survey paper [40] and the textbook [20]. The existence of formulae in first-order logic with counting that characterize graphs up to isomorphism has significantly contributed to the study of the complexity of the Graph Isomorphism problem – see, for instance, [13, 30]. Since the formula χ(p) can be constructed from p, characteristic formulae reduce the problem of checking whether a process q is related to p by to a model checking problem, viz. whether q satisfies χ(p). See, for instance, the classic reference [15] for applications of this approach.

Characteristic formulae, thus, allow one to reduce equivalence and preorder checking to model checking. But what model checking problems can be reduced to equivalence/preorder checking ones? To the best of our knowledge, that question was first studied by Boudol and Larsen in [11] in the setting of modal refinement over modal transition systems. See [3, 4] for other contributions in that line of research. The aforementioned articles showed that characteristic formulae coincide with those that are satisfiable and prime. (A formula is prime if whenever it entails a disjunction φ1φ2, then it must entail φ1 or φ2.) Moreover, characteristic formulae with respect to bisimilarity coincide with the formulae that are satisfiable and complete [7]. (A modal formula is complete if, for each formula φ, it entails either φ or its negation.) The aforementioned results give semantic characterizations of the formulae that are characteristic within the logics that correspond to the behavioural semantics in van Glabbeek’s spectrum. Those characterizations tell us for what logical specifications model checking can be reduced to equivalence or preorder checking. However, given a specification expressed as a modal formula, can one decide whether that formula is characteristic and therefore can be model checked using algorithms for behavioural equivalences or preorders? And, if so, what is the complexity of checking whether a formula is characteristic? Perhaps surprisingly, those questions were not addressed in the literature until the recent papers [2, 7], where it is shown that, in the setting of the modal logics that characterize bisimilarity over natural classes of Kripke structures and labelled transition systems, the problem of checking whether a formula is characteristic for some process modulo bisimilarity is computationally hard and, typically, has the same complexity as validity checking, which is 𝖯𝖲𝖯𝖠𝖢𝖤-complete for Hennessy-Milner logic and 𝖤𝖷𝖯-complete for its extension with fixed-point operators [26, 33] and the μ-calculus [31].

The aforementioned hardness results for the logics characterizing bisimilarity tell us that deciding whether a formula is characteristic in bisimulation semantics is computationally hard. But what about the less expressive logics that characterize the coarser semantics in van Glabbeek’s spectrum? And for what logics characterizing relations in the spectrum does computational hardness manifest itself? Finally, what is the complexity of computing a characteristic formula for a process?

The aim of this paper is to answer the aforementioned questions for some of the simulation-based semantics in the spectrum. In particular, we study the complexity of determining whether a formula is characteristic modulo the simulation [36], complete simulation and ready simulation preorders [10, 34], as well as the trace simulation and the n-nested simulation preorders [24]. Since characteristic formulae are exactly the satisfiable and prime ones for each behavioural relation in van Glabbeek’s spectrum [3], the above-mentioned tasks naturally break down into studying the complexity of satisfiability and primality checking for formulae in the fragments of Hennessy-Milner logic that characterize those preorders. By using a reduction to the, seemingly unrelated, reachability problem in alternating graphs, as defined by Immerman in [28, Definition 3.24], we discover that both those problems are decidable in polynomial time for the simulation and the complete simulation preorders, as well as for the ready simulation preorder when the set of actions has constant size. On the other hand, when the set of actions is unbounded (that is, it is an input of the algorithmic problem at hand), the problems of checking satisfiability and primality for formulae in the logic characterizing the ready simulation preorder are 𝖭𝖯-complete and 𝖼𝗈𝖭𝖯-complete respectively. We also show that deciding whether a formula is characteristic in that setting is 𝖴𝖲-hard [9] (that is, it is at least as hard as the problem of deciding whether a given Boolean formula has exactly one satisfying truth assignment) and belongs to 𝖣𝖯, which is the class of languages that are the intersection of one language in 𝖭𝖯 and of one in 𝖼𝗈𝖭𝖯 [38].222The class 𝖣𝖯 contains both 𝖭𝖯 and 𝖼𝗈𝖭𝖯, and is contained in the class of problems that can be solved in polynomial time with an 𝖭𝖯 oracle. These negative results are in stark contrast with the positive results for the simulation and the complete simulation preorder, and indicate that augmenting the logic characterizing the simulation preorder with formulae that state that a process cannot perform a given action suffices to make satisfiability and primality checking computationally hard. In passing, we also prove that, in the presence of at least two actions, (1) for the logics characterizing the trace simulation and 2-nested simulation preorders, satisfiability and primality checking are 𝖭𝖯-complete and 𝖼𝗈𝖭𝖯-hard respectively, and deciding whether a formula is characteristic is 𝖴𝖲-hard, (2) for the logic that characterizes the trace simulation preorder, deciding whether a formula is characteristic is fixed-parameter tractable [18], with the modal depth of the input formula as the parameter, when the size of the action set is a constant, and (3) deciding whether a formula is characteristic in the modal logic for the 3-nested simulation preorder [24] is 𝖯𝖲𝖯𝖠𝖢𝖤-hard. (The proof of the last result relies on “simulating” Ladner’s reduction proving the 𝖯𝖲𝖯𝖠𝖢𝖤-hardness of satisfiability for modal logic [32] using the limited alternations of modal operators allowed by the logic for the 3-nested simulation preorder.)

We also study the complexity of computing characteristic formulae for finite, loop-free processes modulo the above-mentioned simulation semantics. To do so, we consider two different representations for formulae, namely an explicit form, where formulae are given by strings of symbols generated by their respective grammars, and a declarative form, where formulae are described by systems of equations. We prove that, even for the coarsest semantics we consider, such as the simulation and complete simulation preorders, computing the characteristic formula in explicit form for a finite, loop-free process cannot be done in polynomial time, unless 𝖯=𝖭𝖯. On the other hand, the characteristic formula for a process modulo the preorders we study, apart from the trace simulation preorder, can be computed in polynomial time if the output is given in declarative form. Intuitively, this is due to the fact that, unlike the explicit form, systems of equations allow for sharing of subformulae and there are formulae for which this sharing leads to an exponentially more concise representation. Finally, in sharp contrast to that result, we prove that, modulo the trace simulation preorder, even if characteristic formulae are always of polynomial declaration size and polynomial equational length, they cannot be efficiently computed unless 𝖯=𝖭𝖯. In passing, we remark that all the aforementioned lower and upper bounds hold also for finite processes with loops, provided that, as done in [6, 29, 39], we add greatest fixed points or systems of equations interpreted as greatest fixed points to the modal logics characterizing the semantics we study in this article.

We summarize our results in Table 2. We provide their proofs in the technical appendices of the full version of the paper [1].

2 Preliminaries

In this paper, we model processes as finite, loop-free labelled transition systems (LTS). A finite LTS is a triple 𝒮=(P,A,), where P is a finite set of states (or processes), A is a finite, non-empty set of actions and P×A×P is a transition relation. As usual, we use paq instead of (p,a,q). For each tA, we write ptq to mean that there is a sequence of transitions labelled with t starting from p and ending at q. An LTS is loop-free iff ptp holds only when t is the empty trace ε. A process q is reachable from p if ptq, for some tA. We define the size of an LTS 𝒮=(P,A,), denoted by |𝒮|, to be |P|+||. The size of a process pP, denoted by |p|, is the cardinality of reach(p)={q|q is reachable from p} plus the cardinality of the set restricted to reach(p). We define the set of initials of p, denoted I(p), as the set {aApap for some pP}. We write pa if aI(p), p↛a if aI(p), and p↛ if I(p)=. A sequence of actions tA is a trace of p if there is a q such that ptq. We denote the set of traces of p by traces(p). The depth of a finite, loop-free process p, denoted by 0pt(p), is the length of a longest trace t of p.

In what follows, we shall often describe finite, loop-free processes using the fragment of Milner’s CCS [37] given by the following grammar:

p::=0a.pp+p,

where aA. For each action a and terms p,p, we write pap iff

  1. (i)

    p=a.p or

  2. (ii)

    p=p1+p2, for some p1,p2, and p1ap or p2ap holds.

In this paper, we consider the following relations in van Glabbeek’s spectrum: simulation, complete simulation, ready simulation, trace simulation, 2-nested simulation, 3-nested simulation, and bisimilarity. Their definitions are given below.

Definition 1 ([37, 22, 3]).

We define each of the following preorders as the largest binary relation over P that satisfies the corresponding condition.

  1. (a)

    Simulation preorder (S): pSq for all pap there exists some qaq such that pSq.

  2. (b)

    Complete simulation (CS): pCSq

    1. (i)

      for all pap there exists some qaq such that pCSq, and

    2. (ii)

      I(p)= iff I(q)=.

  3. (c)

    Ready simulation (RS): pRSq

    1. (i)

      for all pap there exists some qaq such that pRSq, and

    2. (ii)

      I(p)=I(q).

  4. (d)

    Trace simulation (TS): pTSq

    1. (i)

      for all pap there exists some qaq such that pTSq, and

    2. (ii)

      traces(p)=traces(q).

  5. (e)

    n-Nested simulation (nS), where n1, is defined inductively as follows: The 1-nested simulation preorder 1S is S, and the n-nested simulation preorder nS for n>1 is the largest relation such that pnSq

    1. (i)

      for all pap there exists some qaq such that pnSq, and

    2. (ii)

      q(n1)Sp.

  6. (f)

    Bisimilarity (BS): BS is the largest symmetric relation satisfying the condition defining the simulation preorder.

It is well-known that bisimilarity is an equivalence relation and all the other relations are preorders [22, 37]. We sometimes write pq instead of pBS. Moreover, we have that 3S2STSRSCSS – see [22].

Definition 2 (Kernels of the preorders).

For each X{S,CS,RS,TS,2S,3S}, the kernel X of X is the equivalence relation defined thus: for every p,qP, pXq iff pXq and qXp.

Each relation X, where X{S,CS,RS,TS,2S,3S,BS}, is characterized by a fragment X of Hennessy-Milner logic, 𝐇𝐌𝐋, defined as follows [22, 3].

Definition 3.

For X{S,CS,RS,TS,2S,3S,BS}, X is defined by the corresponding grammar given below (aA):

  1. (a)

    S: φS::=𝐭𝐭𝐟𝐟φSφSφSφSaφS.

  2. (b)

    CS: φCS::=𝐭𝐭𝐟𝐟φCSφCSφCSφCSaφCS 0, where 𝟎=aA[a]𝐟𝐟.

  3. (c)

    RS: φRS::=𝐭𝐭𝐟𝐟φRSφRSφRSφRSaφRS[a]𝐟𝐟.

  4. (d)

    TS: φTS::=𝐭𝐭𝐟𝐟φTSφTSφTSφTSaφTSψTS, where ψTS::=𝐟𝐟[a]ψTS.

  5. (e)

    2S: φ2S::=𝐭𝐭𝐟𝐟φ2Sφ2Sφ2Sφ2Saφ2S¬φS.

  6. (f)

    3S: φ3S::=𝐭𝐭𝐟𝐟φ3Sφ3Sφ3Sφ3Saφ3S¬φ2S.

  7. (g)

    𝐇𝐌𝐋 (BS): φBS::=𝐭𝐭𝐟𝐟φBSφBSφBSφBSaφBS[a]φBS¬φBS.

Note that the explicit use of negation in the grammar for BS is unnecessary. However, we included the negation operator explicitly so that BS extends syntactically each of the other modal logics presented in Definition 3.

Given a formula φBS, the modal depth of φ, denoted by md(φ), is the maximum nesting of modal operators in φ. (See [1, Appendix A] for the formal definition.)

Truth in an LTS 𝒮=(P,A,) is defined via the satisfaction relation as follows:

p𝐭𝐭 and p⊧̸𝐟𝐟;
p¬φ iff p⊧̸φ;
pφψ iff both pφ and pψ;
pφψ iff pφ or pψ;
paφ iff there is some paq such that qφ;
p[a]φ iff for all paq it holds that qφ.

If pφ, we say that φ is true, or satisfied, in p. If φ is satisfied in every process in every LTS, we say that φ is valid. Formula φ1 entails φ2, denoted by φ1φ2, if every process that satisfies φ1 also satisfies φ2. Moreover, φ1 and φ2 are logically equivalent, denoted by φ1φ2, if φ1φ2 and φ2φ1. A formula φ is satisfiable if there is a process that satisfies φ. Finally, Sub(φ) denotes the set of subformulae of formula φ.

For BS, we define the dual fragment of to be ¯={φ¬φ}, where ¬𝐭𝐭=𝐟𝐟, ¬𝐟𝐟=𝐭𝐭, ¬(φψ)=¬φ¬ψ, ¬(φψ)=¬φ¬ψ, ¬[a]φ=a¬φ, ¬aφ=[a]¬φ, and ¬¬φ=φ. It is not hard to see that p¬φ iff p⊧̸φ, for every process p. Given a process p, we define (p)={φpφ}. A simplification of the Hennessy-Milner theorem gives a modal characterization of bisimilarity over finite processes. An analogous result is true for every preorder examined in this paper.

Theorem 4 (Hennessy-Milner theorem [25]).

For all processes p,q in a finite LTS, pq iff BS(p)=BS(q).

Proposition 5 ([22, 3]).

Let X{S,CS,RS,TS,2S,3S}. Then pXq iff X(p)X(q), for all p,qP.

 Remark 6.

Neither 𝐟𝐟 nor disjunction are needed in several of the modal characterizations presented in the above result. The reason for adding those constructs to all the logics is that doing so makes our subsequent results more general and uniform. For example, having 𝐟𝐟 and disjunction in all logics allows us to provide algorithms that determine whether a formula in a logic is prime with respect to a sublogic.

Definition 7 ([11, 4]).

Let BS. A formula φBS is prime in if for all φ1,φ2, φφ1φ2 implies φφ1 or φφ2.

When the logic is clear from the context, we say that φ is prime. Note that every unsatisfiable formula is trivially prime in , for every .

Example 8.

The formula a𝐭𝐭 is prime in S. Indeed, let φ1,φ2S and assume that a𝐭𝐭φ1φ2. Since a.0a𝐭𝐭, without loss of generality, we have that a.0φ1. We claim that a𝐭𝐭φ1. To see this, let p be some process such that pa𝐭𝐭 – that is, a process such that pap for some p. It is easy to see that a.0Sp. Since a.0φ1, Proposition 5 yields that pφ1, proving our claim and the primality of a𝐭𝐭. On the other hand, the formula a𝐭𝐭b𝐭𝐭 is not prime in S. Indeed, a𝐭𝐭b𝐭𝐭a𝐭𝐭b𝐭𝐭, but neither a𝐭𝐭b𝐭𝐭a𝐭𝐭 nor a𝐭𝐭b𝐭𝐭b𝐭𝐭 hold.

The definition of a characteristic formula within logic is given next.

Definition 9 ([5, 23, 39]).

Let BS. A formula φ is characteristic for pP within iff, for all qP, it holds that qφ(p)(q). We denote by χ(p) the unique characteristic formula for p with respect to logical equivalence.

 Remark 10.

Let X{S,CS,RS,TS,2S,3S,BS}. In light of Theorem 4 and Proposition 5, a formula φX is characteristic for p within X iff, for all qP, it holds that qφpXq. This property is often used as an alternative definition of characteristic formula for process p modulo X. In what follows, we shall employ the two definitions interchangeably.

In [3, Table 1 and Theorem 5], Aceto, Della Monica, Fabregas, and Ingólfsdóttir presented characteristic formulae for each of the semantics we consider in this paper, and showed that characteristic formulae are exactly the satisfiable and prime ones.

Proposition 11 ([3]).

For every X{S,CS,RS,TS,2S}, φX is characteristic for some process within X iff φ is satisfiable and prime in X.

 Remark 12.

Proposition 11 is the only result we use from [3] and we employ it as a “black box”. The (non-trivial) methods used in the proof of that result given in that reference do not play any role in our technical developments.

We note, in passing, that the article [3] does not deal explicitly with 3S. However, its results apply to all the n-nested simulation preorders.

We can also consider characteristic formulae modulo equivalence relations as follows.

Definition 13.

Let X{S,CS,RS,TS,2S,3S,BS}. A formula φX is characteristic for pP modulo X iff for all qP, it holds that qφX(p)=X(q).333The above definition can also be phrased as follows: A formula φX is characteristic for p modulo X iff, for all qP, it holds that qφpXq. This version of the definition is used, in the setting of bisimilarity, in references such as [2, 29].

When studying the complexity of finding a characteristic formula for some process p with respect to the behavioural relations we have introduced above, we will need some way of measuring the size of the resulting formula as a function of |p|. A formula in X, where X{S,CS,RS,TS,2S,3S,BS}, can be given in explicit form as in Definition 3 or by means of a system of equations. In the latter case, we say that the formula is given in declarative form. For example, formula ϕ=a(a𝐭𝐭b𝐭𝐭)b(a𝐭𝐭b𝐭𝐭) can be represented by the equations ϕ=aϕ1bϕ1 and ϕ1=a𝐭𝐭b𝐭𝐭. We define:

  • the size of formula φ, denoted by |φ|, to be the number of symbols that appear in the explicit form of φ,

  • the declaration size of formula φ, denoted by decl(φ), to be the number of equations that are used in the declarative form of φ, and

  • the equational length of formula φ, denoted by eqlen(φ), to be the maximum number of symbols that appear in an equation in the declarative form of φ.

For example, for the aforementioned formula ϕ, we have that |ϕ|=13, decl(ϕ)=2, and eqlen(ϕ)=5. Note that decl(φ)|Sub(φ)||φ|, for each φ.

3 The complexity of deciding characteristic formulae modulo preorders

In this section, we address the complexity of deciding whether formulae in S, CS, RS, TS, 2S, and 3S are characteristic. Since characteristic formulae in those logics are exactly the satisfiable and prime ones [3, Theorem 5], we study the complexity of checking satisfiability and primality separately in Subsections 3.1 and 3.2.

3.1 The complexity of satisfiability

To address the complexity of the satisfiability problem in S, CS, or RS, we associate a set I(φ)2A to every formula φRS. Intuitively, I(φ) describes all possible sets of initial actions that a process p can have, when pφ.

Definition 14.

Let φRS. We define I(φ) inductively as follows:

  1. (a)

    I(𝐭𝐭)=2A,

  2. (b)

    I(𝐟𝐟)=,

  3. (c)

    I([a]𝐟𝐟)={XXA and aX},

  4. (d)

    I(aφ)={,if I(φ)=,{XXA and aX},otherwise

  5. (e)

    I(φ1φ2)=I(φ1)I(φ2),

  6. (f)

    I(φ1φ2)=I(φ1)I(φ2).

Note that I(𝟎)={}.

Lemma 15.

For every φRS, the following statements hold:

  1. (a)

    for every SA, SI(φ) iff there is a process p such that I(p)=S and pφ.

  2. (b)

    φ is unsatisfiable iff I(φ)=.

When the number of actions is constant, I(φ) can be computed in linear time for every φRS. For CS, we need even less information; indeed, it is sufficient to define I(φ) so that it encodes whether φ is unsatisfiable, or is satisfied only in deadlocked states (that is, states with an empty set of initial actions), or is satisfied only in processes that are not deadlocked, or is satisfied both in some deadlocked and non-deadlocked states. This information can be computed in linear time for every φCS, regardless of the size of the action set.

Corollary 16.
  1. (a)

    Satisfiability of formulae in CS and S is decidable in linear time.

  2. (b)

    Let |A|=k, where k1 is a constant. Satisfiability of formulae in RS is decidable in linear time.

On the other hand, if we can use an unbounded number of actions, the duality of a and [a] can be employed to define a polynomial-time reduction from Sat, the satisfiability problem for propositional logic, to satisfiability in RS. Moreover, if we are allowed to nest [a] modalities (aA) and have at least two actions, we can encode n propositional literals using formulae of logn size and reduce Sat to satisfiability in TS in polynomial time. Finally, satisfiability in 2S is in 𝖭𝖯, which can be shown by an appropriate tableau construction.

Proposition 17.

Let either X=RS and |A| be unbounded or X{TS,2S} and |A|>1. Satisfiability of formulae in X is 𝖭𝖯-complete.

Deciding satisfiability of formulae in ¯2S when |A|>1, turns out to be 𝖯𝖲𝖯𝖠𝖢𝖤-complete. (A proof is provided in [1, Appendix B.6].) This means that satisfiability of 3S is also 𝖯𝖲𝖯𝖠𝖢𝖤-complete, since ¯2S3S.

Proposition 18.

Let |A|>1. Satisfiability of formulae in 3S is 𝖯𝖲𝖯𝖠𝖢𝖤-complete.

3.2 The complexity of primality

We now study the complexity of checking whether a formula is prime in the logics that characterize some of the relations in Definition 1.

Primality in 𝓛𝑺.

Unsatisfiable formulae are trivially prime. Note also that in the case that |A|=1, all satisfiable formulae in S are prime. To address the problem for any action set, for every satisfiable formula φS we can efficiently compute a logically equivalent formula φ given by the grammar φ::=𝐭𝐭|aφ|φφ|φφ. We examine the complexity of deciding primality of such formulae.

Table 1: Rules for the simulation preorder. If | is displayed in the conclusion of a rule, then the rule is called universal. Otherwise, it is called existential.
Proposition 19.

Let φS such that 𝐟𝐟Sub(φ). Deciding whether φ is prime is in 𝖯.

Proof.

We describe algorithm PrimeS that, on input φ, decides primality of φ. PrimeS constructs a rooted directed acyclic graph, denoted by Gφ, from the formula φ as follows. Every vertex of the graph is either of the form φ1,φ2ψ – where φ1, φ2 and ψ are sub-formulae of φ – , or True. The algorithm starts from vertex x=(φ,φφ) and applies some rule in Table 1 to x in top-down fashion to generate one or two new vertices that are given at the bottom of the rule. These vertices are the children of x and the vertex x is labelled with either or , depending on which one is displayed at the bottom of the applied rule. If x has only one child, PrimeS labels it with . The algorithm recursively continues this procedure on the children of x. If no rule can be applied on a vertex, then this vertex has no outgoing edges. For the sake of clarity and consistency, we assume that right rules, i.e. (R) and (R), are applied before the left ones, i.e. (Li) and (Li), i=1,2, by the algorithm. The graph generated in this way is an alternating graph, as defined by Immerman in [28, Definition 3.24] (see also [1, Appendix A]). In Gφ, the source vertex s is φ,φφ, and the target vertex t is True. Algorithm PrimeS solves the problem Reacha on input Gφ, where Reacha is Reachability on alternating graphs and is defined in [28, pp. 53–54]. It accepts φ iff Reacha accepts Gφ. Intuitively, the source vertex (φ,φφ) can reach the target vertex True in the alternating graph Gφ exactly when for each pair of disjuncts ψ1 and ψ2 in the disjunctive normal form of φ there is a disjunct ψ3 in the disjunctive normal form of φ that is entailed by both ψ1 and ψ2. It turns out that this is a necessary and sufficient condition for the primality of φ. For example, consider the formula a𝐭𝐭b𝐭𝐭. There is no disjunct of a𝐭𝐭b𝐭𝐭 that is entailed by both a𝐭𝐭 and b𝐭𝐭. This is because that formula is not prime, as we observed in Example 8. On the other hand, the formula a𝐭𝐭ab𝐭𝐭 is prime since each of its disjuncts entails a𝐭𝐭. The full technical details are included in [1, Appendix C.1]. Note that graph Gφ is of polynomial size and there is a linear-time algorithm solving Reacha [28].

Primality in 𝓛𝑪𝑺.

Note that, in the case of CS, the rules in Table 1 do not work any more because, unlike S, the logic CS can express some “negative information” about the behaviour of processes. For example, let A={a} and φ=a𝐭𝐭. Then, PrimeS accepts φ, even though φ is not prime in CS. Indeed, φaa𝐭𝐭a𝟎, but φ⊧̸aa𝐭𝐭 and φ⊧̸a𝟎. However, we can overcome this problem as described in the proof sketch of Proposition 20 below.

Proposition 20.

Let φCS be a formula such that every ψSub(φ) is satisfiable. Deciding whether φ is prime is in 𝖯.

Proof.

Consider the algorithm that first computes the formula φ by applying rule a𝐭𝐭𝐭𝐭, and rules 𝐭𝐭ψtt𝐭𝐭 and 𝐭𝐭ψttψ modulo commutativity on φ. It holds that φ is prime iff φ is prime and φφ. Next, the algorithm decides primality of φ by solving reachability on a graph constructed as in the case of simulation using the rules in Table 1, where rule (tt) is replaced by rule (0), whose premise is 𝟎,𝟎𝟎 and whose conclusion is True. To verify φφ, the algorithm computes a process p for which φ is characteristic within CS and checks whether pφ. In fact, the algorithm has also a preprocessing phase during which it applies a set of rules on φ and obtains an equivalent formula with several desirable properties. See [1, Appendix C.2] for full details.

Primality in 𝓛𝑹𝑺.

The presence of formulae of the form [a]𝐟𝐟 in RS means that a prime formula φRS has at least to describe which actions are necessary or forbidden for any process that satisfies φ. For example, let A={a,b}. Then, a𝟎 is not prime, since a𝟎(a𝟎[b]𝐟𝐟)(a𝟎b𝐭𝐭), and a𝟎 entails neither a𝟎[b]𝐟𝐟 nor a𝟎b𝐭𝐭. Intuitively, we call a formula φ saturated if φ describes exactly which actions label the outgoing edges of any process p such that pφ. Formally, φ is saturated iff I(φ) is a singleton.

If the action set is bounded by a constant, given φ, we can efficiently construct a formula φs such that (1) φs is saturated and for every aφSub(φs), φ is saturated, (2) φ is prime iff φs is prime and φsφ, and (3) primality of φs can be efficiently reduced to Reacha(Gφs).

Proposition 21.

Let |A|=k, where k1 is a constant, and φRS be such that if ψSub(φ) is unsatisfiable, then ψ=𝐟𝐟 and ψ occurs in the scope of some [a]. Deciding whether φ is prime is in 𝖯.

As the following result indicates, primality checking for formulae in RS becomes computationally hard when |A| is not a constant.

Proposition 22.

Let |A| be unbounded. Deciding primality of formulae in RS is 𝖼𝗈𝖭𝖯-complete.

Proof.

We give a polynomial-time reduction from Sat to deciding whether a formula in RS is not prime. Let φ be a propositional formula over x0,,xn1. We set φ=(φ¬xn)(xni=1n1¬xi) and φ′′ to be φ where xi is substituted with ai𝟎 and ¬xi with [ai]𝐟𝐟, where A={a0,,an}. As φ′′ is satisfied in an.0, it is a satisfiable formula, and so φ′′ is prime in RS iff φ′′ is characteristic within RS. We show that φ is satisfiable iff φ′′ is not characteristic within RS. Let φ be satisfiable and let s denote a satisfying assignment of φ. Consider p1,p2P such that:

  • p1ai𝟶 iff s(xi)=true, for 0in1, and p1↛an, and

  • p2an𝟶 and p2↛a for every aA{an}.

It holds that piφ′′, i=1,2, p1RSp2, and p2RSp1. Suppose that there is a process q, such that φ′′ is characteristic for q within RS. If qan, then qRSp1. On the other hand, if q↛an, then qRSp2. So, both cases lead to a contradiction, which means that φ′′ is not characteristic within RS. For the converse implication, assume that φ is unsatisfiable. This implies that there is no process satisfying the first disjunct of φ′′. Thus, φ′′ is characteristic for p2, described above, within RS.

Proving the matching upper bound is non-trivial. There is a 𝖼𝗈𝖭𝖯 algorithm that uses properties of prime formulae and rules of Table 1, carefully adjusted to the case of ready simulation. We describe the algorithm and prove its correctness in [1, Appendix C.3.2].

Primality in 𝓛𝑻𝑺.

If we have more than one action, a propositional literal can be encoded by using the restricted nesting of modal operators that is allowed by the grammar for TS. This observation is the crux of the proof of the following result.

Proposition 23.

Let |A|>1. Deciding primality of formulae in TS is 𝖼𝗈𝖭𝖯-hard.

Proof.

Let A={0,1}. The proof follows the steps of the proof of Proposition 22. The initial and basic idea is that given an instance φ of Sat over x1,,xn, every xi is substituted with [bi1¯]𝐟𝐟bi1([bi2¯]𝐟𝐟bi2(([bik¯]𝐟𝐟bik𝟎))) and ¬xi with [bi1][bi2][bik]𝐟𝐟, where bi1bik is the binary representation of i and b¯=0, if b=1, and b¯=1, if b=0. For more technical details, see [1, Appendix C.4.1].

In contrast to the case for RS, bounding the size of the action set is not sufficient for deciding primality of formulae in TS in polynomial time. However, we show that both satisfiability and primality become efficiently solvable if we bound both |A| and the modal depth of the input formula.

Proposition 24.

Let |A|=k and φTS with md(φ)=d, where k,d1 are constants. Then, there is an algorithm that decides whether φ is satisfiable and prime in linear time.

Proof.

It is necessary and sufficient to check that there is a process p with 0pt(p)d such that (1) pφ and (2) for every q with 0pt(q)d+1, if qφ then pTSq. Since k and d are considered to be constants, there is an algorithm that does so and requires linear time in |φ|. In particular, the algorithm runs in 𝒪(22kd+1kd+1|φ|).

To classify the problem of deciding whether formulae in TS are characteristic when |A| is bounded, let us briefly introduce fixed-parameter tractable problems – see, for instance, [19, 21] for textbook accounts of this topic. Let LΣ×Σ be a parameterized problem. We denote by Ly the associated fixed-parameter problem Ly={x|(x,y)L}, where y is the parameter. Then, L𝖥𝖯𝖳 (or L is fixed-parameter tractable) if there are a constant α and an algorithm to determine if (x,y) is in L in time f(|y|)|x|α, where f is a computable function [18].

Corollary 25.

Let |A|=k, where k1 is a constant. The problems of deciding whether formulae in TS are satisfiable, prime, and characteristic are in 𝖥𝖯𝖳, with the modal depth of the input formula as the parameter.

We note that the 𝖼𝗈𝖭𝖯-hardness argument from Proposition 23 applies also to logics that include TS. Since TS2S, the 𝖼𝗈𝖭𝖯-hardness of deciding primality of formulae in TS with |A|>1 implies the same lower bound for deciding primality of formulae in 2S when |A|>1. Next, we show that in 3S with |A|>1 the problem becomes 𝖯𝖲𝖯𝖠𝖢𝖤-hard.

Primality in 𝓛𝟑𝑺.

Let |A|>1. 𝖯𝖲𝖯𝖠𝖢𝖤-hardness of 3S-satisfiability implies 𝖯𝖲𝖯𝖠𝖢𝖤-hardness of ¯3S-validity. Along the lines of the proof of [2, Theorem 26], we prove the following result.

Proposition 26.

Let |A|>1. Deciding prime formulae within 3S is 𝖯𝖲𝖯𝖠𝖢𝖤-hard.

 Remark 27.

Note that primality within BS coincides with primality modulo . In [2], primality modulo is called completeness and it is shown to be decidable in 𝖯𝖲𝖯𝖠𝖢𝖤. However, the algorithm used in [2] does not immediately imply that primality within 3S is in 𝖯𝖲𝖯𝖠𝖢𝖤.

Interestingly, 𝖯𝖲𝖯𝖠𝖢𝖤-hardness of 2S-validity implies the following theorem.

Theorem 28.

Let X{CS,RS,TS,2S,3S} and |A|>1. The problem of deciding whether a formula in ¯2S is prime in X is 𝖯𝖲𝖯𝖠𝖢𝖤-hard.

Proof.

We reduce 2S-validity to this problem. Let φ2S. The reduction will return a formula φ, such that φ is 2S-valid if and only if φ is prime in X. If 𝟶⊧̸φ, then let φ=𝐭𝐭; in this case, φ is not valid and 𝐭𝐭 is not prime in X. Otherwise, let φ=𝟎¬φ. If φ is valid, then φ𝟎 and therefore φ is prime in X. On the other hand, if φ is not valid, then there is some process p¬φ. From 𝟶φ, it holds that pa. Then, φ𝟎aAa𝐭𝐭, but φ⊧̸𝟎 and φ⊧̸aAa𝐭𝐭. Since 𝟎aAa𝐭𝐭CS, φ is not prime in X, where X{CS,RS,TS,2S,3S}.

Theorem 28 shows that when deciding primality in X, if we allow the input to be in a logic that is more expressive than X, the computational complexity of the problem can increase. It is then reasonable to constrain the input of the problem to be in X in order to obtain tractable problems as in the case of S and CS.

Before we give our main result summarizing the complexity of deciding characteristic formulae, we introduce two classes that play an important role in pinpointing the complexity of deciding characteristic formulae within RS, TS, and 2S. The first class is 𝖣𝖯={L1L2L1𝖭𝖯 and L2𝖼𝗈𝖭𝖯} [38] and the second one is 𝖴𝖲 [9], which is defined thus: A language L𝖴𝖲 iff there is a non-deterministic Turing machine T such that, for every instance x of L, xL iff T has exactly one accepting path. The problem UniqueSat, viz. the problem of deciding whether a given Boolean formula has exactly one satisfying truth assignment, is 𝖴𝖲-complete and 𝖴𝖲𝖣𝖯 [9].

Theorem 29.
  1. (a)

    Deciding characteristic formulae within S, CS, or RS with a bounded action set is in 𝖯.

  2. (b)

    Deciding characteristic formulae within RS with an unbounded action set is 𝖴𝖲-hard and belongs to 𝖣𝖯.

  3. (c)

    Deciding characteristic formulae within TS or 2S is 𝖴𝖲-hard.

  4. (d)

    Deciding characteristic formulae within 3S is 𝖯𝖲𝖯𝖠𝖢𝖤-hard.

Table 2: The complexity of deciding satisfiability and primality, and of finding characteristic formulae for different logics. Findingdecl (resp. Findingexpl) denotes the problem of finding the characteristic formula for a given finite loop-free process, when the output is given in declarative (resp. explicit) form. Superscripts =k, >k, and >1 mean that the action set is bounded by a constant, unbounded, and has more than one action, respectively. 𝖥𝖯 is the class of functions computable in polynomial time. All the results shown in white cells have been proven in this paper, whereas results in light gray are from [2].
S CS RS=k RS>k TS>1 2S>1 3S>1 BS
Satisfiability 𝖯 𝖯 𝖯 𝖭𝖯-comp. 𝖭𝖯-comp. 𝖭𝖯-comp. 𝖯𝖲𝖯𝖠𝖢𝖤-comp. 𝖯𝖲𝖯𝖠𝖢𝖤-comp.
Primality 𝖯 𝖯 𝖯 𝖼𝗈𝖭𝖯-comp. 𝖼𝗈𝖭𝖯-hard 𝖼𝗈𝖭𝖯-hard 𝖯𝖲𝖯𝖠𝖢𝖤-hard 𝖯𝖲𝖯𝖠𝖢𝖤-comp.
Findingdecl 𝖥𝖯 𝖥𝖯 𝖥𝖯 𝖥𝖯 𝖭𝖯-hard 𝖥𝖯 𝖥𝖯 𝖥𝖯
Findingexpl 𝖭𝖯-hard

4 Finding characteristic formulae: The gap between trace simulation and the other preorders

Let X{S,CS} or X=RS and |A| is bounded by a constant. The complexity of finding characteristic formulae within X depends on the representation of the output. If the characteristic formula has to be given in explicit form, then the following result holds.

Proposition 30.

Let X{S,CS} or X=RS and |A| is bounded by a constant. If finding the characteristic formula within X for a given finite loop-free process can be done in polynomial time when the output is given in explicit form, then 𝖯=𝖭𝖯.

Proof.

If the assumption of the proposition is true, the results of this paper allow us to decide trace equivalence of two finite loop-free processes in polynomial time. (For details, the reader can see [1, Appendix E.1].) Since trace equivalence for such processes is 𝖼𝗈𝖭𝖯-complete [27, Theorem 2.7(1)], this implies that 𝖯=𝖭𝖯.

However, if output formulae are given in declarative form, then finding characteristic formulae within X, where X{nS,CS,RS,BS}, n1, can be done in polynomial time.

Proposition 31.

For every X{nS,CS,RS,BS}, where n1, there is a polynomial-time algorithm that, given a finite loop-free process p, outputs a formula in declarative form that is characteristic for p within X.

Proof.

The proof relies on inductive definitions of characteristic formulae within X, where X{S,CS,RS,2S,BS}, given in [29, 6], and within nS, n3, given in [1, Appendix E.1]. These definitions guarantee that there are polynomial-time recursive procedures which construct characteristic formulae within X. We prove the proposition for X=2S below.

Given a finite loop-free process p, the characteristic formula for p within 2S is defined as follows: χ2S(p)=χ¯S(p)aApapaχ2S(p), where χ¯S(p)=aA[a]papχ¯S(p). Consider the algorithm that recursively constructs χ2S(p). The algorithm has to construct χ2S(p) and χ¯S(p) for every preach(p), yielding a linear number of equations. Moreover, for every preach(p), χ¯S(p) is of linear size in |p|. If p=𝟶, then χ¯S(p)=aA[a]𝐟𝐟. Otherwise, |χ¯S(p)|=𝒪(|{p′′pap′′}|+|A|), where |A| is added because for every aA such that p↛a, [a]𝐟𝐟 is a conjunct of χ¯S(p). Note that for every p′′, if χ¯S(p′′) occurs in χ¯S(p), it is considered to add 1 to the size of χ¯S(p). Therefore, |χ¯S(p)| is of linear size in |p|. Using a similar argument, we can show that χ2S(p) is of linear size. Thus, the algorithm constructs a linear number of equations, each of which is of linear size in |p|. The proofs for X{nS,CS,RS,BS}, n2, are analogous.

 Remark 32.

Note that the recursive procedures given in [29, 6] and [1, Appendix E.1] provide characteristic formulae for finite processes with loops provided that we enrich the syntax of our logics by adding greatest fixed points. See, for example, [6]. Consequently, constructing characteristic formulae for finite processes within X, X{nS,CS,RS,BS}, n1, can be done in polynomial time.

We now present the complexity gap between finding characteristic formulae for preorders CS,RS,BS, and nS, n1, and the same search problem for preorder TS. In the former case, there are characteristic formulae with both declaration size and equational length that are polynomial in the size of the processes they characterize, and they can be efficiently computed. On the contrary, for TS, even if characteristic formulae are always of polynomial declaration size and polynomial equational length, they cannot be efficiently computed unless 𝖯=𝖭𝖯.

Proposition 33.

Assume that for every finite loop-free process p, there is a characteristic formula within TS for p, denoted by χ(p), such that both decl(χ(p)) and eqlen(χ(p)) are in 𝒪(|p|k) for some k. Given a finite loop-free process p, if χ(p) can be computed in polynomial time, then 𝖯=𝖭𝖯.

Next, we prove that we do not expect that a finite loop-free process p has always a short characteristic formula within TS when this is combined with a second condition. To show that statement, we need the following lemma.

Lemma 34.

For every finite p and q, traces(p)=traces(q) iff pTSp+q and qTSp+q.

Proof.

If traces(p)=traces(q), then pTSp+q. Indeed, for every pap, it holds that p+qap and, trivially, pTSp. Moreover, traces(p+q)=traces(p)traces(q)=traces(p). Symmetrically, qTSp+q. Conversely, if pTSp+q and qTSp+q, then traces(p+q)=traces(p)=traces(q), and we are done.

Proposition 35.

Assume that the following two conditions hold:

  1. 1.

    For every finite loop-free process p, there is a characteristic formula within TS for p, denoted by χ(p), such that both decl(χ(p)) and eqlen(χ(p)) are in 𝒪(|p|k) for some k.

  2. 2.

    Given a finite loop-free process p and a formula φ in declarative form, deciding whether φ is characteristic for p within TS is in 𝖭𝖯.

Then 𝖭𝖯=𝖼𝗈𝖭𝖯.

Proof.

We describe an 𝖭𝖯 algorithm 𝒜 that decides non-membership in Sat and makes use of conditions 1 and 2 of the proposition. Let ϕ be an input CNF formula to Sat. Algorithm 𝒜 computes the DNF formula ¬ϕ for which it needs to decide DNF-Tautology. Then, 𝒜 reduces DNF-Tautology to deciding trace equivalence of processes p0 and q constructed as described in the proof of [27, Theorem 2.7(1)]. 𝒜 can decide if traces(p0)=traces(q) by checking p0TSp0+q and qTSp0+q because of Lemma 34. Finally, 𝒜 reduces p0TSp0+q (resp. qTSp0+q) to model checking: it needs to check whether p0+qχ(p0) (resp. p0+qχ(q)). To this end, 𝒜 guesses two formulae φp0 and φq in declarative form of polynomial declaration size and equational length, and two witnesses that verify that φp0 and φq are characteristic within TS for p0 and q, respectively. This can be done due to conditions 1 and 2. 𝒜 rejects the input iff both p0+qχ(p0) and p0+qχ(q) are true.

5 A note on deciding characteristic formulae modulo equivalence relations

So far, we have studied the complexity of algorithmic problems related to characteristic formulae in the modal logics that characterize the simulation-based preorders in van Glabbeek’s spectrum. As shown in [3], those logics are powerful enough to describe characteristic formulae for each finite, loop-free process up to the preorder they characterize. It is therefore natural to wonder whether they can also express characteristic formulae modulo the kernels of those preorders. The following result indicates that the logics X, where X{S,CS,RS}, have very weak expressive power when it comes to defining characteristic formulae modulo X.

Proposition 36.

No formula in S is characteristic for some process p with respect to S. For X{CS,RS}, a formula φ is characteristic for some process p with respect to X iff it is logically equivalent to aA[a]𝐟𝐟.

Proof.

Assume, towards contradiction, that there is a formula φcS in S that is characteristic for some process p with respect to S. Let be the depth of p and aA. Define process q=p+a+1𝟶 – that is, q is a copy of p with an additional path that has exactly +1 a-transitions. It is easy to see that pSq, but q≴Sp. Since pφcS, it holds that qφcS. However, qSp, which contradicts our assumption that φcS is characteristic for p with respect to S. For X{CS,RS}, note that a formula φ is logically equivalent to aA[a]𝐟𝐟 iff it is satisfied only by processes without outgoing transitions, and so it is characteristic for any such process modulo X. To prove that no formula is characteristic for some process p with positive depth modulo CS or RS, a similar argument to the one for S can be used. For RS, the action a should be chosen such that pap for some p.

For TS and 2S, there are non-trivial characteristic formulae modulo TS and 2S, respectively. For example, if A={a,b}, the formula φa=a([a]𝐟𝐟[b]𝐟𝐟)[b]𝐟𝐟[a][a]𝐟𝐟[a][b]𝐟𝐟 is satisfied only by processes that are equivalent, modulo those equivalences, to process pa=a.0 that has a single transition labelled with a. Thus, φa is characteristic for pa modulo both TS and 2S. We can use the following theorem as a tool to prove hardness of deciding characteristic formulae modulo some equivalence relation. Theorem 37 below is an extension of [2, Theorem 26], so that it holds for every X such that a characteristic formula modulo X exists, namely X{CS,RS,TS,2S,3S,BS}.

Theorem 37.

Let X{CS,RS,TS,2S,3S,BS}. Validity in ¯X reduces in polynomial time to deciding characteristic formulae with respect to X.

Note that, from the results of Subsection 3.1, validity in ¯RS with an unbounded action set, ¯TS with |A|>1, and ¯2S with |A|>1 is 𝖼𝗈𝖭𝖯-complete, whereas validity in ¯3S with |A|>1 is 𝖯𝖲𝖯𝖠𝖢𝖤-complete. Consequently, from Theorem 37, deciding whether a formula is characteristic modulo RS with an unbounded action set, TS with |A|>1, and 2S with |A|>1 is 𝖼𝗈𝖭𝖯-hard. That problem is 𝖯𝖲𝖯𝖠𝖢𝖤-hard modulo 3S with |A|>1.

6 Conclusions

In this paper, we studied the complexity of determining whether a formula is characteristic for some finite, loop-free process in each of the logics providing modal characterizations of the simulation-based semantics in van Glabbeek’s branching-time spectrum [22]. Since, as shown in [3], characteristic formulae in each of those logics are exactly the satisfiable and prime ones, we gave complexity results for the satisfiability and primality problems, and investigated the boundary between logics for which those problems can be solved in polynomial time and those for which they become computationally hard. Our results show that computational hardness already manifests itself in ready simulation semantics [10, 34] when the size of the action set is not a constant. Indeed, in that setting, the mere addition of formulae of the form [a]𝐟𝐟 to the logic that characterizes the simulation preorder yields a logic whose satisfiability and primality problems are 𝖭𝖯-hard and 𝖼𝗈𝖭𝖯-hard respectively. Moreover, we show that deciding primality in the logic characterizing 3-nested simulation is 𝖯𝖲𝖯𝖠𝖢𝖤-hard in the presence of at least two actions.

Amongst others, we also studied the complexity of constructing characteristic formulae in each of the logics we consider, both when such formulae are presented in explicit form and in declarative form. In particular, one of our results identifies a sharp difference between trace simulation and the other semantics when it comes to constructing characteristic formulae. For all the semantics apart from trace simulation, there are characteristic formulae that have declaration size and equational length that are polynomial in the size of the processes they characterize and they can be efficiently computed. On the contrary, for trace simulation, even if characteristic formulae are always of polynomial declaration size and polynomial equational length, they cannot be efficiently computed, unless 𝖯=𝖭𝖯.

Our results are summarized in Table 2 and open several avenues for future research that we are currently pursuing. First of all, the precise complexity of primality checking is still open for the logics characterizing the n-nested simulation semantics. We conjecture that checking primality in 2S is 𝖼𝗈𝖭𝖯-complete and that 𝖯𝖲𝖯𝖠𝖢𝖤-completeness holds for n-nested simulation when n3. Next, we plan to study the complexity of deciding whether formulae are characteristic in the extensions of the modal logics we have considered in this article with greatest fixed points. Indeed, in those extended languages, one can define characteristic formulae for finite processes. It is known that deciding whether a formula is characteristic is 𝖯𝖲𝖯𝖠𝖢𝖤-complete for 𝐇𝐌𝐋, but becomes 𝖤𝖷𝖯-complete for its extension with fixed-point operators – see reference [2]. It would be interesting to see whether similar results hold for the other logics. Finally, building on the work presented in [3], we plan to study the complexity of the algorithmic questions considered in this article for (some of) the linear-time semantics in van Glabbeek’s spectrum.

References

  • [1] Luca Aceto, Antonis Achilleos, Aggeliki Chalki, and Anna Ingólfsdóttir. The complexity of deciding characteristic formulae in van glabbeek’s branching-time spectrum. CoRR, abs/2405.13697, 2024. doi:10.48550/arXiv.2405.13697.
  • [2] Luca Aceto, Antonis Achilleos, Adrian Francalanza, and Anna Ingólfsdóttir. The complexity of identifying characteristic formulae. J. Log. Algebraic Methods Program., 112:100529, 2020. doi:10.1016/j.jlamp.2020.100529.
  • [3] Luca Aceto, Dario Della Monica, Ignacio Fábregas, and Anna Ingólfsdóttir. When are prime formulae characteristic? Theor. Comput. Sci., 777:3–31, 2019. doi:10.1016/j.tcs.2018.12.004.
  • [4] Luca Aceto, Ignacio Fábregas, David de Frutos-Escrig, Anna Ingólfsdóttir, and Miguel Palomino. Graphical representation of covariant-contravariant modal formulae. In Bas Luttik and Frank Valencia, editors, Proceedings 18th International Workshop on Expressiveness in Concurrency, EXPRESS 2011, Aachen, Germany, 5th September 2011, volume 64 of EPTCS, pages 1–15, 2011. doi:10.4204/EPTCS.64.1.
  • [5] Luca Aceto, Anna Ingólfsdóttir, Kim Guldstrand Larsen, and Jiri Srba. Reactive Systems: Modelling, Specification and Verification. Cambridge University Press, USA, 2007.
  • [6] Luca Aceto, Anna Ingólfsdóttir, Paul Blain Levy, and Joshua Sack. Characteristic formulae for fixed-point semantics: a general framework. Math. Struct. Comput. Sci., 22(2):125–173, 2012. doi:10.1017/S0960129511000375.
  • [7] Antonis Achilleos. The completeness problem for modal logic. In Proc. of Logical Foundations of Computer Science - International Symposium, LFCS 2018, volume 10703 of Lecture Notes in Computer Science, pages 1–21. Springer, 2018. doi:10.1007/978-3-319-72056-2_1.
  • [8] Benjamin Bisping, David N. Jansen, and Uwe Nestmann. Deciding all behavioral equivalences at once: A game for linear-time-branching-time spectroscopy. Log. Methods Comput. Sci., 18(3), 2022. doi:10.46298/lmcs-18(3:19)2022.
  • [9] Andreas Blass and Yuri Gurevich. On the unique satisfiability problem. Inf. Control., 55(1-3):80–88, 1982. doi:10.1016/S0019-9958(82)90439-9.
  • [10] Bard Bloom, Sorin Istrail, and Albert R. Meyer. Bisimulation can’t be traced. J. ACM, 42(1):232–268, 1995. doi:10.1145/200836.200876.
  • [11] Gérard Boudol and Kim Guldstrand Larsen. Graphical versus logical specifications. Theor. Comput. Sci., 106(1):3–20, 1992. doi:10.1016/0304-3975(92)90276-L.
  • [12] Michael C. Browne, Edmund M. Clarke, and Orna Grumberg. Characterizing finite Kripke structures in propositional temporal logic. Theor. Comput. Sci., 59:115–131, 1988. doi:10.1016/0304-3975(88)90098-9.
  • [13] Jin-yi Cai, Martin Fürer, and Neil Immerman. An optimal lower bound on the number of variables for graph identification. Comb., 12(4):389–410, 1992. doi:10.1007/BF01305232.
  • [14] Rance Cleaveland. On automatically explaining bisimulation inequivalence. In Edmund M. Clarke and Robert P. Kurshan, editors, Computer Aided Verification, 2nd International Workshop, CAV ’90, volume 531 of Lecture Notes in Computer Science, pages 364–372. Springer, 1990. doi:10.1007/BFb0023750.
  • [15] Rance Cleaveland and Bernhard Steffen. Computing behavioural relations, logically. In Javier Leach Albert, Burkhard Monien, and Mario Rodríguez-Artalejo, editors, Automata, Languages and Programming, 18th International Colloquium, ICALP91, Madrid, Spain, July 8-12, 1991, Proceedings, volume 510 of Lecture Notes in Computer Science, pages 127–138. Springer, 1991. doi:10.1007/3-540-54233-7_129.
  • [16] David de Frutos-Escrig, Carlos Gregorio-Rodríguez, Miguel Palomino, and David Romero-Hernández. Unifying the linear time-branching time spectrum of process semantics. Log. Methods Comput. Sci., 9(2), 2013. doi:10.2168/LMCS-9(2:11)2013.
  • [17] Rocco De Nicola and Frits W. Vaandrager. Three logics for branching bisimulation. J. ACM, 42(2):458–487, 1995. doi:10.1145/201019.201032.
  • [18] Rodney G. Downey and Michael R. Fellows. Fixed-parameter tractability and completeness I: Basic results. SIAM J. Comput., 24(4):873–921, 1995. doi:10.1137/S0097539792228228.
  • [19] Rodney G. Downey and Michael R. Fellows. Fundamentals of Parameterized Complexity. Texts in Computer Science. Springer, 2013. doi:10.1007/978-1-4471-5559-1.
  • [20] Heinz-Dieter Ebbinghaus, Jörg Flum, and Wolfgang Thomas. Mathematical logic (2. ed.). Undergraduate texts in mathematics. Springer, 1994.
  • [21] Jörg Flum and Martin Grohe. Parameterized Complexity Theory. Texts in Theoretical Computer Science. An EATCS Series. Springer, 2006. doi:10.1007/3-540-29953-X.
  • [22] Rob J. van Glabbeek. The linear time - branching time spectrum I. In Jan A. Bergstra, Alban Ponse, and Scott A. Smolka, editors, Handbook of Process Algebra, pages 3–99. North-Holland / Elsevier, 2001. doi:10.1016/b978-044482830-9/50019-9.
  • [23] Susanne Graf and Joseph Sifakis. A modal characterization of observational congruence on finite terms of CCS. Inf. Control., 68(1-3):125–145, 1986. doi:10.1016/S0019-9958(86)80031-6.
  • [24] Jan Friso Groote and Frits W. Vaandrager. Structured operational semantics and bisimulation as a congruence. Inf. Comput., 100(2):202–260, 1992. doi:10.1016/0890-5401(92)90013-6.
  • [25] Matthew Hennessy and Robin Milner. Algebraic laws for nondeterminism and concurrency. J. ACM, 32(1):137–161, 1985. doi:10.1145/2455.2460.
  • [26] Sören Holmström. A refinement calculus for specifications in Hennessy-Milner logic with recursion. Formal Aspects Comput., 1(3):242–272, 1989. doi:10.1007/BF01887208.
  • [27] Harry B. Hunt III, Daniel J. Rosenkrantz, and Thomas G. Szymanski. On the equivalence, containment, and covering problems for the regular and context-free languages. J. Comput. Syst. Sci., 12(2):222–268, 1976. doi:10.1016/S0022-0000(76)80038-4.
  • [28] Neil Immerman. Descriptive Complexity. Springer, 1999. doi:10.1007/978-1-4612-0539-5.
  • [29] Anna Ingolfsdottir, Jens Christian Godskesen, and Michael Zeeberg. Fra Hennessy-Milner logik til CCS-processer. Master’s thesis, Aalborg University, 1987. In Danish.
  • [30] Sandra Kiefer, Pascal Schweitzer, and Erkal Selman. Graphs identified by logics with counting. In Giuseppe F. Italiano, Giovanni Pighizzini, and Donald Sannella, editors, Mathematical Foundations of Computer Science 2015 - 40th International Symposium, MFCS 2015, Milan, Italy, August 24-28, 2015, Proceedings, Part I, volume 9234 of Lecture Notes in Computer Science, pages 319–330. Springer, 2015. doi:10.1007/978-3-662-48057-1_25.
  • [31] Dexter Kozen. Results on the propositional μ-calculus. Theor. Comput. Sci., 27:333–354, 1983. doi:10.1016/0304-3975(82)90125-6.
  • [32] Richard E. Ladner. The computational complexity of provability in systems of modal propositional logic. SIAM J. Comput., 6(3):467–480, 1977. doi:10.1137/0206033.
  • [33] Kim Guldstrand Larsen. Proof systems for satisfiability in Hennessy-Milner logic with recursion. Theor. Comput. Sci., 72(2&3):265–288, 1990. doi:10.1016/0304-3975(90)90038-J.
  • [34] Kim Guldstrand Larsen and Arne Skou. Bisimulation through probabilistic testing. Inf. Comput., 94(1):1–28, 1991. doi:10.1016/0890-5401(91)90030-6.
  • [35] Jan Martens and Jan Friso Groote. Computing minimal distinguishing Hennessy-Milner formulas is NP-hard, but variants are tractable. In Guillermo A. Pérez and Jean-François Raskin, editors, 34th International Conference on Concurrency Theory, CONCUR 2023, volume 279 of LIPIcs, pages 32:1–32:17. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023. doi:10.4230/LIPIcs.CONCUR.2023.32.
  • [36] Robin Milner. An algebraic definition of simulation between programs. In D. C. Cooper, editor, Proceedings of the 2nd International Joint Conference on Artificial Intelligence, IJCAI 1971, pages 481–489. William Kaufmann, 1971. URL: http://ijcai.org/Proceedings/71/Papers/044.pdf.
  • [37] Robin Milner. Communication and Concurrency. Prentice Hall, 1989.
  • [38] Christos H. Papadimitriou and Mihalis Yannakakis. The complexity of facets (and some facets of complexity). J. Comput. Syst. Sci., 28(2):244–259, 1984. doi:10.1016/0022-0000(84)90068-0.
  • [39] Bernhard Steffen and Anna Ingólfsdóttir. Characteristic formulae for processes with divergence. Inf. Comput., 110(1):149–163, 1994. doi:10.1006/inco.1994.1028.
  • [40] Wolfgang Thomas. On the Ehrenfeucht-Fraïssé game in theoretical computer science. In Marie-Claude Gaudel and Jean-Pierre Jouannaud, editors, TAPSOFT’93: Theory and Practice of Software Development, International Joint Conference CAAP/FASE, volume 668 of Lecture Notes in Computer Science, pages 559–568. Springer, 1993. doi:10.1007/3-540-56610-4_89.