The Complexity of Deciding Characteristic Formulae in Van Glabbeek’s Branching-Time Spectrum
Abstract
Characteristic formulae give a complete logical description of the behaviour of processes modulo some chosen notion of behavioural semantics. They allow one to reduce equivalence or preorder checking to model checking, and are exactly the formulae in the modal logics characterizing classic behavioural equivalences and preorders for which model checking can be reduced to equivalence or preorder checking.
This paper studies the complexity of determining whether a formula is characteristic for some process in each of the logics providing modal characterizations of the simulation-based semantics in van Glabbeek’s branching-time spectrum. Since characteristic formulae in each of those logics are exactly the satisfiable and prime ones, this article presents complexity results for the satisfiability and primality problems, and investigates the boundary between modal logics for which those problems can be solved in polynomial time and those for which they become computationally hard.
Amongst other contributions, this article also studies the complexity of constructing characteristic formulae in the modal logics characterizing simulation-based semantics, both when such formulae are presented in explicit form and via systems of equations.
Keywords and phrases:
Characteristic formulae, prime formulae, bisimulation, simulation relations, modal logics, complexity theory, satisfiabilityCopyright and License:
![[Uncaptioned image]](x1.png)
2012 ACM Subject Classification:
Theory of computation Modal and temporal logics ; Theory of computation Complexity theory and logicAcknowledgements:
The authors thank the anonymous reviewers for comments that led to improvements in the paper. This paper is dedicated to the memory of Rance Cleaveland (1961–2024), who used characteristic formulae to compute behavioural relations, logically and efficiently.Funding:
This work has been funded by the projects “Open Problems in the Equational Logic of Processes (OPEL)” (grant no. 196050), “Mode(l)s of Verification and Monitorability” (MoVeMnt) (grant no. 217987), and “Learning and Applying Probabilistic Systems” (grant no. 206574-051) of the Icelandic Research Fund.Editors:
Jörg Endrullis and Sylvain SchmitzSeries and Publisher:

1 Introduction
Several notions of behavioural relations have been proposed in concurrency theory to describe when one process is a suitable implementation of another. Many such relations have been catalogued by van Glabbeek in his seminal linear-time/branching-time spectrum [22], together with a variety of alternative ways of describing them including testing scenarios and axiom systems. To our mind, modal characterizations of behavioural equivalences and preorders are some of the most classic and pleasing results in concurrency theory – see, for instance, [25] for the seminal Hennessy-Milner theorem and [12, 16, 17, 22] for similar results for other relations in van Glabbeek’s spectrum and other settings. By way of example, in their archetypal modal characterization of bisimilarity, Hennessy and Milner have shown in [25] that, under a mild finiteness condition, two processes are bisimilar if, and only if, they satisfy the same formulae in a multi-modal logic that is now often called Hennessy-Milner logic. Apart from its intrinsic theoretical interest, this seminal logical characterization of bisimilarity means that, when two processes are not bisimilar, there is always a formula that distinguishes between them. Such a formula describes a reason why the two processes are not bisimilar, provides useful debugging information and can be algorithmically constructed over finite processes – see, for instance, [8, 14] and [35], where Martens and Groote show that, in general, computing minimal distinguishing Hennessy-Milner formulae is -hard.
On the other hand, the Hennessy-Milner theorem seems to be less useful to show that two processes are bisimilar, since that would involve verifying that they satisfy the same formulae, and there are infinitely many of those. However, as shown in works such as [3, 6, 12, 23, 39], the logics that underlie classic modal characterization theorems for equivalences and preorders over processes allow one to express characteristic formulae. Intuitively, a characteristic formula for a process gives a complete logical characterization of the behaviour of modulo the behavioural semantics of interest , in the sense that any process is related to with respect to if, and only if, it satisfies .111Formulae akin to characteristic ones first occurred in the study of equivalence of structures using first-order formulae up to some quantifier rank. See, for example, the survey paper [40] and the textbook [20]. The existence of formulae in first-order logic with counting that characterize graphs up to isomorphism has significantly contributed to the study of the complexity of the Graph Isomorphism problem – see, for instance, [13, 30]. Since the formula can be constructed from , characteristic formulae reduce the problem of checking whether a process is related to by to a model checking problem, viz. whether satisfies . See, for instance, the classic reference [15] for applications of this approach.
Characteristic formulae, thus, allow one to reduce equivalence and preorder checking to model checking. But what model checking problems can be reduced to equivalence/preorder checking ones? To the best of our knowledge, that question was first studied by Boudol and Larsen in [11] in the setting of modal refinement over modal transition systems. See [3, 4] for other contributions in that line of research. The aforementioned articles showed that characteristic formulae coincide with those that are satisfiable and prime. (A formula is prime if whenever it entails a disjunction , then it must entail or .) Moreover, characteristic formulae with respect to bisimilarity coincide with the formulae that are satisfiable and complete [7]. (A modal formula is complete if, for each formula , it entails either or its negation.) The aforementioned results give semantic characterizations of the formulae that are characteristic within the logics that correspond to the behavioural semantics in van Glabbeek’s spectrum. Those characterizations tell us for what logical specifications model checking can be reduced to equivalence or preorder checking. However, given a specification expressed as a modal formula, can one decide whether that formula is characteristic and therefore can be model checked using algorithms for behavioural equivalences or preorders? And, if so, what is the complexity of checking whether a formula is characteristic? Perhaps surprisingly, those questions were not addressed in the literature until the recent papers [2, 7], where it is shown that, in the setting of the modal logics that characterize bisimilarity over natural classes of Kripke structures and labelled transition systems, the problem of checking whether a formula is characteristic for some process modulo bisimilarity is computationally hard and, typically, has the same complexity as validity checking, which is -complete for Hennessy-Milner logic and -complete for its extension with fixed-point operators [26, 33] and the -calculus [31].
The aforementioned hardness results for the logics characterizing bisimilarity tell us that deciding whether a formula is characteristic in bisimulation semantics is computationally hard. But what about the less expressive logics that characterize the coarser semantics in van Glabbeek’s spectrum? And for what logics characterizing relations in the spectrum does computational hardness manifest itself? Finally, what is the complexity of computing a characteristic formula for a process?
The aim of this paper is to answer the aforementioned questions for some of the simulation-based semantics in the spectrum. In particular, we study the complexity of determining whether a formula is characteristic modulo the simulation [36], complete simulation and ready simulation preorders [10, 34], as well as the trace simulation and the -nested simulation preorders [24]. Since characteristic formulae are exactly the satisfiable and prime ones for each behavioural relation in van Glabbeek’s spectrum [3], the above-mentioned tasks naturally break down into studying the complexity of satisfiability and primality checking for formulae in the fragments of Hennessy-Milner logic that characterize those preorders. By using a reduction to the, seemingly unrelated, reachability problem in alternating graphs, as defined by Immerman in [28, Definition 3.24], we discover that both those problems are decidable in polynomial time for the simulation and the complete simulation preorders, as well as for the ready simulation preorder when the set of actions has constant size. On the other hand, when the set of actions is unbounded (that is, it is an input of the algorithmic problem at hand), the problems of checking satisfiability and primality for formulae in the logic characterizing the ready simulation preorder are -complete and -complete respectively. We also show that deciding whether a formula is characteristic in that setting is -hard [9] (that is, it is at least as hard as the problem of deciding whether a given Boolean formula has exactly one satisfying truth assignment) and belongs to , which is the class of languages that are the intersection of one language in and of one in [38].222The class contains both and , and is contained in the class of problems that can be solved in polynomial time with an oracle. These negative results are in stark contrast with the positive results for the simulation and the complete simulation preorder, and indicate that augmenting the logic characterizing the simulation preorder with formulae that state that a process cannot perform a given action suffices to make satisfiability and primality checking computationally hard. In passing, we also prove that, in the presence of at least two actions, (1) for the logics characterizing the trace simulation and 2-nested simulation preorders, satisfiability and primality checking are -complete and -hard respectively, and deciding whether a formula is characteristic is -hard, (2) for the logic that characterizes the trace simulation preorder, deciding whether a formula is characteristic is fixed-parameter tractable [18], with the modal depth of the input formula as the parameter, when the size of the action set is a constant, and (3) deciding whether a formula is characteristic in the modal logic for the 3-nested simulation preorder [24] is -hard. (The proof of the last result relies on “simulating” Ladner’s reduction proving the -hardness of satisfiability for modal logic [32] using the limited alternations of modal operators allowed by the logic for the 3-nested simulation preorder.)
We also study the complexity of computing characteristic formulae for finite, loop-free processes modulo the above-mentioned simulation semantics. To do so, we consider two different representations for formulae, namely an explicit form, where formulae are given by strings of symbols generated by their respective grammars, and a declarative form, where formulae are described by systems of equations. We prove that, even for the coarsest semantics we consider, such as the simulation and complete simulation preorders, computing the characteristic formula in explicit form for a finite, loop-free process cannot be done in polynomial time, unless . On the other hand, the characteristic formula for a process modulo the preorders we study, apart from the trace simulation preorder, can be computed in polynomial time if the output is given in declarative form. Intuitively, this is due to the fact that, unlike the explicit form, systems of equations allow for sharing of subformulae and there are formulae for which this sharing leads to an exponentially more concise representation. Finally, in sharp contrast to that result, we prove that, modulo the trace simulation preorder, even if characteristic formulae are always of polynomial declaration size and polynomial equational length, they cannot be efficiently computed unless . In passing, we remark that all the aforementioned lower and upper bounds hold also for finite processes with loops, provided that, as done in [6, 29, 39], we add greatest fixed points or systems of equations interpreted as greatest fixed points to the modal logics characterizing the semantics we study in this article.
2 Preliminaries
In this paper, we model processes as finite, loop-free labelled transition systems (LTS). A finite LTS is a triple , where is a finite set of states (or processes), is a finite, non-empty set of actions and is a transition relation. As usual, we use instead of . For each , we write to mean that there is a sequence of transitions labelled with starting from and ending at . An LTS is loop-free iff holds only when is the empty trace . A process is reachable from if , for some . We define the size of an LTS , denoted by , to be . The size of a process , denoted by , is the cardinality of plus the cardinality of the set restricted to . We define the set of initials of , denoted , as the set . We write if , if , and if . A sequence of actions is a trace of if there is a such that . We denote the set of traces of by . The depth of a finite, loop-free process , denoted by , is the length of a longest trace of .
In what follows, we shall often describe finite, loop-free processes using the fragment of Milner’s CCS [37] given by the following grammar:
where . For each action and terms , we write iff
-
(i)
or
-
(ii)
, for some , and or holds.
In this paper, we consider the following relations in van Glabbeek’s spectrum: simulation, complete simulation, ready simulation, trace simulation, 2-nested simulation, 3-nested simulation, and bisimilarity. Their definitions are given below.
Definition 1 ([37, 22, 3]).
We define each of the following preorders as the largest binary relation over that satisfies the corresponding condition.
-
(a)
Simulation preorder (S): for all there exists some such that .
-
(b)
Complete simulation (CS):
-
(i)
for all there exists some such that , and
-
(ii)
iff .
-
(i)
-
(c)
Ready simulation (RS):
-
(i)
for all there exists some such that , and
-
(ii)
.
-
(i)
-
(d)
Trace simulation (TS):
-
(i)
for all there exists some such that , and
-
(ii)
.
-
(i)
-
(e)
-Nested simulation (S), where , is defined inductively as follows: The -nested simulation preorder is , and the -nested simulation preorder for is the largest relation such that
-
(i)
for all there exists some such that , and
-
(ii)
.
-
(i)
-
(f)
Bisimilarity (BS): is the largest symmetric relation satisfying the condition defining the simulation preorder.
It is well-known that bisimilarity is an equivalence relation and all the other relations are preorders [22, 37]. We sometimes write instead of . Moreover, we have that – see [22].
Definition 2 (Kernels of the preorders).
For each , the kernel of is the equivalence relation defined thus: for every , iff and .
Each relation , where , is characterized by a fragment of Hennessy-Milner logic, , defined as follows [22, 3].
Definition 3.
For , is defined by the corresponding grammar given below ():
-
(a)
:
-
(b)
: , where .
-
(c)
:
-
(d)
: , where .
-
(e)
:
-
(f)
:
-
(g)
():
Note that the explicit use of negation in the grammar for is unnecessary. However, we included the negation operator explicitly so that extends syntactically each of the other modal logics presented in Definition 3.
Given a formula , the modal depth of , denoted by , is the maximum nesting of modal operators in . (See [1, Appendix A] for the formal definition.)
Truth in an LTS is defined via the satisfaction relation as follows:
If , we say that is true, or satisfied, in . If is satisfied in every process in every LTS, we say that is valid. Formula entails , denoted by , if every process that satisfies also satisfies . Moreover, and are logically equivalent, denoted by , if and . A formula is satisfiable if there is a process that satisfies . Finally, denotes the set of subformulae of formula .
For , we define the dual fragment of to be , where , , , , , , and . It is not hard to see that iff , for every process . Given a process , we define . A simplification of the Hennessy-Milner theorem gives a modal characterization of bisimilarity over finite processes. An analogous result is true for every preorder examined in this paper.
Theorem 4 (Hennessy-Milner theorem [25]).
For all processes in a finite LTS, iff .
Remark 6.
Neither nor disjunction are needed in several of the modal characterizations presented in the above result. The reason for adding those constructs to all the logics is that doing so makes our subsequent results more general and uniform. For example, having and disjunction in all logics allows us to provide algorithms that determine whether a formula in a logic is prime with respect to a sublogic.
When the logic is clear from the context, we say that is prime. Note that every unsatisfiable formula is trivially prime in , for every .
Example 8.
The formula is prime in . Indeed, let and assume that . Since , without loss of generality, we have that . We claim that . To see this, let be some process such that – that is, a process such that for some . It is easy to see that . Since , Proposition 5 yields that , proving our claim and the primality of . On the other hand, the formula is not prime in . Indeed, , but neither nor hold.
The definition of a characteristic formula within logic is given next.
Definition 9 ([5, 23, 39]).
Let . A formula is characteristic for within iff, for all , it holds that . We denote by the unique characteristic formula for with respect to logical equivalence.
Remark 10.
In [3, Table 1 and Theorem 5], Aceto, Della Monica, Fabregas, and Ingólfsdóttir presented characteristic formulae for each of the semantics we consider in this paper, and showed that characteristic formulae are exactly the satisfiable and prime ones.
Proposition 11 ([3]).
For every , is characteristic for some process within iff is satisfiable and prime in .
Remark 12.
Proposition 11 is the only result we use from [3] and we employ it as a “black box”. The (non-trivial) methods used in the proof of that result given in that reference do not play any role in our technical developments.
We note, in passing, that the article [3] does not deal explicitly with . However, its results apply to all the -nested simulation preorders.
We can also consider characteristic formulae modulo equivalence relations as follows.
Definition 13.
Let . A formula is characteristic for modulo iff for all , it holds that .333The above definition can also be phrased as follows: A formula is characteristic for modulo iff, for all , it holds that . This version of the definition is used, in the setting of bisimilarity, in references such as [2, 29].
When studying the complexity of finding a characteristic formula for some process with respect to the behavioural relations we have introduced above, we will need some way of measuring the size of the resulting formula as a function of . A formula in , where , can be given in explicit form as in Definition 3 or by means of a system of equations. In the latter case, we say that the formula is given in declarative form. For example, formula can be represented by the equations and . We define:
-
the size of formula , denoted by , to be the number of symbols that appear in the explicit form of ,
-
the declaration size of formula , denoted by , to be the number of equations that are used in the declarative form of , and
-
the equational length of formula , denoted by , to be the maximum number of symbols that appear in an equation in the declarative form of .
For example, for the aforementioned formula , we have that , , and . Note that , for each .
3 The complexity of deciding characteristic formulae modulo preorders
In this section, we address the complexity of deciding whether formulae in , , , , , and are characteristic. Since characteristic formulae in those logics are exactly the satisfiable and prime ones [3, Theorem 5], we study the complexity of checking satisfiability and primality separately in Subsections 3.1 and 3.2.
3.1 The complexity of satisfiability
To address the complexity of the satisfiability problem in , , or , we associate a set to every formula . Intuitively, describes all possible sets of initial actions that a process can have, when .
Definition 14.
Let . We define inductively as follows:
-
(a)
,
-
(b)
,
-
(c)
,
-
(d)
-
(e)
,
-
(f)
.
Note that .
Lemma 15.
For every , the following statements hold:
-
(a)
for every , iff there is a process such that and .
-
(b)
is unsatisfiable iff .
When the number of actions is constant, can be computed in linear time for every . For , we need even less information; indeed, it is sufficient to define so that it encodes whether is unsatisfiable, or is satisfied only in deadlocked states (that is, states with an empty set of initial actions), or is satisfied only in processes that are not deadlocked, or is satisfied both in some deadlocked and non-deadlocked states. This information can be computed in linear time for every , regardless of the size of the action set.
Corollary 16.
-
(a)
Satisfiability of formulae in and is decidable in linear time.
-
(b)
Let , where is a constant. Satisfiability of formulae in is decidable in linear time.
On the other hand, if we can use an unbounded number of actions, the duality of and can be employed to define a polynomial-time reduction from Sat, the satisfiability problem for propositional logic, to satisfiability in . Moreover, if we are allowed to nest modalities () and have at least two actions, we can encode propositional literals using formulae of size and reduce Sat to satisfiability in in polynomial time. Finally, satisfiability in is in , which can be shown by an appropriate tableau construction.
Proposition 17.
Let either and be unbounded or and . Satisfiability of formulae in is -complete.
Deciding satisfiability of formulae in when , turns out to be -complete. (A proof is provided in [1, Appendix B.6].) This means that satisfiability of is also -complete, since .
Proposition 18.
Let . Satisfiability of formulae in is -complete.
3.2 The complexity of primality
We now study the complexity of checking whether a formula is prime in the logics that characterize some of the relations in Definition 1.
Primality in .
Unsatisfiable formulae are trivially prime. Note also that in the case that , all satisfiable formulae in are prime. To address the problem for any action set, for every satisfiable formula we can efficiently compute a logically equivalent formula given by the grammar . We examine the complexity of deciding primality of such formulae.
Proposition 19.
Let such that . Deciding whether is prime is in .
Proof.
We describe algorithm that, on input , decides primality of . constructs a rooted directed acyclic graph, denoted by , from the formula as follows. Every vertex of the graph is either of the form – where , and are sub-formulae of – , or True. The algorithm starts from vertex and applies some rule in Table 1 to in top-down fashion to generate one or two new vertices that are given at the bottom of the rule. These vertices are the children of and the vertex is labelled with either or , depending on which one is displayed at the bottom of the applied rule. If has only one child, labels it with . The algorithm recursively continues this procedure on the children of . If no rule can be applied on a vertex, then this vertex has no outgoing edges. For the sake of clarity and consistency, we assume that right rules, i.e. (R) and (R), are applied before the left ones, i.e. (L) and (L), , by the algorithm. The graph generated in this way is an alternating graph, as defined by Immerman in [28, Definition 3.24] (see also [1, Appendix A]). In , the source vertex is , and the target vertex is True. Algorithm solves the problem on input , where is Reachability on alternating graphs and is defined in [28, pp. 53–54]. It accepts iff accepts . Intuitively, the source vertex can reach the target vertex True in the alternating graph exactly when for each pair of disjuncts and in the disjunctive normal form of there is a disjunct in the disjunctive normal form of that is entailed by both and . It turns out that this is a necessary and sufficient condition for the primality of . For example, consider the formula . There is no disjunct of that is entailed by both and . This is because that formula is not prime, as we observed in Example 8. On the other hand, the formula is prime since each of its disjuncts entails . The full technical details are included in [1, Appendix C.1]. Note that graph is of polynomial size and there is a linear-time algorithm solving [28].
Primality in .
Note that, in the case of , the rules in Table 1 do not work any more because, unlike , the logic can express some “negative information” about the behaviour of processes. For example, let and . Then, accepts , even though is not prime in . Indeed, , but and . However, we can overcome this problem as described in the proof sketch of Proposition 20 below.
Proposition 20.
Let be a formula such that every is satisfiable. Deciding whether is prime is in .
Proof.
Consider the algorithm that first computes the formula by applying rule , and rules and modulo commutativity on . It holds that is prime iff is prime and . Next, the algorithm decides primality of by solving reachability on a graph constructed as in the case of simulation using the rules in Table 1, where rule (tt) is replaced by rule (0), whose premise is and whose conclusion is True. To verify , the algorithm computes a process for which is characteristic within and checks whether . In fact, the algorithm has also a preprocessing phase during which it applies a set of rules on and obtains an equivalent formula with several desirable properties. See [1, Appendix C.2] for full details.
Primality in .
The presence of formulae of the form in means that a prime formula has at least to describe which actions are necessary or forbidden for any process that satisfies . For example, let . Then, is not prime, since , and entails neither nor . Intuitively, we call a formula saturated if describes exactly which actions label the outgoing edges of any process such that . Formally, is saturated iff is a singleton.
If the action set is bounded by a constant, given , we can efficiently construct a formula such that (1) is saturated and for every , is saturated, (2) is prime iff is prime and , and (3) primality of can be efficiently reduced to .
Proposition 21.
Let , where is a constant, and be such that if is unsatisfiable, then and occurs in the scope of some . Deciding whether is prime is in .
As the following result indicates, primality checking for formulae in becomes computationally hard when is not a constant.
Proposition 22.
Let be unbounded. Deciding primality of formulae in is -complete.
Proof.
We give a polynomial-time reduction from Sat to deciding whether a formula in is not prime. Let be a propositional formula over . We set and to be where is substituted with and with , where . As is satisfied in , it is a satisfiable formula, and so is prime in iff is characteristic within . We show that is satisfiable iff is not characteristic within . Let be satisfiable and let denote a satisfying assignment of . Consider such that:
-
iff , for , and , and
-
and for every .
It holds that , , , and . Suppose that there is a process , such that is characteristic for within . If , then . On the other hand, if , then . So, both cases lead to a contradiction, which means that is not characteristic within . For the converse implication, assume that is unsatisfiable. This implies that there is no process satisfying the first disjunct of . Thus, is characteristic for , described above, within .
Primality in .
If we have more than one action, a propositional literal can be encoded by using the restricted nesting of modal operators that is allowed by the grammar for . This observation is the crux of the proof of the following result.
Proposition 23.
Let . Deciding primality of formulae in is -hard.
Proof.
Let . The proof follows the steps of the proof of Proposition 22. The initial and basic idea is that given an instance of Sat over , every is substituted with and with , where is the binary representation of and , if , and , if . For more technical details, see [1, Appendix C.4.1].
In contrast to the case for , bounding the size of the action set is not sufficient for deciding primality of formulae in in polynomial time. However, we show that both satisfiability and primality become efficiently solvable if we bound both and the modal depth of the input formula.
Proposition 24.
Let and with , where are constants. Then, there is an algorithm that decides whether is satisfiable and prime in linear time.
Proof.
It is necessary and sufficient to check that there is a process with such that (1) and (2) for every with , if then . Since and are considered to be constants, there is an algorithm that does so and requires linear time in . In particular, the algorithm runs in .
To classify the problem of deciding whether formulae in are characteristic when is bounded, let us briefly introduce fixed-parameter tractable problems – see, for instance, [19, 21] for textbook accounts of this topic. Let be a parameterized problem. We denote by the associated fixed-parameter problem , where is the parameter. Then, (or is fixed-parameter tractable) if there are a constant and an algorithm to determine if is in in time , where is a computable function [18].
Corollary 25.
Let , where is a constant. The problems of deciding whether formulae in are satisfiable, prime, and characteristic are in , with the modal depth of the input formula as the parameter.
We note that the -hardness argument from Proposition 23 applies also to logics that include . Since , the -hardness of deciding primality of formulae in with implies the same lower bound for deciding primality of formulae in when . Next, we show that in with the problem becomes -hard.
Primality in .
Let . -hardness of -satisfiability implies -hardness of -validity. Along the lines of the proof of [2, Theorem 26], we prove the following result.
Proposition 26.
Let . Deciding prime formulae within is -hard.
Remark 27.
Interestingly, -hardness of -validity implies the following theorem.
Theorem 28.
Let and . The problem of deciding whether a formula in is prime in is -hard.
Proof.
We reduce -validity to this problem. Let . The reduction will return a formula , such that is -valid if and only if is prime in . If , then let ; in this case, is not valid and is not prime in . Otherwise, let . If is valid, then and therefore is prime in . On the other hand, if is not valid, then there is some process . From , it holds that . Then, , but and . Since , is not prime in , where .
Theorem 28 shows that when deciding primality in , if we allow the input to be in a logic that is more expressive than , the computational complexity of the problem can increase. It is then reasonable to constrain the input of the problem to be in in order to obtain tractable problems as in the case of and .
Before we give our main result summarizing the complexity of deciding characteristic formulae, we introduce two classes that play an important role in pinpointing the complexity of deciding characteristic formulae within , , and . The first class is [38] and the second one is [9], which is defined thus: A language iff there is a non-deterministic Turing machine such that, for every instance of , iff has exactly one accepting path. The problem UniqueSat, viz. the problem of deciding whether a given Boolean formula has exactly one satisfying truth assignment, is -complete and [9].
Theorem 29.
-
(a)
Deciding characteristic formulae within , , or with a bounded action set is in .
-
(b)
Deciding characteristic formulae within with an unbounded action set is -hard and belongs to .
-
(c)
Deciding characteristic formulae within or is -hard.
-
(d)
Deciding characteristic formulae within is -hard.
Satisfiability | -comp. | -comp. | -comp. | -comp. | -comp. | |||
Primality | -comp. | -hard | -hard | -hard | -comp. | |||
-hard | ||||||||
-hard |
4 Finding characteristic formulae: The gap between trace simulation and the other preorders
Let or and is bounded by a constant. The complexity of finding characteristic formulae within depends on the representation of the output. If the characteristic formula has to be given in explicit form, then the following result holds.
Proposition 30.
Let or and is bounded by a constant. If finding the characteristic formula within for a given finite loop-free process can be done in polynomial time when the output is given in explicit form, then .
Proof.
If the assumption of the proposition is true, the results of this paper allow us to decide trace equivalence of two finite loop-free processes in polynomial time. (For details, the reader can see [1, Appendix E.1].) Since trace equivalence for such processes is -complete [27, Theorem 2.7(1)], this implies that .
However, if output formulae are given in declarative form, then finding characteristic formulae within , where , , can be done in polynomial time.
Proposition 31.
For every , where , there is a polynomial-time algorithm that, given a finite loop-free process , outputs a formula in declarative form that is characteristic for within .
Proof.
The proof relies on inductive definitions of characteristic formulae within , where , given in [29, 6], and within , , given in [1, Appendix E.1]. These definitions guarantee that there are polynomial-time recursive procedures which construct characteristic formulae within . We prove the proposition for below.
Given a finite loop-free process , the characteristic formula for within is defined as follows: Consider the algorithm that recursively constructs . The algorithm has to construct and for every , yielding a linear number of equations. Moreover, for every , is of linear size in . If , then . Otherwise, , where is added because for every such that , is a conjunct of . Note that for every , if occurs in , it is considered to add to the size of . Therefore, is of linear size in . Using a similar argument, we can show that is of linear size. Thus, the algorithm constructs a linear number of equations, each of which is of linear size in . The proofs for , , are analogous.
Remark 32.
Note that the recursive procedures given in [29, 6] and [1, Appendix E.1] provide characteristic formulae for finite processes with loops provided that we enrich the syntax of our logics by adding greatest fixed points. See, for example, [6]. Consequently, constructing characteristic formulae for finite processes within , , , can be done in polynomial time.
We now present the complexity gap between finding characteristic formulae for preorders and , , and the same search problem for preorder . In the former case, there are characteristic formulae with both declaration size and equational length that are polynomial in the size of the processes they characterize, and they can be efficiently computed. On the contrary, for , even if characteristic formulae are always of polynomial declaration size and polynomial equational length, they cannot be efficiently computed unless .
Proposition 33.
Assume that for every finite loop-free process , there is a characteristic formula within for , denoted by , such that both and are in for some . Given a finite loop-free process , if can be computed in polynomial time, then .
Next, we prove that we do not expect that a finite loop-free process has always a short characteristic formula within when this is combined with a second condition. To show that statement, we need the following lemma.
Lemma 34.
For every finite and , iff and .
Proof.
If , then . Indeed, for every , it holds that and, trivially, . Moreover, . Symmetrically, . Conversely, if and , then , and we are done.
Proposition 35.
Assume that the following two conditions hold:
-
1.
For every finite loop-free process , there is a characteristic formula within for , denoted by , such that both and are in for some .
-
2.
Given a finite loop-free process and a formula in declarative form, deciding whether is characteristic for within is in .
Then .
Proof.
We describe an algorithm that decides non-membership in Sat and makes use of conditions 1 and 2 of the proposition. Let be an input CNF formula to Sat. Algorithm computes the DNF formula for which it needs to decide DNF-Tautology. Then, reduces DNF-Tautology to deciding trace equivalence of processes and constructed as described in the proof of [27, Theorem 2.7(1)]. can decide if by checking and because of Lemma 34. Finally, reduces (resp. ) to model checking: it needs to check whether (resp. ). To this end, guesses two formulae and in declarative form of polynomial declaration size and equational length, and two witnesses that verify that and are characteristic within for and , respectively. This can be done due to conditions 1 and 2. rejects the input iff both and are true.
5 A note on deciding characteristic formulae modulo equivalence relations
So far, we have studied the complexity of algorithmic problems related to characteristic formulae in the modal logics that characterize the simulation-based preorders in van Glabbeek’s spectrum. As shown in [3], those logics are powerful enough to describe characteristic formulae for each finite, loop-free process up to the preorder they characterize. It is therefore natural to wonder whether they can also express characteristic formulae modulo the kernels of those preorders. The following result indicates that the logics , where , have very weak expressive power when it comes to defining characteristic formulae modulo .
Proposition 36.
No formula in is characteristic for some process with respect to . For , a formula is characteristic for some process with respect to iff it is logically equivalent to .
Proof.
Assume, towards contradiction, that there is a formula in that is characteristic for some process with respect to . Let be the depth of and . Define process – that is, is a copy of with an additional path that has exactly -transitions. It is easy to see that , but . Since , it holds that . However, , which contradicts our assumption that is characteristic for with respect to . For , note that a formula is logically equivalent to iff it is satisfied only by processes without outgoing transitions, and so it is characteristic for any such process modulo . To prove that no formula is characteristic for some process with positive depth modulo or , a similar argument to the one for can be used. For , the action should be chosen such that for some .
For and , there are non-trivial characteristic formulae modulo and , respectively. For example, if , the formula is satisfied only by processes that are equivalent, modulo those equivalences, to process that has a single transition labelled with . Thus, is characteristic for modulo both and . We can use the following theorem as a tool to prove hardness of deciding characteristic formulae modulo some equivalence relation. Theorem 37 below is an extension of [2, Theorem 26], so that it holds for every such that a characteristic formula modulo exists, namely .
Theorem 37.
Let . Validity in reduces in polynomial time to deciding characteristic formulae with respect to .
Note that, from the results of Subsection 3.1, validity in with an unbounded action set, with , and with is -complete, whereas validity in with is -complete. Consequently, from Theorem 37, deciding whether a formula is characteristic modulo with an unbounded action set, with , and with is -hard. That problem is -hard modulo with .
6 Conclusions
In this paper, we studied the complexity of determining whether a formula is characteristic for some finite, loop-free process in each of the logics providing modal characterizations of the simulation-based semantics in van Glabbeek’s branching-time spectrum [22]. Since, as shown in [3], characteristic formulae in each of those logics are exactly the satisfiable and prime ones, we gave complexity results for the satisfiability and primality problems, and investigated the boundary between logics for which those problems can be solved in polynomial time and those for which they become computationally hard. Our results show that computational hardness already manifests itself in ready simulation semantics [10, 34] when the size of the action set is not a constant. Indeed, in that setting, the mere addition of formulae of the form to the logic that characterizes the simulation preorder yields a logic whose satisfiability and primality problems are -hard and -hard respectively. Moreover, we show that deciding primality in the logic characterizing 3-nested simulation is -hard in the presence of at least two actions.
Amongst others, we also studied the complexity of constructing characteristic formulae in each of the logics we consider, both when such formulae are presented in explicit form and in declarative form. In particular, one of our results identifies a sharp difference between trace simulation and the other semantics when it comes to constructing characteristic formulae. For all the semantics apart from trace simulation, there are characteristic formulae that have declaration size and equational length that are polynomial in the size of the processes they characterize and they can be efficiently computed. On the contrary, for trace simulation, even if characteristic formulae are always of polynomial declaration size and polynomial equational length, they cannot be efficiently computed, unless .
Our results are summarized in Table 2 and open several avenues for future research that we are currently pursuing. First of all, the precise complexity of primality checking is still open for the logics characterizing the -nested simulation semantics. We conjecture that checking primality in is -complete and that -completeness holds for -nested simulation when . Next, we plan to study the complexity of deciding whether formulae are characteristic in the extensions of the modal logics we have considered in this article with greatest fixed points. Indeed, in those extended languages, one can define characteristic formulae for finite processes. It is known that deciding whether a formula is characteristic is -complete for , but becomes -complete for its extension with fixed-point operators – see reference [2]. It would be interesting to see whether similar results hold for the other logics. Finally, building on the work presented in [3], we plan to study the complexity of the algorithmic questions considered in this article for (some of) the linear-time semantics in van Glabbeek’s spectrum.
References
- [1] Luca Aceto, Antonis Achilleos, Aggeliki Chalki, and Anna Ingólfsdóttir. The complexity of deciding characteristic formulae in van glabbeek’s branching-time spectrum. CoRR, abs/2405.13697, 2024. doi:10.48550/arXiv.2405.13697.
- [2] Luca Aceto, Antonis Achilleos, Adrian Francalanza, and Anna Ingólfsdóttir. The complexity of identifying characteristic formulae. J. Log. Algebraic Methods Program., 112:100529, 2020. doi:10.1016/j.jlamp.2020.100529.
- [3] Luca Aceto, Dario Della Monica, Ignacio Fábregas, and Anna Ingólfsdóttir. When are prime formulae characteristic? Theor. Comput. Sci., 777:3–31, 2019. doi:10.1016/j.tcs.2018.12.004.
- [4] Luca Aceto, Ignacio Fábregas, David de Frutos-Escrig, Anna Ingólfsdóttir, and Miguel Palomino. Graphical representation of covariant-contravariant modal formulae. In Bas Luttik and Frank Valencia, editors, Proceedings 18th International Workshop on Expressiveness in Concurrency, EXPRESS 2011, Aachen, Germany, 5th September 2011, volume 64 of EPTCS, pages 1–15, 2011. doi:10.4204/EPTCS.64.1.
- [5] Luca Aceto, Anna Ingólfsdóttir, Kim Guldstrand Larsen, and Jiri Srba. Reactive Systems: Modelling, Specification and Verification. Cambridge University Press, USA, 2007.
- [6] Luca Aceto, Anna Ingólfsdóttir, Paul Blain Levy, and Joshua Sack. Characteristic formulae for fixed-point semantics: a general framework. Math. Struct. Comput. Sci., 22(2):125–173, 2012. doi:10.1017/S0960129511000375.
- [7] Antonis Achilleos. The completeness problem for modal logic. In Proc. of Logical Foundations of Computer Science - International Symposium, LFCS 2018, volume 10703 of Lecture Notes in Computer Science, pages 1–21. Springer, 2018. doi:10.1007/978-3-319-72056-2_1.
- [8] Benjamin Bisping, David N. Jansen, and Uwe Nestmann. Deciding all behavioral equivalences at once: A game for linear-time-branching-time spectroscopy. Log. Methods Comput. Sci., 18(3), 2022. doi:10.46298/lmcs-18(3:19)2022.
- [9] Andreas Blass and Yuri Gurevich. On the unique satisfiability problem. Inf. Control., 55(1-3):80–88, 1982. doi:10.1016/S0019-9958(82)90439-9.
- [10] Bard Bloom, Sorin Istrail, and Albert R. Meyer. Bisimulation can’t be traced. J. ACM, 42(1):232–268, 1995. doi:10.1145/200836.200876.
- [11] Gérard Boudol and Kim Guldstrand Larsen. Graphical versus logical specifications. Theor. Comput. Sci., 106(1):3–20, 1992. doi:10.1016/0304-3975(92)90276-L.
- [12] Michael C. Browne, Edmund M. Clarke, and Orna Grumberg. Characterizing finite Kripke structures in propositional temporal logic. Theor. Comput. Sci., 59:115–131, 1988. doi:10.1016/0304-3975(88)90098-9.
- [13] Jin-yi Cai, Martin Fürer, and Neil Immerman. An optimal lower bound on the number of variables for graph identification. Comb., 12(4):389–410, 1992. doi:10.1007/BF01305232.
- [14] Rance Cleaveland. On automatically explaining bisimulation inequivalence. In Edmund M. Clarke and Robert P. Kurshan, editors, Computer Aided Verification, 2nd International Workshop, CAV ’90, volume 531 of Lecture Notes in Computer Science, pages 364–372. Springer, 1990. doi:10.1007/BFb0023750.
- [15] Rance Cleaveland and Bernhard Steffen. Computing behavioural relations, logically. In Javier Leach Albert, Burkhard Monien, and Mario Rodríguez-Artalejo, editors, Automata, Languages and Programming, 18th International Colloquium, ICALP91, Madrid, Spain, July 8-12, 1991, Proceedings, volume 510 of Lecture Notes in Computer Science, pages 127–138. Springer, 1991. doi:10.1007/3-540-54233-7_129.
- [16] David de Frutos-Escrig, Carlos Gregorio-Rodríguez, Miguel Palomino, and David Romero-Hernández. Unifying the linear time-branching time spectrum of process semantics. Log. Methods Comput. Sci., 9(2), 2013. doi:10.2168/LMCS-9(2:11)2013.
- [17] Rocco De Nicola and Frits W. Vaandrager. Three logics for branching bisimulation. J. ACM, 42(2):458–487, 1995. doi:10.1145/201019.201032.
- [18] Rodney G. Downey and Michael R. Fellows. Fixed-parameter tractability and completeness I: Basic results. SIAM J. Comput., 24(4):873–921, 1995. doi:10.1137/S0097539792228228.
- [19] Rodney G. Downey and Michael R. Fellows. Fundamentals of Parameterized Complexity. Texts in Computer Science. Springer, 2013. doi:10.1007/978-1-4471-5559-1.
- [20] Heinz-Dieter Ebbinghaus, Jörg Flum, and Wolfgang Thomas. Mathematical logic (2. ed.). Undergraduate texts in mathematics. Springer, 1994.
- [21] Jörg Flum and Martin Grohe. Parameterized Complexity Theory. Texts in Theoretical Computer Science. An EATCS Series. Springer, 2006. doi:10.1007/3-540-29953-X.
- [22] Rob J. van Glabbeek. The linear time - branching time spectrum I. In Jan A. Bergstra, Alban Ponse, and Scott A. Smolka, editors, Handbook of Process Algebra, pages 3–99. North-Holland / Elsevier, 2001. doi:10.1016/b978-044482830-9/50019-9.
- [23] Susanne Graf and Joseph Sifakis. A modal characterization of observational congruence on finite terms of CCS. Inf. Control., 68(1-3):125–145, 1986. doi:10.1016/S0019-9958(86)80031-6.
- [24] Jan Friso Groote and Frits W. Vaandrager. Structured operational semantics and bisimulation as a congruence. Inf. Comput., 100(2):202–260, 1992. doi:10.1016/0890-5401(92)90013-6.
- [25] Matthew Hennessy and Robin Milner. Algebraic laws for nondeterminism and concurrency. J. ACM, 32(1):137–161, 1985. doi:10.1145/2455.2460.
- [26] Sören Holmström. A refinement calculus for specifications in Hennessy-Milner logic with recursion. Formal Aspects Comput., 1(3):242–272, 1989. doi:10.1007/BF01887208.
- [27] Harry B. Hunt III, Daniel J. Rosenkrantz, and Thomas G. Szymanski. On the equivalence, containment, and covering problems for the regular and context-free languages. J. Comput. Syst. Sci., 12(2):222–268, 1976. doi:10.1016/S0022-0000(76)80038-4.
- [28] Neil Immerman. Descriptive Complexity. Springer, 1999. doi:10.1007/978-1-4612-0539-5.
- [29] Anna Ingolfsdottir, Jens Christian Godskesen, and Michael Zeeberg. Fra Hennessy-Milner logik til CCS-processer. Master’s thesis, Aalborg University, 1987. In Danish.
- [30] Sandra Kiefer, Pascal Schweitzer, and Erkal Selman. Graphs identified by logics with counting. In Giuseppe F. Italiano, Giovanni Pighizzini, and Donald Sannella, editors, Mathematical Foundations of Computer Science 2015 - 40th International Symposium, MFCS 2015, Milan, Italy, August 24-28, 2015, Proceedings, Part I, volume 9234 of Lecture Notes in Computer Science, pages 319–330. Springer, 2015. doi:10.1007/978-3-662-48057-1_25.
- [31] Dexter Kozen. Results on the propositional -calculus. Theor. Comput. Sci., 27:333–354, 1983. doi:10.1016/0304-3975(82)90125-6.
- [32] Richard E. Ladner. The computational complexity of provability in systems of modal propositional logic. SIAM J. Comput., 6(3):467–480, 1977. doi:10.1137/0206033.
- [33] Kim Guldstrand Larsen. Proof systems for satisfiability in Hennessy-Milner logic with recursion. Theor. Comput. Sci., 72(2&3):265–288, 1990. doi:10.1016/0304-3975(90)90038-J.
- [34] Kim Guldstrand Larsen and Arne Skou. Bisimulation through probabilistic testing. Inf. Comput., 94(1):1–28, 1991. doi:10.1016/0890-5401(91)90030-6.
- [35] Jan Martens and Jan Friso Groote. Computing minimal distinguishing Hennessy-Milner formulas is NP-hard, but variants are tractable. In Guillermo A. Pérez and Jean-François Raskin, editors, 34th International Conference on Concurrency Theory, CONCUR 2023, volume 279 of LIPIcs, pages 32:1–32:17. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023. doi:10.4230/LIPIcs.CONCUR.2023.32.
- [36] Robin Milner. An algebraic definition of simulation between programs. In D. C. Cooper, editor, Proceedings of the 2nd International Joint Conference on Artificial Intelligence, IJCAI 1971, pages 481–489. William Kaufmann, 1971. URL: http://ijcai.org/Proceedings/71/Papers/044.pdf.
- [37] Robin Milner. Communication and Concurrency. Prentice Hall, 1989.
- [38] Christos H. Papadimitriou and Mihalis Yannakakis. The complexity of facets (and some facets of complexity). J. Comput. Syst. Sci., 28(2):244–259, 1984. doi:10.1016/0022-0000(84)90068-0.
- [39] Bernhard Steffen and Anna Ingólfsdóttir. Characteristic formulae for processes with divergence. Inf. Comput., 110(1):149–163, 1994. doi:10.1006/inco.1994.1028.
- [40] Wolfgang Thomas. On the Ehrenfeucht-Fraïssé game in theoretical computer science. In Marie-Claude Gaudel and Jean-Pierre Jouannaud, editors, TAPSOFT’93: Theory and Practice of Software Development, International Joint Conference CAAP/FASE, volume 668 of Lecture Notes in Computer Science, pages 559–568. Springer, 1993. doi:10.1007/3-540-56610-4_89.