Minimal DFAs Witnessing Language Inequivalence

Martens, Jan

doi:10.4230/LIPIcs.CSL.2026.44

Jan Martens

Leiden Institute of Advanced Computer Science, Leiden University, The Netherlands

Abstract

We study small witnesses for the inequivalence of two regular languages. A natural witness is a distinguishing word, i.e. a word in exactly one of the two languages. We propose using more succinct witnesses in the form of witnessing DFAs. A witnessing DFA recognizes a subset of one of the languages and contains at least one distinguishing word. In this way the DFA expresses behaviour contained in the first language but not the second. We show witnessing DFAs can be used to present more concise witnesses for the inequivalence of two regular languages. We show that the decision problem for the existence of a witnessing DFA of a given size is NP-complete in general, and in P in the special case of unary DFAs. Besides these computational aspects, we study structural properties of witnessing DFAs. Not all languages can be a minimal witness. It turns out that minimal witnesses are exactly the languages that are not decomposable into a union of languages with smaller state complexity, the so-called prime languages as studied earlier by Kupferman and Mosheiff.

Keywords and phrases:

Deterministic Finite Automata, Language Inequivalence, DFA decomposition, Prime languages

2012 ACM Subject Classification:

Theory of computation $\rightarrow$ Regular languages

Editors:

Stefano Guerrini and Barbara König

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

We are motivated by computing witnesses that explain the difference in behaviour of two Deterministic Finite Automata (DFAs). In the formal verification of complex systems it can be key to compute witnesses and explanations for certain properties. Rather than having a computer merely report that two DFAs are inequivalent, we would like to compute a witness that explains why the DFAs differ. We focus on the smallest witnesses since simple explanations are usually the best explanations.

A straightforward approach to explain the inequivalence of two DFAs would be to provide a distinguishing word, i.e. a word that is accepted by one of the automata but not the other. The method of finding minimal length distinguishing words is well understood and decidable in polynomial time [15]. An efficient implementation is given in [19] that has the same runtime complexity as the best known algorithm for DFA minimization known as Hopcroft’s minimization [8].

A natural question is whether more powerful properties could result in smaller witnesses explaining the inequivalence of languages. Given two DFAs $A_{1}$ and $A_{2}$ , any property of the language recognized by $A_{1}$ that is not satisfied by the language of $A_{2}$ can be used to witness their inequivalence. We propose to use inclusion of a third language as the property. In this way a DFA $A$ satisfies the property $A_{p}$ if the language of $A_{p}$ is included in the language recognized by $A$ . In this work we study DFAs that witness inequivalence of two regular languages. In particular, a minimal witnessing DFA can be much more succinct than a distinguishing word.

The state complexity, also called index, of a language $L$ is the number of states of the minimal DFA recognizing $L$ . This descriptive complexity measure is well studied in literature, and forms a natural notion of size of a regular language. In this way properties expressed by DFAs are smaller witnesses that provide shorter and more intuitive explanation.

For example, consider the DFAs $\mathcal{A}$ and $\mathcal{B}$ shown in Figure 1. The shortest distinguishing word for these DFAs is $a^{7}$ . Indeed, we confirm $a^{7}\in\mathcal{L}(\mathcal{A})$ but $a^{7}\not\in\mathcal{L}(\mathcal{B})$ . A different explanation for the inequivalence of $\mathcal{A}$ and $\mathcal{B}$ could be: every odd length sequence of $a$ ’s is accepted. This property is satisfied by $\mathcal{A}$ and is not true for $\mathcal{B}$ .

We call a DFA a witness for two DFAs if the language it recognizes is a subset of the language of exactly one of the two DFAs. It is then said to distinguish the two languages. In the example from Figure 1, we see that the minimal witnessing DFA contains only two states, i.e. the DFA $A_{odd}$ such that $\mathcal{L}(A_{odd})=\{a^{2i+1}\mid i\in\mathbb{N}\}$ . An automaton recognizing only the minimal distinguishing word $a^{7}$ would contain at least eight states. Therefore, a witnessing DFA can be much more succinct than a distinguishing word.

In this paper we study witnessing DFAs. We show that there are families of DFAs for which there are witnessing DFAs of constant size, where the length of the minimal distinguishing word grows linearly in the size of the DFAs. Additionally, we prove that computing minimal witnessing DFAs is NP-complete. In the case of unary DFAs this problem is computable in polynomial time.

This last result is due to a connection of witnessing DFAs with so-called prime languages [12]. Prime languages are the languages for which there is no decomposition in languages of smaller DFAs. We show that a DFA is a minimal witness for inequivalence of regular languages if and only if it is prime.

Figure 1: The DFA $\mathcal{A}$ on the left and the DFA $\mathcal{B}$ on the right side.

The reduction from CNF-SAT that proves the NP-completeness of deciding the existence of witnessing DFAs encodes satisfying assignments in the structure of concatenation, instead of the composition used in other hardness proofs [11]. To show that this construction is more widely usable, we show how to modify the construction to obtain a simpler proof of the coNP-hardness of deciding the minimal pumping constant.

Structure

This work is structured as follows. First in Section 2 we cover the notation we use and introduce the main concept of witnessing DFAs. In Section 3 we show that computing minimal witnesses is NP-complete. After that, in Section 4, we demonstrate that the proof of the previous section can be more widely used in DFA decision problems. In Section 5 we discuss the connection between minimal witnessing DFAs, language compositionality, and so-called prime languages. In Section 6 we relate our results to the problem of separating words [4]. Finally, in Section 7 we introduce two polynomial time algorithms. The first algorithm computes a witnessing DFA that is not necessarily minimal. The second algorithm computes a minimal DFA that witnesses the non-inclusion of two unary languages. We end with conclusions and related work.

2 Notation & Preliminaries

For two natural numbers $i,j\in\mathbb{N}$ we write $[i,j]=\{i,i+1,\dots,j\}$ as the closed interval from $i$ to $j$ . Given a finite alphabet $\Sigma$ , a finite sequence of elements of $\Sigma$ is called a word. We write $\varepsilon$ for the empty word and define $\Sigma^{i}$ as the set of all words over $\Sigma$ of length $i$ , and $\Sigma^{*}=\bigcup_{i\in\mathbb{N}}\Sigma^{i}$ for all words over $\Sigma$ . We write $\Sigma^{+}=\Sigma\cdot\Sigma^{*}$ for all non-empty words over $\Sigma$ . Given words $u,v\in\Sigma^{*}$ , we write $u\cdot v$ and $u v$ for word concatenation. Additionally, given a number $i\in\mathbb{N}$ and a word $u\in\Sigma^{*}$ we write $u^{i}$ for the concatenation of $i$ times the word $u$ .

Definition 1.

A Deterministic Finite Automaton (DFA) $A=(Q,\Sigma,\delta,q_{0},F)$ is a five-tuple consisting of:

$\blacksquare$ $Q$ a finite set of states,
$\blacksquare$ $\Sigma$ a finite set of symbols called the alphabet,
$\blacksquare$ $\delta:Q\times\Sigma\rightarrow Q$ the transition function,
$\blacksquare$ $q_{0}\in Q$ the initial state, and
$\blacksquare$ $F\subseteq Q$ the set of final states.

The transition function $\delta$ extends naturally to a transition function for words $\delta^{*}:Q\times\Sigma^{*}\rightarrow Q$ . This is done inductively as follows:

$\delta^{*}(q,\varepsilon)=q$
$\delta^{*}(q,aw)=\delta^{*}(\delta(q,a),w).$

The language recognized by a DFA $A=(Q,\Sigma,\delta,q_{0},F)$ is denoted by $\mathcal{L}(A)$ , and consists of all words $w\in\Sigma^{*}$ such that $\delta^{*}(q_{0},w)\in F$ . A language $L\subseteq\Sigma^{*}$ is called regular iff there is a DFA $A$ such that $\mathcal{L}(A)=L$ . We write $\bar{A}$ for the DFA $\bar{A}=(Q,\Sigma,\delta,q_{0},Q\setminus F)$ , which recognizes the complement of $\mathcal{L}(A)$ .
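As a minimal sketch of Definition 1, the following Python code represents a DFA together with its extended transition function $\delta^{*}$ and the complement construction. The class name `DFA`, its field names, and the example automaton `A_odd` are illustrative choices rather than notation from this paper, and the transition table is assumed to be total.

```python
from dataclasses import dataclass

@dataclass
class DFA:
    states: frozenset
    alphabet: frozenset
    delta: dict          # maps (state, symbol) -> state; assumed to be total
    start: object
    finals: frozenset

    def delta_star(self, q, word):
        # delta*(q, eps) = q and delta*(q, a w) = delta*(delta(q, a), w)
        for a in word:
            q = self.delta[(q, a)]
        return q

    def accepts(self, word):
        return self.delta_star(self.start, word) in self.finals

    def complement(self):
        # Same structure, with final and non-final states swapped.
        return DFA(self.states, self.alphabet, self.delta, self.start,
                   self.states - self.finals)

# Two-state DFA accepting the odd-length words over {a}.
A_odd = DFA(frozenset({0, 1}), frozenset({"a"}),
            {(0, "a"): 1, (1, "a"): 0}, 0, frozenset({1}))
```

For instance, `A_odd.accepts("aaa")` holds while `A_odd.complement().accepts("aaa")` does not.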

The Myhill-Nerode theorem is a useful tool to establish the number of states necessary to recognize a language. It is based on the equivalence relation between words that have the exact same accepting extensions.

Definition 2.

Let $x,y\in\Sigma^{*}$ be words and $L\subseteq\Sigma^{*}$ a language, then $x\equiv_{L}y$ if and only if for all $z\in\Sigma^{*}$ it holds that $xz\in L\iff yz\in L$ .

Theorem 3 (Myhill-Nerode [16, 17]).

Let $L\subseteq\Sigma^{*}$ be a language, then $L$ is regular if and only if the relation $\equiv_{L}$ has a finite number of equivalence classes.

A more specific corollary of the theorem relates the number of equivalence classes of $\equiv_{L}$ to the smallest number of states a DFA needs in order to recognize $L$ . We refer to this number as the index of a language and write it as $\mathit{ind}(L)$ . For a DFA $A$ we write $\mathit{ind}(A)$ for the index of the language accepted by $A$ .

Corollary 4.

Let $L$ be a regular language over an alphabet $\Sigma$ , then the smallest DFA $A$ that recognizes $L$ has $\mathit{ind}(L)=k$ states where $k$ is the number of equivalence classes of the relation $\equiv_{L}$ .
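Corollary 4 suggests a direct way to compute $\mathit{ind}(\mathcal{L}(A))$ from any complete DFA $A$: discard unreachable states and merge states that accept the same extensions. The sketch below uses Moore-style partition refinement, which is one standard way to do this and not necessarily the method used elsewhere in this paper. The function name `index` and the tuple-free argument layout are assumptions made for illustration.

```python
def index(alphabet, delta, start, finals):
    """Number of states of the minimal complete DFA for the given total DFA."""
    # Step 1: keep only states reachable from the start state.
    reach, todo = {start}, [start]
    while todo:
        q = todo.pop()
        for a in alphabet:
            r = delta[(q, a)]
            if r not in reach:
                reach.add(r)
                todo.append(r)
    # Step 2: refine the accepting / non-accepting split until it is stable.
    symbols = sorted(alphabet)
    block = {q: int(q in finals) for q in reach}
    while True:
        ids = {}
        new = {}
        for q in reach:
            sig = (block[q],) + tuple(block[delta[(q, a)]] for a in symbols)
            new[q] = ids.setdefault(sig, len(ids))
        if len(ids) == len(set(block.values())):
            return len(ids)      # no block was split, so the partition is stable
        block = new

# A six-state cycle accepting {a^(3i)}: states 0 and 3 are final, so the
# automaton is not minimal.
cycle6 = {(q, "a"): (q + 1) % 6 for q in range(6)}
```

Here `index({"a"}, cycle6, 0, {0, 3})` evaluates to 3, matching the three Myhill-Nerode classes of $\{a^{3i}\mid i\in\mathbb{N}\}$.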

Now we introduce the notion of witnessing DFA for the inequivalence of languages.

Definition 5.

Given DFAs $A_{1},A_{2}$ such that $\mathcal{L}(A_{1})\neq\mathcal{L}(A_{2})$ , a DFA $B$ is said to witness the language inequivalence iff:

\mathcal{L}(B)\subseteq\mathcal{L}(A_{1})\iff\mathcal{L}(B)\not\subseteq\mathcal{L}(A_{2}).

A minimal witness for the inequivalence of $A_{1}$ and $A_{2}$ is a witness that is minimal (in size) amongst all witnesses. Of course, for DFAs $A_{1}$ and $A_{2}$ such that $\mathcal{L}(A_{1})\not\subseteq\mathcal{L}(A_{2})$ , the DFA $A_{1}$ itself is already a witnessing DFA. We are interested in small witnesses for language inequivalence, since in general small DFAs express the most intuitive properties.
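The condition of Definition 5 can be checked mechanically with a product construction: $\mathcal{L}(B)\subseteq\mathcal{L}(A)$ fails exactly when some reachable pair of states is final in $B$ and non-final in $A$. The following sketch assumes complete DFAs over a shared alphabet, encoded as tuples `(alphabet, delta, start, finals)`; this encoding, the function names, and the example automata are hypothetical and chosen only for illustration.

```python
def included(A, B):
    """Decide L(A) subseteq L(B) for complete DFAs over the same alphabet."""
    sigma, dA, sA, fA = A
    _, dB, sB, fB = B
    seen, todo = {(sA, sB)}, [(sA, sB)]
    while todo:
        p, q = todo.pop()
        if p in fA and q not in fB:
            return False             # some word is accepted by A but not by B
        for a in sigma:
            nxt = (dA[(p, a)], dB[(q, a)])
            if nxt not in seen:
                seen.add(nxt)
                todo.append(nxt)
    return True

def is_witness(B, A1, A2):
    # L(B) subseteq L(A1)  iff  L(B) not subseteq L(A2)
    return included(B, A1) != included(B, A2)

# Illustrative unary automata (not taken from the figures of the paper).
A1 = ("a", {(q, "a"): (q + 1) % 4 for q in range(4)}, 0, {1, 2, 3})   # length not divisible by 4
A2 = ("a", {(q, "a"): min(q + 1, 4) for q in range(5)}, 0, {0, 1, 2, 4})  # length != 3
A_odd = ("a", {(0, "a"): 1, (1, "a"): 0}, 0, {1})                      # odd length
EMPTY = ("a", {(0, "a"): 0}, 0, set())                                  # empty language
```

With these definitions `is_witness(A_odd, A1, A2)` holds, whereas the automaton for the empty language is included in both languages and therefore witnesses nothing.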

The notion of minimal witnessing DFAs naturally leads to the following decision problem.

$k$ -DFA-DIST:

Let $A_{1}$ and $A_{2}$ be DFAs such that $\mathcal{L}(A_{1})\neq\mathcal{L}(A_{2})$ , and $k\in\mathbb{N}$ a number. Decide if there is a DFA $A_{\textit{dist}}$ with at most $k$ states such that:

\mathcal{L}(A_{\textit{dist}})\subseteq\mathcal{L}(A_{1})\iff\mathcal{L}(A_{\textit{dist}})\not\subseteq\mathcal{L}(A_{2}).

3 Minimal witnesses are NP-complete

First, we observe that DFA witnesses for language inequivalence can be arbitrarily smaller than distinguishing words. We show this by generalizing the construction from the example in the introduction (Figure 1).

Example 6.

For a number $n\in\mathbb{N}\setminus\{0\}$ , let $\mathcal{A}_{n}$ , $\mathcal{B}_{n}$ be DFAs defined such that:

$\mathcal{L}(\mathcal{A}_{n})=\{a^{c}\mid c\in\mathbb{N}\text{ and }c\neq i\cdot 2^{n}\text{ for all }i\in\mathbb{N}\}$
$\mathcal{L}(\mathcal{B}_{n})=\{a\}\cup\{a^{c}\mid c\in\mathbb{N}\text{ and }c\neq 1+i\cdot(2^{n}-1)\text{ for all }i\in\mathbb{N}\setminus\{0\}\}.$

The minimal automata recognizing $\mathcal{L}(\mathcal{A}_{n})$ (resp. $\mathcal{L}(\mathcal{B}_{n})$ ) have $2^{n}$ (resp. $2^{n}+1$ ) states. The smallest word accepted by $\mathcal{A}_{n}$ and not by $\mathcal{B}_{n}$ has length $2^{n+1}-1$ . In this case the DFA $A_{odd}$ accepting all odd length sequences of $a$ also acts as a witnessing DFA. Hence, a minimal witnessing DFA is $A_{odd}$ , i.e. the DFA such that $\mathcal{L}(A_{odd})=\{a^{2i+1}\mid i\in\mathbb{N}\}$ . The DFA $A_{odd}$ contains only $2$ states, and thus the minimal witnessing DFA is asymptotically smaller than the minimal distinguishing word.
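The numbers in Example 6 can be checked for small $n$ with the following sketch. It reads the two set-builder conditions as "for all $i$", which is an interpretation, and encodes a unary DFA as a successor list plus a set of final states with start state `0`; the helper names are hypothetical.

```python
def build_A(n):
    m = 2 ** n
    succ = [(q + 1) % m for q in range(m)]        # one cycle of length 2^n
    return succ, {q for q in range(m) if q != 0}  # reject multiples of 2^n

def build_B(n):
    m = 2 ** n
    succ = [q + 1 for q in range(m)] + [2]        # state 2^n returns to state 2
    return succ, set(range(m))                    # reject exactly state 2^n

def accepts(dfa, length):
    succ, finals = dfa
    q = 0
    for _ in range(length):
        q = succ[q]
    return q in finals

def shortest_in_A_not_B(n, bound=10_000):
    a, b = build_A(n), build_B(n)
    return next(c for c in range(bound)
                if accepts(a, c) and not accepts(b, c))

def odd_words_witness(n, bound=1_000):
    # Every odd length is accepted by A_n, and some odd length is rejected by B_n.
    a, b = build_A(n), build_B(n)
    odd = range(1, bound, 2)
    return all(accepts(a, c) for c in odd) and any(not accepts(b, c) for c in odd)
```

For $n=3$ the call `shortest_in_A_not_B(3)` returns 15, in line with the word $a^{15}$ mentioned for $\mathcal{A}_{3}$ and $\mathcal{B}_{3}$, while the automata have $8$ and $9$ states.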

The example automata $\mathcal{A}_{3}$ and $\mathcal{B}_{3}$ are shown in Figure 2.

Figure 2: Example automata $\mathcal{A}_{3}$ (left) and $\mathcal{B}_{3}$ (right). The minimal witnessing word is $a^{15}$ whereas the minimal witnessing DFA contains $2$ states.

Next, we show the NP-completeness of computing minimal witnessing DFAs for language inequivalence. We prove $k$ -DFA-DIST is NP-hard by a reduction from CNF-SAT.

Theorem 7.

Deciding $k$ -DFA-DIST is NP-complete.

Before we introduce the reduction we define some notation in which we encode truth values of propositions. In the reduction we represent truth assignments as words over the Boolean alphabet $\mathbb{B}=\{\mathtt{0},\mathtt{1}\}$ . Given a set of propositional variables $\mathit{Prop}=\{p_{1},\dots,p_{k}\}$ , a truth assignment $\rho:\mathit{Prop}\to\mathbb{B}$ is represented by the word $a_{1}\dots a_{k}\in\mathbb{B}^{k}$ , where $a_{i}=\rho(p_{i})$ for every $i\in[1,k]$ . The set $\mathcal{X}=\mathbb{B}^{k}$ defines all words that represent truth assignments.

Now we are ready to introduce our reduction from CNF-SAT in order to prove Theorem 7. Let $\phi=C_{1}\wedge\dots\wedge C_{n}$ be a CNF formula over the propositional variables $\mathit{Prop}=\{p_{1},\dots,p_{k}\}$ . We define two regular languages over the alphabet $\Sigma=\mathbb{B}\cup\{\sharp\}$ . The first language $L^{-}_{\phi}\subseteq\Sigma^{*}$ is the finite set of words consisting of at most $n$ concatenated truth assignments, each followed by a $\sharp$ , i.e.

L^{-}_{\phi}=\{w_{1}\sharp\dots w_{j}\sharp\mid j\in[1,n]\text{ and }w_{1},\dots,w_{j}\in\mathcal{X}\}.

The second language $L^{+}_{\phi}\subseteq\Sigma^{*}$ is a superset of $L^{-}_{\phi}$ . In addition to all the words of $L^{-}_{\phi}$ , the language $L^{+}_{\phi}$ contains words of more than $n$ concatenated truth assignments. Specifically, $L^{+}_{\phi}$ includes all words consisting of $n$ or more concatenated truth assignments of which the first consecutively satisfy the clauses $C_{1},\dots,C_{n}$ . More precisely,

L^{+}_{\phi}=L^{-}_{\phi}\cup\{w_{1}\sharp\cdots w_{j}\sharp\mid j\geq n,w_{1},\dots,w_{j}\in\mathcal{X}\text{ and for all }i\in[1,n]:w_{i}\text{ satisfies }C_{i}\}.

We sketch the shape of the languages with an example using an unsatisfiable formula.

Example 8.

Consider the CNF-formula $\mathcal{C}=p_{1}\wedge\neg p_{1}$ over the propositional letters $\mathit{Prop}=\{p_{1}\}$ . The language $L_{\mathcal{C}}^{-}$ is the language containing the words:

$\blacksquare$ $b\sharp$ for $b\in\{\mathtt{0},\mathtt{1}\}$ , and
$\blacksquare$ $b\sharp b^{\prime}\sharp$ for $b,b^{\prime}\in\{\mathtt{0},\mathtt{1}\}$ .

The language $L_{\mathcal{C}}^{+}$ contains:

$\blacksquare$ $w$ if $w\in L_{\mathcal{C}}^{-}$ , and
$\blacksquare$ $\mathtt{1}\sharp\mathtt{0}\sharp w$ for every $w\in(\mathcal{X}\sharp)^{+}$ .

Since $\mathtt{1}$ is the only truth assignment for $p_{1}$ that satisfies $C_{1}=p_{1}$ and only $\mathtt{0}$ satisfies $C_{2}=\neg p_{1}$ , the only words that witness the inequivalence $L_{\mathcal{C}}^{+}\neq L_{\mathcal{C}}^{-}$ are of the shape $\mathtt{1}\sharp\mathtt{0}\sharp w$ , for $w\in(\mathcal{X}\sharp)^{+}$ .
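The two languages of the reduction can be made concrete with membership tests. The sketch below is only an illustration of the definitions: it writes $\sharp$ as `#`, encodes a clause as a list of non-zero integers (`i` for $p_{i}$ and `-i` for $\neg p_{i}$), and uses hypothetical function names. It decides membership directly and does not construct the automata of the reduction.

```python
def blocks(word, k):
    """Split w_1# ... w_j# (j >= 1) into [w_1, ..., w_j]; None if malformed."""
    if not word or len(word) % (k + 1) != 0:
        return None
    parts = [word[i:i + k + 1] for i in range(0, len(word), k + 1)]
    if all(p[-1] == "#" and set(p[:-1]) <= {"0", "1"} for p in parts):
        return [p[:-1] for p in parts]
    return None

def satisfies(w, clause):
    # Literal l holds if variable |l| has value 1 exactly when l is positive.
    return any((w[abs(l) - 1] == "1") == (l > 0) for l in clause)

def in_L_minus(word, cnf, k):
    ws = blocks(word, k)
    return ws is not None and len(ws) <= len(cnf)

def in_L_plus(word, cnf, k):
    ws = blocks(word, k)
    if ws is None:
        return False
    n = len(cnf)
    return len(ws) <= n or all(satisfies(ws[i], cnf[i]) for i in range(n))

# The formula of Example 8: p_1 and not p_1, over a single variable.
example8 = [[1], [-1]]
```

For the formula of Example 8, `in_L_plus("1#0#1#", example8, 1)` holds and `in_L_minus("1#0#1#", example8, 1)` does not, while `"0#0#1#"` belongs to neither language.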

The languages $L^{-}_{\phi}$ and $L^{+}_{\phi}$ are regular, and hence there are automata that recognize these languages. In particular, there are automata recognizing these languages that are polynomial in size. One way of observing this fact is by inspecting the number of Myhill–Nerode equivalence classes of $L^{+}_{\phi}$ and $L^{-}_{\phi}$ .

Lemma 9.

Given a CNF formula $\phi$ , the languages $L_{\phi}^{+}$ and $L_{\phi}^{-}$ are recognizable by an automaton that is polynomial in the size of $\phi$ .

The next lemma proves the key fact of our reduction. Given a satisfiable CNF formula $\phi$ over the propositional letters $\mathit{Prop}=\{p_{1},\dots,p_{k}\}$ , the language $\{(w_{\rho}\sharp)^{i}\mid i\in\mathbb{N}\}$ , formed by iterating arbitrarily often a satisfying truth assignment $\rho$ represented as the word $w_{\rho}\in\{\mathtt{0},\mathtt{1}\}^{k}$ , is recognized by a small distinguishing automaton for the languages $L_{\phi}^{+}$ and $L_{\phi}^{-}$ . Conversely, a distinguishing automaton smaller than a certain size necessarily implies the existence of a satisfying truth assignment for $\phi$ .

Lemma 10.

Let $\phi=C_{1}\wedge\dots\wedge C_{n}$ be a CNF formula over $k$ propositional letters $\mathit{Prop}=\{p_{1},\dots,p_{k}\}$ . Then $\phi$ is satisfiable if and only if there is a DFA $A_{\textit{dist}}$ with at most $k+2$ states such that $\mathcal{L}(A_{\textit{dist}})\subseteq L^{+}_{\phi}$ and $\mathcal{L}(A_{\textit{dist}})\not\subseteq L^{-}_{\phi}$ .

Proof.

We prove both directions of the implication separately.

( $\Rightarrow$ )

Assume $\phi$ is satisfiable, then there is a satisfying truth assignment $\rho$ that is mapped to the word $w_{\rho}=\rho(p_{1})\dots\rho(p_{k})\in\mathcal{X}$ . We define the language $L_{\textit{dist}}=\{(w_{\rho}\cdot\sharp)^{i}\mid i\in\mathbb{N}\}$ , and show that $L_{\textit{dist}}$ witnesses this implication.

First we show that $L_{\textit{dist}}\subseteq L^{+}_{\phi}$ . Assume $i\in\mathbb{N}$ , if $i\leqslant n$ then by definition $(w_{\rho}\cdot\sharp)^{i}\in L_{\phi}^{-}$ and hence also $(w_{\rho}\cdot\sharp)^{i}\in L_{\phi}^{+}$ . If $i>n$ , since $\rho$ is a satisfying assignment, it holds for any $w^{\prime}\in\{w\sharp\mid w\in\mathcal{X}\}^{*}$ that $(w_{\rho}\cdot\sharp)^{n}w^{\prime}\in L^{+}_{\phi}$ , and thus also $(w_{\rho}\cdot\sharp)^{n}(w_{\rho}\cdot\sharp)^{i-n}\in L^{+}_{\phi}$ . By covering both cases this means $L_{\textit{dist}}\subseteq L^{+}_{\phi}$ .

Next, we observe that $(w_{\rho}\cdot\sharp)^{n+1}\not\in L^{-}_{\phi}$ , and thus $L_{\textit{dist}}\not\subseteq L^{-}_{\phi}$ . Hence, since $L_{\textit{dist}}\subseteq L^{+}_{\phi}$ any DFA that recognizes $L_{\textit{dist}}$ is a distinguishing automaton.

The minimal DFA $A_{\textit{dist}}$ such that $\mathcal{L}(A_{\textit{dist}})=L_{\textit{dist}}$ contains one loop with $k+1$ states containing all positions of the word $w_{\rho}\cdot\sharp$ and a sink state to reject all other words. Thus, if $\phi$ is satisfiable we can construct $A_{\textit{dist}}$ with $k+2$ states that distinguishes $L^{+}_{\phi}$ and $L^{-}_{\phi}$ , which was to be shown.

( $\Leftarrow$ )

We assume $A_{\textit{dist}}$ is a DFA with at most $k+2$ states such that for the language accepted $\hat{L}=\mathcal{L}(A_{\textit{dist}})$ it holds that $\hat{L}\subseteq L_{\phi}^{+}$ and $\hat{L}\not\subseteq L_{\phi}^{-}$ . We show that this means $\phi$ is satisfiable.

Since $\hat{L}\setminus L_{\phi}^{-}\neq\emptyset$ and $L_{\phi}^{-}\subseteq L_{\phi}^{+}$ there is a word $w\in L^{+}_{\phi}\setminus L^{-}_{\phi}$ accepted by $A_{\textit{dist}}$ . By definition $w$ is of shape $w=w_{1}\sharp\dots w_{n}\sharp w^{\prime}$ where $w^{\prime}\in\Sigma^{+}$ and $w_{1},\dots,w_{n}\in\mathcal{X}$ and for every $i\in[1,n]$ the word $w_{i}$ represents a satisfying truth assignment for the clause $C_{i}$ . Next we show that $w_{1}$ represents a satisfying truth assignment for $\phi$ by counting the number of equivalence classes of $\equiv_{\hat{L}}$ for the prefixes of $w_{1}\cdot\sharp$ , together with the postfix $w_{post}=w_{2}\sharp\dots w_{n}\sharp w^{\prime}$ that witnesses an accepting postfix for $w_{1}\sharp$ .

We define the set $U$ as the set containing all prefixes of $w_{1}=a_{1}\dots a_{k}$ , i.e.

U=\{\varepsilon\}\cup\{a_{1}\dots a_{j}\mid j\in[1,k]\}.

If $v,u\in U$ and $v\neq u$ then $v\not\equiv_{\hat{L}}u$ , since there is a $\sigma\in\Sigma^{*}$ such that $v\sigma=w$ and $w\in\hat{L}$ and $u\sigma\not\in\hat{L}$ . This means there are $|U|=k+1$ distinct classes of ${\equiv_{\hat{L}}}$ . Lastly, since $\sharp z\not\in\hat{L}$ for any $z\in\Sigma^{*}$ we can also conclude that $\sharp\not\equiv_{\hat{L}}u$ for all $u\in U$ .

Since we assumed that $A_{\textit{dist}}$ has at most $k+2$ states, by Corollary 4 there are at most $k+2$ equivalence classes of ${\equiv_{\hat{L}}}$ . Since trivially $w_{1}\sharp\not\equiv_{\hat{L}}\sharp$ , by the pigeonhole principle there is a prefix $u\in U$ such that $w_{1}\sharp\equiv_{\hat{L}}u$ .

It can not be the case that $u=a_{1}\dots a_{i}$ for some $i\in[1,k]$ , since

$a_{1}\dots a_{i}\cdot a_{i+1}\dots a_{k}\sharp w_{post}\in\hat{L}$
$w_{1}\sharp\cdot a_{i+1}\dots a_{k}\sharp w_{post}\not\in\hat{L}.$

By eliminating all alternatives we conclude $u=\varepsilon$ , i.e. $w_{1}\sharp\equiv_{\hat{L}}\varepsilon$ . Using this equivalence and since $\varepsilon\cdot w_{1}\sharp w_{post}\in\hat{L}$ we derive that $w_{1}\sharp\cdot w_{1}\sharp w_{post}\in\hat{L}$ . Repeating this step shows in particular that $(w_{1}\sharp)^{n}\cdot w_{post}\in\hat{L}\subseteq L_{\phi}^{+}$ . By definition of $L_{\phi}^{+}$ this means that the truth assignment $w_{1}$ satisfies all clauses $C_{1},\dots,C_{n}$ and hence it is a satisfying assignment for $\phi$ . This witnesses that $\phi$ is a satisfiable formula.

$\hfill\blacktriangleleft$
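The forward direction of Lemma 10 can be illustrated on a small formula. The sketch below builds the loop-plus-sink automaton for a given word $w_{\rho}$ and compares it with membership tests for $L^{+}_{\phi}$ and $L^{-}_{\phi}$. The encoding ($\sharp$ written as `#`, clauses as lists of signed integers), the function names, and the example formula are assumptions made for illustration. The check ranges over $i\geq 1$ only, since the displayed definition of $L^{-}_{\phi}$ takes $j\in[1,n]$ and so does not mention the empty word.

```python
def build_A_dist(w_rho):
    """DFA with k+2 states for {(w_rho #)^i}: a loop of k+1 states plus a sink."""
    loop = w_rho + "#"
    sink = len(loop)
    delta = {(sink, a): sink for a in "01#"}
    for q in range(len(loop)):
        for a in "01#":
            delta[(q, a)] = (q + 1) % len(loop) if a == loop[q] else sink
    return delta, 0, {0}, sink + 1       # transitions, start, finals, state count

def accepts(dfa, word):
    delta, q, finals, _ = dfa
    for a in word:
        q = delta[(q, a)]
    return q in finals

def split(word, k):
    parts = [word[i:i + k + 1] for i in range(0, len(word), k + 1)]
    ok = word and all(len(p) == k + 1 and p[-1] == "#" and "#" not in p[:-1]
                      for p in parts)
    return [p[:-1] for p in parts] if ok else None

def sat(w, clause):
    return any((w[abs(l) - 1] == "1") == (l > 0) for l in clause)

def in_minus(word, cnf, k):
    ws = split(word, k)
    return ws is not None and len(ws) <= len(cnf)

def in_plus(word, cnf, k):
    ws = split(word, k)
    return ws is not None and (len(ws) <= len(cnf)
                               or all(sat(w, c) for w, c in zip(ws, cnf)))

# Hypothetical formula (p1 or p2) and (not p1 or p2); rho sets p1 = 0, p2 = 1.
cnf, k, w_rho = [[1, 2], [-1, 2]], 2, "01"
dfa = build_A_dist(w_rho)
iterates = [(w_rho + "#") * i for i in range(1, 7)]
```

Every word in `iterates` is accepted by `dfa` and lies in $L^{+}_{\phi}$, the third iterate already falls outside $L^{-}_{\phi}$, and the automaton has $k+2=4$ states.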

This lemma allows us to prove Theorem 7.

Proof of Theorem 7.

Membership in NP follows naturally. For two DFAs $A_{1}$ and $A_{2}$ we can, in polynomial time, check if $\mathcal{L}(A_{1})\subseteq\mathcal{L}(A_{2})$ . This can be done by checking the emptiness of $\mathcal{L}(A_{1})\cap\overline{\mathcal{L}(A_{2})}$ . Moreover, either $A_{1}$ or $A_{2}$ itself necessarily already is a distinguishing automaton, so the minimal distinguishing DFA is polynomial in size.

NP-hardness is a direct consequence of Lemma 10 and of the fact that $L_{\phi}^{-}\subseteq L_{\phi}^{+}$ , so the language of any distinguishing automaton is a subset of $L_{\phi}^{+}$ and not vice-versa. $\hfill\blacktriangleleft$

4 Minimal Pumping Length

The pumping lemma is best known for its educational use in automata theory. The lemma asserts that in a regular language every word of at least a certain length can be pumped. A word in a language can be pumped if a non-empty part of it can be arbitrarily repeated while remaining in the language.

Definition 11 (Pumping lemma).

Let $L\subseteq\Sigma^{*}$ be a regular language. There is a $p\in\mathbb{N}$ such that for all words $w\in L$ if $|w|\geq p$ then there is a decomposition $w=xyz$ such that $|xy|\leqslant p$ , $|y|\geq 1$ and for all $i\in\mathbb{N}$ : $xy^{i}z\in L$ .
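For a language given by a complete DFA, the condition "$xy^{i}z\in L$ for all $i\in\mathbb{N}$" can be tested exactly, because the states reached after $xy^{i}$ repeat within $|Q|$ iterations. The following sketch checks a single word against a candidate constant $p$; it does not decide whether the lemma holds for all words of the language. The tuple layout `(delta, start, finals, n_states)` and the function names are hypothetical.

```python
def run(delta, q, word):
    for a in word:
        q = delta[(q, a)]
    return q

def pumps(dfa, x, y, z):
    """Exact test of: x y^i z is accepted for every i >= 0."""
    delta, start, finals, n_states = dfa
    q = run(delta, start, x)
    for _ in range(n_states + 1):    # states after x y^i repeat within |Q| steps
        if run(delta, q, z) not in finals:
            return False
        q = run(delta, q, y)
    return True

def word_pumpable(dfa, w, p):
    """Is there w = xyz with |xy| <= p, |y| >= 1 and x y^i z accepted for all i?"""
    return any(pumps(dfa, w[:s], w[s:e], w[e:])
               for e in range(1, min(p, len(w)) + 1)
               for s in range(e))

# Odd-length words over {a}: the word aaa can be pumped with y = aa but not with y = a.
A_odd = ({(0, "a"): 1, (1, "a"): 0}, 0, {1}, 2)
```

For this automaton `word_pumpable(A_odd, "aaa", 2)` holds, while `word_pumpable(A_odd, "aaa", 1)` does not.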

Recently, the computational complexity of deciding the minimal pumping length was studied in [6]. The decision problem associated with the pumping lemma is to decide if the lemma holds for a certain pumping constant $p$ .

$k$ -PUMPING: Let $A$ be a DFA, and $k\in\mathbb{N}$ a number. Decide if the pumping lemma holds for $\mathcal{L}(A)$ with constant $p=k$ .

In [6] it was shown that computing the minimal pumping constant $p$ of the lemma is coNP-hard.

Theorem 12 ([6, Cor. 15]).

$k$ -PUMPING is coNP-complete.

The proof of the coNP-hardness of $k$ -PUMPING goes by a reduction from Hamiltonian cycle. A slightly modified language based on $L^{+}_{\phi}$ from Section 3 gives an alternative proof of this fact. The reduction is directly from the tautology problem for DNF formulas, which is a canonical coNP-complete problem.

Let the language $L_{\phi}^{\textit{dnf}}$ be a language similar to the one previously mentioned, but for a propositional formula in Disjunctive Normal Form (DNF). Here, a formula $\phi$ is said to be in disjunctive normal form if $\phi=C_{1}\vee\dots\vee C_{n}$ where each $C_{i}$ is a conjunction of literals. In addition to the words of $L^{-}_{\phi}$ , the language contains all words in $w_{1}\sharp\dots w_{n}\sharp\cdot\{w\sharp\mid w\in\mathcal{X}\}^{*}$ for which there is an $i\in[1,n]$ such that the truth assignment $w_{i}$ satisfies $C_{i}$ , i.e.
$L^{\textit{dnf}}_{\phi}=L^{-}_{\phi}\cup\{w_{1}\sharp\cdots w_{j}\sharp\mid j\geq n,w_{1},\dots,w_{j}\in\mathcal{X}\text{ and there is an }i\in[1,n]:w_{i}\text{ satisfies }C_{i}\}.$

Lemma 13.

Given a DNF formula $\phi=C_{1}\vee\dots\vee C_{n}$ , the language $L^{\textit{dnf}}_{\phi}$ is $(k+1)n$ -PUMPING if and only if $\phi$ is a tautology.

Proof.

( $\Rightarrow$ )

Let $w\in\mathcal{X}$ be a word representing a truth assignment. Since $L^{\textit{dnf}}_{\phi}$ is $(k+1)n$ -PUMPING, the pumping lemma holds for the word $(w\sharp)^{n}\in L^{-}_{\phi}\subseteq L^{\textit{dnf}}_{\phi}$ . Let $xyz=(w\sharp)^{n}$ be the decomposition as mentioned in the pumping lemma. Since $y$ is non-empty, and $xy^{2}z\in L^{\textit{dnf}}_{\phi}$ , by the structure of $L^{\textit{dnf}}_{\phi}$ it has to hold that $|y|=c(k+1)$ for some $c\in\mathbb{N}$ . Observe that any such $y$ is of the shape $w_{post}\sharp(w\sharp)^{*}w_{pre}$ , where $w=w_{pre}w_{post}$ . From this we can conclude that $xy^{2}z=(w\sharp)^{n+c}$ and thus, by definition of $L^{\textit{dnf}}_{\phi}$ , that $w$ represents a satisfying assignment for $\phi$ . In particular, since $w$ was arbitrarily chosen, every truth assignment satisfies $\phi$ and thus $\phi$ is a tautology.

( $\Leftarrow$ )

Assume $\phi$ is a tautology, and $w\in L^{\textit{dnf}}_{\phi}$ such that $|w|\geq(k+1)n$ . By definition the word can be represented as $w_{1}\sharp\dots w_{j}\sharp$ for some $j\geq n$ and truth assignments $w_{1},\dots,w_{j}\in\mathcal{X}$ . We distinguish on whether $|w|=(k+1)n$ .

$\blacksquare$ Assume $|w|=(k+1)n$ , then since $\phi$ is a tautology, there is a clause $C_{\ell}$ for which $w_{1}$ is a satisfying truth assignment. We verify that $w=xyz$ for the words $x=\varepsilon$ , $y=w_{1}\sharp\dots w_{\ell-1}\sharp$ and $z=w_{\ell}\sharp\dots w_{j}\sharp$ , and that $xy^{i}z\in L^{\textit{dnf}}_{\phi}$ for all $i\in\mathbb{N}$ .
$\blacksquare$ In the second case $|w|>(k+1)n$ . By construction of the language $w=w_{1}\sharp\dots w_{j}\sharp$ for some $j>n$ , and hence there is a word $w_{n+1}$ that represents a truth assignment. Since $\phi$ is a tautology, there is a clause $C_{\ell}$ such that the word $w_{n+1}$ represents a satisfying assignment for $C_{\ell}$ . We show that $x=w_{1}\sharp\dots w_{\ell-1}\sharp$ , $y=w_{\ell}\sharp\dots w_{n}\sharp$ , $z=w_{n+1}\sharp\dots w_{j}\sharp$ verifies the pumping lemma. Indeed, in $xy^{0}z$ the word $w_{n+1}$ satisfies $C_{\ell}$ and thus $xy^{0}z\in L_{\phi}^{\textit{dnf}}$ . For any $c\geq 1$ , the word $xy^{c}z$ starts with $w_{1}\sharp\dots w_{n}\sharp$ , and since $w\in L_{\phi}^{\textit{dnf}}$ also $xy^{c}z\in L_{\phi}^{\textit{dnf}}$ .

This shows that in both cases, the language $L_{\phi}^{\textit{dnf}}$ is $(k+1)n$ -PUMPING.

$\hfill\blacktriangleleft$

5 Minimal witnesses are primes

In this section we study the structure of minimal witnesses and show that minimal witnesses correspond to the prime languages [12]. Prime languages are those that cannot be decomposed into a union of languages with smaller state complexity. More generally, a language $L$ can be $k$ -decomposed if it is a union of languages of index at most $k$ . This hierarchy on regular languages was first studied in [12].

Definition 14 ( $k$ -composable).

A regular language $L$ is $k$ -composable if and only if there are DFAs $A_{1},\dots,A_{t}$ , such that $\mathit{ind}(A_{i})\leqslant k$ for all $i\in[1,t]$ , and

L=\bigcup_{i\in[1,t]}\mathcal{L}(A_{i}).

A DFA $A$ is called prime iff it is minimal and not $k$ -composable for any $k<\mathit{ind}(A)$ .

Formally, the focus in [12] is on intersection decomposition instead of union decomposition. For our purpose it makes more sense to look at union decomposition. All results for intersection compositionality can be trivially dualized for union composition, as already observed in [12]. Hence, this difference is only a matter of presentation. If a DFA $A$ is union composite, i.e. $\mathcal{L}(A)=\bigcup_{i\in[1,n]}\mathcal{L}(A_{i})$ for some DFAs $A_{1},\dots,A_{n}$ , then also $\overline{\mathcal{L}(A)}=\bigcap_{i\in[1,n]}\overline{\mathcal{L}(A_{i})}$ .

We show how the concept of prime languages is related to minimal witnesses. We start by the observation that not all DFAs are a minimal witness. This is illustrated by the following example.

Figure 3: Minimal automata recognizing $L_{2}$ (from $q_{bb}$ ), and the languages $L_{a}$ (from $p_{\varepsilon}$ ), and $L_{b}$ (from $r_{\varepsilon}$ ).

Example 15.

Consider the language $L_{2}=\{wa\sigma\mid w\in\Sigma^{*},\sigma\in\Sigma\}$ over the alphabet $\Sigma=\{a,b\}$ . The minimal automaton recognizing $L_{2}$ is shown in Figure 3. Observe that $\mathit{ind}(L_{2})=4$ . Given a language $L\subseteq\Sigma^{*}$ such that $L_{2}\not\subseteq L$ , there necessarily are a word $w\in\Sigma^{*}$ and a symbol $\sigma\in\Sigma$ such that $wa\sigma\not\in L$ . Let the language $L_{\sigma}$ be defined as $L_{\sigma}=\{wa\sigma\mid w\in\Sigma^{*}\}$ . Then $L_{\sigma}$ is a subset of $L_{2}$ and witnesses the non-inclusion $L_{2}\not\subseteq L$ . Since $\mathit{ind}(L_{\sigma})=3$ , we see that for the non-inclusion of $L_{2}$ in any language there is a witness of index smaller than the index of $L_{2}$ .

In general for each $k\in\mathbb{N}$ define the languages $L_{k}=\{wau\mid w\in\Sigma^{*},u\in\Sigma^{k}\}$ . The language $L_{k}$ always has a witness for non-inclusion with index exponentially smaller than the index of $L_{k}$ .
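The indices stated for $L_{2}$ , $L_{a}$ and $L_{b}$ in Example 15 can be recomputed with a short script. The sketch below starts from a (non-minimal) DFA whose states record the last two symbols read and counts the blocks of a stable partition; this is an illustration using standard partition refinement, and the names `STATES`, `step` and `index` are hypothetical.

```python
from itertools import product

SIGMA = "ab"
# States remember the last (up to) two symbols read; "" is the start state.
STATES = [""] + list(SIGMA) + ["".join(p) for p in product(SIGMA, repeat=2)]

def step(s, c):
    return (s + c)[-2:]

def index(is_final):
    """Number of states after minimizing the suffix automaton (all states reachable)."""
    block = {s: int(is_final(s)) for s in STATES}
    while True:
        ids, new = {}, {}
        for s in STATES:
            sig = (block[s],) + tuple(block[step(s, c)] for c in SIGMA)
            new[s] = ids.setdefault(sig, len(ids))
        if len(ids) == len(set(block.values())):
            return len(ids)
        block = new

ind_L2 = index(lambda s: len(s) == 2 and s[0] == "a")   # second-to-last symbol is a
ind_La = index(lambda s: s == "aa")                     # words ending in aa
ind_Lb = index(lambda s: s == "ab")                     # words ending in ab
```

The script yields `ind_L2 == 4` and `ind_La == ind_Lb == 3`, and the final states of $L_{2}$ are exactly the union of those of $L_{a}$ and $L_{b}$ .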

A natural question is when a DFA $A$ is a minimal witness for the non-inclusion of some languages $L\not\subseteq L^{\prime}$ . In the example above, the language $L_{2}$ is not prime. In fact, $L_{2}$ is $3$ -decomposable into the automata recognizing the languages $L_{\sigma}=\{wa\sigma\mid w\in\Sigma^{*}\}$ for each $\sigma\in\Sigma$ , since $L_{2}=\bigcup_{\sigma\in\Sigma}L_{\sigma}$ . Next, we prove that prime languages characterize exactly the languages that are minimal witnesses.

Theorem 16.

Given a DFA $A$ , $A$ is a minimal witness for the non-inclusion of some DFAs $B_{1}$ in $B_{2}$ , i.e. $\mathcal{L}(B_{1})\not\subseteq\mathcal{L}(B_{2})$ if and only if $A$ is prime.

Proof.

( $\Leftarrow$ )

Given a prime DFA $A$ , we will show that there are DFAs $B_{1}$ and $B_{2}$ such that $A$ is a minimal witness for their non-inclusion. First we define $\alpha(A)$ the floor of $A$ , by

\alpha(A)=\{A^{\prime}\mid\mathit{ind}(A^{\prime})<\mathit{ind}(A)\text{ and }\mathcal{L}(A^{\prime})\subseteq\mathcal{L}(A)\}.

Now from [12, Theorem 2.2] it holds that since $A$ is prime, there is a primality witness $w\in\mathcal{L}\left(A\right)$ such that $w\not\in\bigcup_{A^{\prime}\in\alpha(A)}\mathcal{L}(A^{\prime})$ . Let $B_{1}=A$ and let $B_{2}$ be a DFA with $\mathcal{L}(B_{2})=\Sigma^{*}\setminus\{w\}$ . Now, for all DFAs $A^{\prime}\in\alpha(A)$ , in other words all DFAs such that $\mathcal{L}(A^{\prime})\subseteq\mathcal{L}(A)$ and $\mathit{ind}(A^{\prime})<\mathit{ind}(A)$ , necessarily $w\not\in\mathcal{L}(A^{\prime})$ and therefore $\mathcal{L}(A^{\prime})\subseteq\mathcal{L}(B_{2})$ . Hence there is no smaller DFA than $A$ that witnesses $\mathcal{L}(A)\not\subseteq\mathcal{L}(B_{2})$ , and $A$ is a minimal witness for the non-inclusion $\mathcal{L}(A)\not\subseteq\mathcal{L}(B_{2})$ .

( $\Rightarrow$ )

Assume $A$ is a minimal witness for the DFAs $B_{1}$ and $B_{2}$ . Since $A$ is minimal, it holds for all DFAs $A^{\prime}\in\alpha(A)$ that $A^{\prime}$ is no witness for the non-inclusion of $B_{1}$ and $B_{2}$ . Hence, for all $A^{\prime}\in\alpha(A)$ it holds that $\mathcal{L}(A^{\prime})\subseteq\mathcal{L}(B_{2})$ . Since $\mathcal{L}(A)\not\subseteq\mathcal{L}(B_{2})$ there is a primality witness $w\in\mathcal{L}(A)\setminus\mathcal{L}(B_{2})$ such that $w\not\in\bigcup_{A^{\prime}\in\alpha(A)}\mathcal{L}(A^{\prime})$ , hence $A$ is prime.

$\hfill\blacktriangleleft$

A direct consequence of this theorem is that for any DFA $A$ and $k\in\mathbb{N}$ it holds that if $A$ is $k$ -composable then any non-inclusion $\mathcal{L}(A)\not\subseteq\mathcal{L}(B)$ is witnessed by a DFA of index at most $k$ . Unfortunately, the converse is not necessarily true. A witnessing DFA does not need to be part of a decomposition. For example, consider the minimal automaton $A$ recognizing the language $\mathcal{L}(A)=\{a^{3i}\mid i\in\mathbb{N}\}$ . This DFA contains $3$ states and is prime. However, in order to witness that $\mathcal{L}(A)\not\subseteq\Sigma^{*}\setminus\{\varepsilon\}$ we can use the minimal DFA $B$ recognizing the language $\mathcal{L}(B)=\{\varepsilon\}$ , which only contains $2$ states, but $B$ is not part of the decomposition of $A$ since $A$ is prime.

6 Separating words

A special case of the minimal witness problem is the separating words problem. The problem is to find the smallest DFA that distinguishes two given words, in the sense that it accepts one and rejects the other. Given two words $w_{1},w_{2}\in\Sigma^{*}$, what is the minimal DFA $A$ such that $w_{1}\in\mathcal{L}(A)\iff w_{2}\not\in\mathcal{L}(A)$? In our context this question amounts exactly to computing a witness for the inequivalence of $L_{1}=\Sigma^{*}\setminus\{w_{1}\}$ and $L_{2}=\Sigma^{*}\setminus\{w_{2}\}$. We define $Sep(w_{1},w_{2})$ as the minimal number of states in a DFA that distinguishes $w_{1}$ from $w_{2}$, that is, accepts one and rejects the other.

The study of the separating words problem focuses on the asymptotic behavior of the maximum value of $Sep(w_{1},w_{2})$ , taken over all pairs of words $w_{1}$ and $w_{2}$ of length $n$ over an alphabet of size $\ell\in\mathbb{N}$ . This is captured by the function $Sep_{\ell}(n)$ :

\displaystyle Sep_{\ell}(n)=\max_{w_{1},w_{2}\in\{a_{1},\dots,a_{\ell}\}^{n}}Sep(w_{1},w_{2}).

Deciding whether $Sep(w_{1},w_{2})\leq k$ for a given bound $k$ is NP-complete, which is a corollary of the study of identity checking in arbitrary finite semigroups [1]. Two words $w_{1},w_{2}\in\Sigma^{*}$ (often called terms in the context of algebras) form an identity for a finite semigroup $S$ if, for every projection $\tau:\Sigma\to S$, the equality $\tau(w_{1})=\tau(w_{2})$ holds. Here, $\tau$ extends to words by applying the semigroup operation, that is,

\tau(a_{1}a_{2}\dots a_{n})=\tau(a_{1})\cdot\tau(a_{2})\cdot\,\dotso\,\cdot\tau(a_{n})\quad\text{for }a_{1},\dots,a_{n}\in\Sigma.

The identity checking problem for a semigroup $S$ is to decide whether given words $w_{1},w_{2}$ form an identity for $S$.

A particular semigroup is the transformation semigroup $T_{n}$, which consists of all functions from $\{0,\dots,n-1\}$ to itself, with function composition as operation.

Theorem 17 ([1, Thm. 2], [10, Thm. 1]).

The identity checking problem for the transformation semigroup $T_{n}$ is coNP-complete for $n\geq 3$.

In [2] it is observed that this is closely related to the separating words problem.

Lemma 18.

Let $n\geq 3$ be a number, $\Sigma$ an alphabet, and $w_{1},w_{2}\in\Sigma^{*}$ two words. The words $w_{1},w_{2}$ are not an identity in $T_{n}$ if and only if there is a DFA $A$ with at most $n$ states such that $w_{1}\in\mathcal{L}(A)$ and $w_{2}\not\in\mathcal{L}(A)$.

Proof.

If $w_{1}$ and $w_{2}$ are not an identity in $T_{n}$ , then, by definition, there exists a projection $\tau:\Sigma\to T_{n}$ such that $\tau(w_{1})\neq\tau(w_{2})$ . Note that, since $\tau(w_{1})\neq\tau(w_{2})$ , there is an $i\in\{0,\dots,n-1\}$ such that $\tau(w_{1})(i)\neq\tau(w_{2})(i)$ . Given this witnessing projection $\tau$ , and number $i$ we define an $n$ -state DFA $A=(\{0,\dots,n-1\},\Sigma,\delta,q_{0},F)$ , where $q_{0}=i$ , $F=\{\tau(w_{1})(i)\}$ , and the transition function $\delta$ is defined as:

\delta(q,a)=\tau(a)(q)\quad\text{for each }q\in\{0,\dots,n-1\}\text{ and }a\in\Sigma.

This DFA operates by simulating the function of the projection $\tau$ on each input symbol. By construction $w_{1}\in\mathcal{L}(A)$ and since $\tau(w_{2})(i)\neq\tau(w_{1})(i)$ , $w_{2}\not\in\mathcal{L}(A)$ .

Conversely, assume that for words $w_{1},w_{2}\in\Sigma^{*}$ there is a DFA $A=(Q,\Sigma,\delta,q_{0},F)$ with $n$ states that distinguishes $w_{1}$ and $w_{2}$. We derive a projection $\tau:\Sigma\to T_{n}$ from $A$. Without loss of generality, assume $Q=\{0,\dots,n-1\}$ and define the projection $\tau$ for every $i\in Q,a\in\Sigma$ as $\tau(a)(i)=\delta(i,a)$. Since $A$ accepts exactly one of the two words, the runs on $w_{1}$ and $w_{2}$ from $q_{0}$ end in different states, so $\tau(w_{1})(q_{0})\neq\tau(w_{2})(q_{0})$. Hence $\tau$ witnesses that $w_{1}$ and $w_{2}$ are not an identity in $T_{n}$. $\hfill\blacktriangleleft$
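The first direction of the proof can be illustrated with a small sketch. It assumes a projection encoded as a dictionary from letters to tuples, where `tau[a][q]` stands for $\tau(a)(q)$; this encoding and the names `dfa_from_projection` and `accepts` are hypothetical choices made for this illustration only.

```python
def dfa_from_projection(tau, i, w1):
    """Build the DFA of the proof: delta(q, a) = tau(a)(q), q0 = i, F = {tau(w1)(i)}.

    The DFA is returned as a triple (delta, start, final); its states are
    0, ..., n-1, where n is the length of the tuples in tau.
    """
    n = len(next(iter(tau.values())))
    delta = {(q, a): f[q] for a, f in tau.items() for q in range(n)}
    # Reading w1 letter by letter from i yields tau(w1)(i).
    target = i
    for a in w1:
        target = delta[(target, a)]
    return delta, i, {target}


def accepts(dfa, word):
    """Run the DFA on the word and report whether it ends in an accepting state."""
    delta, state, final = dfa
    for a in word:
        state = delta[(state, a)]
    return state in final


# Illustrative projection into T_3: 'a' rotates the states, 'b' maps everything to 0.
tau = {"a": (1, 2, 0), "b": (0, 0, 0)}
dfa = dfa_from_projection(tau, 0, "ab")
print(accepts(dfa, "ab"), accepts(dfa, "ba"))
```

Here $\tau(ab)(0)=0$ while $\tau(ba)(0)=1$, so the constructed three-state DFA accepts `ab` and rejects `ba`.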

Corollary 19.

Given an alphabet $\Sigma$ , and two words $w_{1},w_{2}\in\Sigma^{*}$ deciding whether there is a DFA $A$ with at most $k$ states such that $w_{1}\in\mathcal{L}(A)$ and $w_{2}\not\in\mathcal{L}(A)$ is NP-complete for any $k\geq 3$ .

It is worth noting that the function $Sep_{\ell}(n)$ remains equal regardless of the size of the alphabet when $\ell\geq 2$ [4].

Lemma 20 ([4, Proposition 2.]).

For all $\ell\geq 2$ , $Sep_{\ell}(n)=Sep_{2}(n)$ .

However, for any fixed $k$, the existence of a $k$-state DFA that distinguishes two binary words can be decided in time linear in the length of the words, by trying all projections $\tau:\{0,1\}\to T_{k}$, of which there are only constantly many. When the alphabet is part of the input and $k\geq 2$, trying all the projections is not feasible, since there are $|T_{k}|^{|\Sigma|}$ possible projections.
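The exhaustive search over projections can be sketched as follows. This is only an illustration under assumptions: functions in $T_{k}$ are encoded as tuples of length $k$, and the names `image`, `separable` and `sep` are hypothetical. The search space has $(k^{k})^{|\Sigma|}$ elements, so the sketch is only usable for very small $k$ and $\Sigma$.

```python
from itertools import product


def image(tau, word, q):
    """Apply the letters of the word to state q one by one, as a DFA run would."""
    for a in word:
        q = tau[a][q]
    return q


def separable(w1, w2, k, alphabet):
    """Is there a projection into T_k and a start state i that tell w1 and w2 apart?"""
    functions = list(product(range(k), repeat=k))  # every map {0..k-1} -> {0..k-1}
    for choice in product(functions, repeat=len(alphabet)):
        tau = dict(zip(alphabet, choice))
        if any(image(tau, w1, i) != image(tau, w2, i) for i in range(k)):
            return True
    return False


def sep(w1, w2, alphabet):
    """Smallest k for which a separating k-state DFA exists (brute force)."""
    if w1 == w2:
        raise ValueError("identical words cannot be separated")
    k = 1
    while not separable(w1, w2, k, alphabet):
        k += 1
    return k


print(sep("0", "00", "01"))  # → 2
```

A one-state DFA accepts either every word or none, so it separates nothing; with two states, letting `0` swap the states already separates `0` from `00`.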

So, deciding whether $Sep(w_{1},w_{2})\leq k$ is known to be NP-complete for any fixed $k\geq 3$ and variable alphabet size. The complexity remains unresolved when $k$ is part of the input and the alphabet is constant. This is particularly significant because the binary case fully determines the asymptotic behaviour of $Sep_{\ell}(n)$, which makes it an interesting question for future research.

Question 21.

Given $w_{1},w_{2}\in\{0,1\}^{*}$, can $Sep(w_{1},w_{2})$ be computed in polynomial time?

7 Computing Witnesses in polynomial time

In this section we provide polynomial-time algorithms that compute not necessarily minimal witnesses for DFA inequivalence. First, we compute witnessing DFAs by greedily combining states until a fixed point is reached. Second, for unary DFAs we build a polynomial-time algorithm that computes a witness of index $k$ or rejects if none exists. This algorithm uses the notion of clean quotients, which were used to show that primality of unary DFAs is decidable in LOGSPACE [9].

Simple witnesses

In this section we introduce a naive polynomial-time algorithm that computes a witnessing DFA that is not necessarily minimal, but cannot become smaller by state merging.

Definition 22 (Irreducible witness).

Let $A=(Q,\Sigma,\delta,q_{0},F)$ be a DFA such that $\mathcal{L}(A)\not\subseteq\mathcal{L}(B)$ for some DFA $B$ . A DFA $A_{\textit{dist}}$ is called an irreducible witness w.r.t. $A$ and $B$ iff there is no DFA $A^{\prime}\in\alpha(A_{\textit{dist}})$ such that $\mathcal{L}(A^{\prime})\not\subseteq\mathcal{L}(B)$ , where $\alpha(A_{\textit{dist}})$ is the floor of $A_{\textit{dist}}$ defined as

\alpha(A_{\textit{dist}})=\{A^{\prime}\mid\mathit{ind}(A^{\prime})<\mathit{ind}(A_{\textit{dist}})\text{ and }\mathcal{L}(A^{\prime})\subseteq\mathcal{L}(A_{\textit{dist}})\}.

We obtain irreducible DFAs by iteratively combining states. In order to do so, we define an operation on DFAs that maps one state to another. Given a DFA $A=(Q,\Sigma,\delta,q_{0},F)$ , and two states $q,p\in Q$ , when we write $A[q\mapsto p]$ , we mean the automaton where the transitions into $q$ are relayed to $p$ , i.e.

	$\displaystyle A[q\mapsto p]$	$\displaystyle=(Q\setminus\{q\},\Sigma,\delta^{\prime},q_{0}^{\prime},F),\text{ where for all }(r,a)\in Q\times\Sigma$
	$\displaystyle\delta^{\prime}(r,a)$	$\displaystyle=\begin{cases}p&\text{ if }\delta(r,a)=q\\ \delta(r,a)&\text{ otherwise}.\end{cases}$
	$\displaystyle q_{0}^{\prime}$	$\displaystyle=\begin{cases}p&\text{ if }q_{0}=q\\ q_{0}&\text{ otherwise}.\end{cases}$

The algorithm proceeds as follows.

1. Pick a distinguishing word: First we compute a distinguishing word $w\in\mathcal{L}(A_{1})\setminus\mathcal{L}(A_{2})$. This can be done in quasi-linear time [19]. Note that a shortest distinguishing word does not necessarily result in the smallest witnessing automaton. However, for lack of better heuristics it seems best to pick a shortest distinguishing word $w$, since this guarantees an upper bound on the size of the distinguishing DFA found.
2. Greedily combine states: We compute the path $w$ takes in the automaton $A_{1}$. We try to combine states on this path, and check whether the resulting DFA still accepts a subset of $\mathcal{L}(A_{1})$. By construction the DFA still accepts $w$ and thus is a witnessing DFA for $\mathcal{L}(A_{1})\not\subseteq\mathcal{L}(A_{2})$.
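The two steps above can be sketched in Python. This is a minimal sketch rather than Algorithm 1 itself. It assumes complete DFAs encoded as tuples `(states, delta, start, final)`, it takes the distinguishing word $w$ as given, it tries every ordered pair of states instead of only those on the path of $w$, and it re-checks acceptance of $w$ explicitly. All function names are hypothetical.

```python
def merge(dfa, q, p):
    """A[q -> p]: remove q and relay its incoming transitions (and the start) to p."""
    states, delta, start, final = dfa
    new_delta = {(r, a): (p if t == q else t)
                 for (r, a), t in delta.items() if r != q}
    new_start = p if start == q else start
    # q is also dropped from the accepting set, since it is no longer a state.
    return states - {q}, new_delta, new_start, final - {q}


def accepts(dfa, word):
    _, delta, state, final = dfa
    for a in word:
        state = delta[(state, a)]
    return state in final


def included(small, big, alphabet):
    """Check L(small) ⊆ L(big) by exploring the reachable part of the product."""
    _, d1, s1, f1 = small
    _, d2, s2, f2 = big
    seen, stack = {(s1, s2)}, [(s1, s2)]
    while stack:
        x, y = stack.pop()
        if x in f1 and y not in f2:
            return False  # some word is accepted by small but rejected by big
        for a in alphabet:
            nxt = (d1[(x, a)], d2[(y, a)])
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return True


def simple_witness(a1, word, alphabet):
    """Merge states while the result accepts `word` and stays inside L(a1)."""
    current, changed = a1, True
    while changed:
        changed = False
        for q in sorted(current[0]):
            for p in sorted(current[0]):
                if p == q:
                    continue
                candidate = merge(current, q, p)
                if accepts(candidate, word) and included(candidate, a1, alphabet):
                    current, changed = candidate, True
                    break
            if changed:
                break
    return current


# The three-state cycle for {a^(3i)} with distinguishing word epsilon.
a1 = ({0, 1, 2}, {(0, "a"): 1, (1, "a"): 2, (2, "a"): 0}, 0, {0})
print(len(simple_witness(a1, "", "a")[0]))  # → 2
```

On this input the sketch ends with a two-state DFA for $\{\varepsilon\}$, and no further merge keeps the language inside $\mathcal{L}(A_{1})$. The order in which pairs are tried is arbitrary here and may affect the result on other inputs.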

Algorithm 1 Computing an irreducible distinguishing DFA.

This procedure results in a DFA that is not larger than the DFA accepting only a minimal distinguishing word, and potentially is a minimal witnessing DFA. We prove that the resulting DFA is irreducible, that is, for any smaller DFA $A^{\prime}$ whose language is a subset of $\mathcal{L}(A_{1})$ it holds that $w\not\in\mathcal{L}(A^{\prime})$.

Theorem 23.

Given two DFAs $A_{1}$ and $A_{2}$ , such that $\mathcal{L}(A_{1})\not\subseteq\mathcal{L}(A_{2})$ , and a word $w\in\mathcal{L}(A_{1})\setminus\mathcal{L}(A_{2})$ the output DFA SimpleWitness( $A_{1},A_{2}$ ) from Algorithm 1 is irreducible.

An interesting question is whether we can compute the minimal witnessing DFA if we compute the correct word $w$ in Line 2. In other words, given DFA $A$ can we compute a minimal DFA which recognizes a subset of $A$ while still containing a word $w\in\mathcal{L}(A)$ . More formally we are interested in the computational complexity of the following decision problem.

$w, k$ -DFA-DIST: Given a DFA $A_{1}$, a number $k\in\mathbb{N}$, and a word $w\in\mathcal{L}(A_{1})$, is there a DFA $A_{d}$ of index $k$ such that $\mathcal{L}(A_{d})\subseteq\mathcal{L}(A_{1})$ and $w\in\mathcal{L}(A_{d})$?

If $w, k$ -DFA-DIST is in P, then Question 21 is also answered affirmatively, i.e. we can compute the smallest DFA that separates two words in polynomial time.

Unary DFAs

When the alphabet $\Sigma$ of the input DFAs is a singleton, i.e. $\lvert\Sigma\rvert=1$, the problem of deciding minimal witnesses becomes easier. This is due to the specific shapes of these automata.

A unary DFA is a DFA where the alphabet $\Sigma$ is a singleton; usually we pick $\Sigma=\{a\}$. The language of a unary DFA is a set of words of the form $a^{i}$ for some natural number $i\in\mathbb{N}$. It is natural to see the language of a unary DFA as a subset of $\mathbb{N}$. For a unary DFA $B$ we define the unary language as $\mathcal{L}_{\mathbb{N}}(B)=\{i\in\mathbb{N}\mid a^{i}\in\mathcal{L}(B)\}$.

A minimal unary DFA has a specific shape. Since each state has exactly one outgoing transition, it contains exactly one cycle. The initial state might not be on the cycle. In this case the state is the start of a path leading up to the cycle. For the analysis of unary DFAs it is very convenient to distinguish these states.

Figure 4: The $(2,1)$ -DFA $A_{1}$ on the left and the $(0,1)$ -DFA $A_{2}$ on the right side.

For two integers $d\geq 1$ and $\ell\geq 0$ a unary $(\ell,d)$ -DFA is a unary DFA with $\ell$ prefix states $s_{0},\dots,s_{\ell{-}1}$ and a cycle containing $d$ states $q_{0},\dots,q_{d-1}$. We call the path leading into the cycle the prefix. Examples of these automata are given in Figure 4. The language of a unary $(\ell,d)$ -DFA $A=(\{s_{0},\dots s_{\ell{-}1}\}\cup\{q_{0},\dots q_{d-1}\},\{a\},\delta,s_{0},F)$ is given by

\mathcal{L}_{\mathbb{N}}(A)=\{i\in\mathbb{N}\mid i<\ell\text{ and }s_{i}\in F\}\cup\{\ell+i+cd\mid c\in\mathbb{N},0\leqslant i<d\text{ and }q_{i}\in F\}.
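The formula translates directly into a membership test. The sketch below assumes a hypothetical encoding in which a unary $(\ell,d)$ -DFA is given by two tuples of booleans: `prefix[i]` states whether $s_{i}\in F$ and `cycle[i]` states whether $q_{i}\in F$.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class UnaryDFA:
    prefix: tuple  # prefix[i] is True iff s_i is accepting (length l, may be 0)
    cycle: tuple   # cycle[i] is True iff q_i is accepting (length d >= 1)

    def accepts(self, n: int) -> bool:
        """Decide whether a^n is in the language, i.e. whether n is in L_N(A)."""
        l, d = len(self.prefix), len(self.cycle)
        if n < l:
            return self.prefix[n]        # the run ends in prefix state s_n
        return self.cycle[(n - l) % d]   # the run ends in cycle state q_((n-l) mod d)


# A (0,3)-DFA for {a^(3i) | i in N}: no prefix, accepting state q_0 on a 3-cycle.
multiples_of_three = UnaryDFA(prefix=(), cycle=(True, False, False))
print([n for n in range(10) if multiples_of_three.accepts(n)])  # → [0, 3, 6, 9]
```

Since the cycle is entered after exactly $\ell$ steps, membership of any $n\geq\ell$ only depends on $(n-\ell)\bmod d$.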

For unary DFAs, computing minimal DFA witnesses is similar to deciding primality of unary DFAs. Primality of unary DFAs is decidable in deterministic logarithmic space [9], by an algorithm that carefully inspects so-called clean quotients. A clean quotient of a unary DFA is a unary DFA in which the cycle is folded into a smaller cycle.

Although our setting differs from primality testing, a similar technique can be used to compute minimal witnesses for non-inclusion. We inspect all possible covers of the cycle of the unary DFA. For each possible folding this procedure takes only polynomial time and there is only a linear number of possible covers.

In particular, every non-inclusion between two unary DFAs $A_{1}$ and $A_{2}$ is witnessed by a unary DFA with a single accepting state. Hence, finding minimal witnesses for the language non-inclusion of two unary DFAs boils down to finding the smallest numbers $k_{1},k_{2}\in\mathbb{N}$ such that the language $L_{k_{1},k_{2}}=\{a^{k_{1}}\cdot a^{i\cdot k_{2}}\mid i\in\mathbb{N}\}$ distinguishes $A_{1}$ and $A_{2}$.

We define the family of unary DFAs $A_{\ell,d}=(Q,\{a\},\delta,s_{0},F)$ for each $\ell,d\in\mathbb{N}$ , where

$\blacksquare$ the set of states is defined as $Q=\{s_{0},\dots,s_{\ell-1}\}\cup\{q_{0},\dots,q_{d-1}\}$,

$\blacksquare$ the transition function is given as

$\displaystyle\delta(p,a)=\begin{cases}s_{j+1}&\text{if }p=s_{j}\text{ for some }j\in\{0,\dots,\ell{-}2\}\\ q_{0}&\text{if }p=s_{\ell-1}\\ q_{j}&\text{if }p=q_{i}\text{ and }j\equiv i{+}1\pmod{d}\end{cases},$

$\blacksquare$ and there is one accepting state at the start of the cycle, $F=\{q_{0}\}$.

Observe that there is always a minimal distinguishing DFA in the shape of $A_{\ell,d}$ .

Lemma 24.

Given unary DFAs $A_{1},A_{2}$ such that $\mathcal{L}(A_{1})\not\subseteq\mathcal{L}(A_{2})$ , then if a unary DFA $B$ witnesses the non-inclusion, then there is a DFA $A_{\ell,d}$ such that $\mathit{ind}(A_{\ell,d})\leqslant\mathit{ind}(B)$ and $A_{\ell,d}$ witnesses $\mathcal{L}(A_{1})\not\subseteq\mathcal{L}(A_{2})$ .

Proof.

Let $B=(Q,\{a\},\delta,q_{0},F)$ be a distinguishing DFA such that $\mathcal{L}(B)\subseteq\mathcal{L}(A_{1})$ and $\mathcal{L}(B)\not\subseteq\mathcal{L}(A_{2})$. Then, by definition there is a word $w\in\mathcal{L}(B)$ such that $w\not\in\mathcal{L}(A_{2})$. Now we define the DFA $B^{\prime}=(Q,\{a\},\delta,q_{0},\{\delta^{*}(q_{0},w)\})$ such that the only accepting state is the one accepting $w$. Since $\mathcal{L}(B^{\prime})\subseteq\mathcal{L}(B)\subseteq\mathcal{L}(A_{1})$ and $B^{\prime}$ still accepts $w$, it is still a distinguishing DFA for $\mathcal{L}(A_{1})\not\subseteq\mathcal{L}(A_{2})$, and by construction $\mathit{ind}(B^{\prime})\leqslant\mathit{ind}(B)$.

We complete the proof by showing that $\mathcal{L}(B^{\prime})=\mathcal{L}(A_{\ell,d})$ for some $\ell,d\in\mathbb{N}$. If $\delta^{*}(q_{0},w)$ is a prefix state $s_{i}\in Q$, then $\mathcal{L}(A_{\lvert w\rvert,0})=\mathcal{L}(B^{\prime})$. Otherwise, $\delta^{*}(q_{0},w)=q_{i}$ is some state $q_{i}\in Q$ on the cycle; let that cycle have length $d$. Then it holds that $\mathcal{L}(A_{\lvert w\rvert,d})=\mathcal{L}(B^{\prime})$. $\hfill\blacktriangleleft$

Algorithm 2 Computing a minimal witnessing DFA for two unary DFAs.

There are only polynomially many of these automata of bounded index, i.e. with $\mathit{ind}(A_{\ell,d})\leqslant k$. A polynomial-time algorithm can simply enumerate them all. Our algorithm is slightly less naive: for each accepting state that accepts a distinguishing word, it checks the possible automata $A_{\ell,d}$ that accept that distinguishing word.
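The naive enumeration can be sketched as follows. This is not Algorithm 2; it is a brute-force illustration under assumptions. Unary DFAs are encoded as pairs `(prefix, cycle)` of boolean lists, candidates are the languages $\{k_{1}+i\cdot k_{2}\mid i\in\mathbb{N}\}$ (a single word when $k_{2}=0$), and the search is cut off at $k_{1}+k_{2}\leqslant$ `bound`, a cut-off chosen for this sketch that is not the index of the candidate. All names are hypothetical.

```python
def accepts(dfa, n):
    """Membership of a^n for a unary DFA given as (prefix, cycle) boolean lists."""
    prefix, cycle = dfa
    if n < len(prefix):
        return prefix[n]
    return cycle[(n - len(prefix)) % len(cycle)]


def progression(k1, k2, count):
    """The first `count` elements of {k1 + i*k2}; a single element if k2 == 0."""
    return [k1] if k2 == 0 else [k1 + i * k2 for i in range(count)]


def is_witness(k1, k2, a1, a2):
    """Does {k1 + i*k2} lie inside L(a1) while containing a word outside L(a2)?"""
    # For an (l, d)-DFA, membership of k1 + i*k2 is periodic in i (with a period
    # dividing d) once k1 + i*k2 >= l, so the first l + d elements suffice.
    def horizon(dfa):
        return len(dfa[0]) + len(dfa[1])

    inside_a1 = all(accepts(a1, n) for n in progression(k1, k2, horizon(a1)))
    escapes_a2 = any(not accepts(a2, n) for n in progression(k1, k2, horizon(a2)))
    return inside_a1 and escapes_a2


def candidate_witnesses(a1, a2, bound):
    """All pairs (k1, k2) with k1 + k2 <= bound whose language is a witness."""
    return [(k1, k2)
            for k1 in range(bound + 1)
            for k2 in range(bound + 1 - k1)
            if is_witness(k1, k2, a1, a2)]


a1 = ([], [True, False, False])  # {a^(3i) | i in N}
a2 = ([False], [True])           # every word except epsilon
print(candidate_witnesses(a1, a2, 3))  # → [(0, 0), (0, 3)]
```

For these inputs both $\{\varepsilon\}$ and $\{a^{3i}\mid i\in\mathbb{N}\}$ are witnesses; which one is smaller has to be judged by the index of the corresponding minimal DFA, which the sketch does not compute.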

Lemma 25.

Let $A_{1},A_{2}$ be two unary DFAs such that $\mathcal{L}(A_{1})\not\subseteq\mathcal{L}(A_{2})$. Then the DFA $B=$ UnaryMinWitness( $A_{1},A_{2}$ ) computed by Algorithm 2 is a minimal witness for the non-inclusion.

Since all loops of Algorithm 2 have a linear bound, the algorithm runs in polynomial time.

Theorem 26.

The decision problem $k$ -DFA-DIST for unary DFAs is in P.

8 Related work

There are some decision problems on DFAs that show some similarities, but are different from the work here. One instance is the early work of Gold [5] and Pfleeger [18], in which it is shown that learning minimal DFAs from (partial) observations is NP-complete. In this line of work by Gold, so-called separating languages are widely studied in the literature. Here the separating problem is, given languages $L_{1}$ and $L_{2}$, to find a separating language $L_{sep}$ such that $L_{sep}\subseteq L_{1}$ and $L_{sep}\cap L_{2}=\emptyset$. Although this resembles our witnessing problem, a direct relation is not obvious.

The framework of distinguishing words roughly corresponds to that of Hennessy–Milner theorems for non-deterministic structures [7]. Two non-deterministic structures are not bisimilar if and only if there is a distinguishing formula (within the Hennessy–Milner logic). These formulas are used as counter-examples in the field of model checking. Despite the fact that computing minimal formulas is NP-hard, a similar method as computing distinguishing words results in succinct witnesses [3, 14]. This work can contribute towards witnesses with invariants in the non-deterministic setting.

9 Conclusions

We showed how to use DFAs to witness language inequivalence. In this way it is possible to represent more concisely the witnesses that explain why the languages of the input DFAs are not equivalent. These witnesses correspond with prime languages, which were studied previously [12, 9]. We show with a reduction from CNF-SAT that deciding minimal distinguishing witnesses is NP-complete. This reduction exploits structure in the language in a non-trivial way. This structure also provides a simpler proof of the coNP-hardness of the minimal pumping length. This indicates that it might be a useful tool in the DFA workbench. In particular, we conjecture that it can be used to close the complexity gap of deciding primality [12].

To contrast this NP-hardness result we provide a greedy algorithm that in polynomial time computes a witnessing DFA that is never larger than a distinguishing word. In addition, we show that deciding minimal witnesses is in P for unary language non-inclusion.

We leave open two complexity questions that seem interesting. The first is the complexity of deciding $w, k$ -DFA-DIST and of the separating words problem for a binary alphabet. Additionally, it would be interesting to extend the hardness result to primality of DFAs. We sketched a connection between prime DFAs and minimal witnesses for DFA inequivalence; however, translating this hardness result does not seem trivial.

References

[1] J. Almeida, M. V. Volkov, and S. V. Goldberg. Complexity of the identity checking problem for finite semigroups. Journal of Mathematical Sciences, 158(5):605–614, 2009. doi:10.1007/s10958-009-9397-z.
[2] Andrei A. Bulatov, Olga Karpova, Arseny M. Shur, and Konstantin Startsev. Lower bounds on words separation: Are there short identities in transformation semigroups?, August 2017. doi:10.37236/6450.
[3] Rance Cleaveland. On automatically explaining bisimulation inequivalence. In Edmund M. Clarke and Robert P. Kurshan, editors, Proc. Computer-Aided Verification (CAV 1990), pages 364–372, Berlin, Heidelberg, 1991. Springer Berlin Heidelberg. doi:10.1007/BFb0023750.
[4] Erik D Demaine, Sarah Eisenstat, Jeffrey Shallit, and David A Wilson. Remarks on separating words. In Markus Holzer, Martin Kutrib, and Giovanni Pighizzini, editors, Proc. of DCFS 2011, volume 6808 of LNCS, pages 147–157. Springer, 2011. doi:10.1007/978-3-642-22600-7_12.
[5] E. Mark Gold. Complexity of automaton identification from given data. Information and Control, 37(3):302–320, 1978. doi:10.1016/S0019-9958(78)90562-4.
[6] Hermann Gruber, Markus Holzer, and Christian Rauch. The pumping lemma for regular languages is hard. In International Conference on Implementation and Application of Automata, pages 128–140. Springer, 2023. doi:10.1007/978-3-031-40247-0_9.
[7] Matthew Hennessy and Robin Milner. On observing nondeterminism and concurrency. In Jaco de Bakker and Jan van Leeuwen, editors, Automata, Languages and Programming (ICALP 1980), pages 299–309, Berlin, Heidelberg, 1980. Springer-Verlag. doi:10.5555/646234.758793.
[8] John Hopcroft. An n log n algorithm for minimizing states in a finite automaton. In Zvi Kohavi and Azaria Paz, editors, Theory of Machines and Computations, pages 189–196. Academic Press, 1971. doi:10.1016/B978-0-12-417750-5.50022-1.
[9] Ismaël Jecker, Orna Kupferman, and Nicolas Mazzocchi. Unary Prime Languages. In Javier Esparza and Daniel Král’, editors, Proc. of MFCS 2020, volume 170 of Leibniz International Proceedings in Informatics (LIPIcs), pages 51:1–51:12, Dagstuhl, Germany, 2020. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.MFCS.2020.51.
[10] Ondřej Klíma. Identity checking problem for transformation monoids. Semigroup Forum, 84(3):487–498, 2012. doi:10.1007/s00233-012-9401-7.
[11] Dexter Kozen. Lower bounds for natural proof systems. In Proc. of SFCS 1977, pages 254–266. IEEE, 1977. doi:10.1109/SFCS.1977.16.
[12] Orna Kupferman and Jonathan Mosheiff. Prime languages. Information and Computation, 240:90–107, 2015. doi:10.1016/j.ic.2014.09.010.
[13] Jan Martens. Deciding minimal distinguishing DFAs is NP-complete. arXiv preprint arXiv:2306.03533, 2023. doi:10.48550/arXiv.2306.03533.
[14] Jan Martens and Jan Friso Groote. Computing minimal distinguishing Hennessy-Milner formulas is NP-hard, but variants are tractable. In G.A. Pérez and J.-F. Raskin, editors, Proc. of CONCUR 2023, volume 279 of LIPIcs, pages 32:1–32:17. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023. doi:10.4230/LIPIcs.CONCUR.2023.32.
[15] Edward F. Moore. Gedanken-experiments on sequential machines. In C. E. Shannon and J. McCarthy, editors, Automata Studies, Annals of Mathematics Studies, no. 34. Princeton University Press, 1956.
[16] John R. Myhill. Finite automata and the representation of events. WADD Technical Report, 57-624:112–137, 1957.
[17] Anil Nerode. Linear automaton transformations. Proceedings of the American Mathematical Society, 9(4):541–544, 1958. doi:10.1090/S0002-9939-1958-0135681-9.
[18] Charles P. Pfleeger. State reduction in incompletely specified finite-state machines. IEEE Transactions on Computers, 100(12):1099–1102, 1973. doi:10.1109/T-C.1973.223655.
[19] Rick Smetsers, Joshua Moerman, and David N Jansen. Minimal separating sequences for all pairs of states. In Adrian-Horia Dediu, Jan Janoušek, Carlos Martín-Vide, and Bianca Truthe, editors, Proc. of LATA 2016, volume 9618 of LNCS, pages 181–193. Springer, 2016. doi:10.1007/978-3-319-30000-9_14.

