Register Automata with Permutations

Balachander, Mrudula; Filiot, Emmanuel; Gentilini, Raffaella; Tzevelekos, Nikos

doi:10.4230/LIPIcs.MFCS.2025.14

Register Automata with Permutations

Mrudula Balachander

Université libre de Bruxelles (ULB), Belgium Emmanuel Filiot

Université libre de Bruxelles (ULB), Belgium Raffaella Gentilini

University of Perugia, Italy Nikos Tzevelekos

Queen Mary University of London, UK

Abstract

We propose Permutation Deterministic Register Automata (pDRAs), a deterministic register automaton model where we allow permutations of registers in transitions. The model enables minimal canonical representations and pDRAs can be tested for equivalence in polynomial time. The complexity of minimization is between GI (the complexity of graph isomorphism) and NP. We then introduce a subclass of pDRAs, called register automata with fixed permutation policy, where the register permutation discipline is stipulated globally. This class generalizes the model proposed by Benedikt, Ley and Puppis in 2010, and we show that it also admits minimal and canonical representations, based on a finite-index word equivalence relation. As an application, we show that for any regular data language $L$ , the minimal register automaton with fixed permutation policy recognizing $L$ can be actively learned in polynomial time using oracles for membership, equivalence and data-memorability queries. We show that all the oracles can be implemented in polynomial time, and so this yields a polynomial time minimization algorithm.

Keywords and phrases:

Register automata, data words, equivalence, minimization, active learning

Funding:

Mrudula Balachander: Research Fellow at F.R.S.-FNRS

Emmanuel Filiot: Research Director at F.R.S.-FNRS. This work was partially funded by the FNRS project 40020726.

Copyright and License:

2012 ACM Subject Classification:

Theory of computation

\rightarrow

Formal languages and automata theory

DOI:

10.4230/LIPIcs.MFCS.2025.14

Event:

50th International Symposium on Mathematical Foundations of Computer Science (MFCS 2025)

Editors:

Paweł Gawrychowski, Filip Mazowiecki, and Michał Skrzypczak

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

Automata for words and trees over infinite alphabets have been extensively studied in the literature [9, 24, 23]. Automata operating over infinite alphabets are not as robust as finite automata (for example, non-determinism usually brings extra computational power), and that has led to introducing various models, with different expressiveness and algorithmic properties. However, register automata, introduced as finite memory automata by Kaminksi and Francez [13], are perhaps one of the most studied models of automata over infinite alphabets. They can naturally model systems processing sequences of data, which have to be stored and compared with respect to (dis)equality. Register automata (RAs) over an infinite dataset $\mathcal{D}$ are finite-state automata extended with a finite set of registers. When they read a sequence of data from $\mathcal{D}$ , data can be stored in the registers and compared against the data already in memory. Deterministic register automata (DRAs) enjoy better algorithmic properties than their non-deterministic counterparts, such as closure under complementation and a decidable equivalence problem. The class of languages recognized by DRAs are sometimes called regular data languages. In this paper we are interested in two fundamental questions about regular data languages: $(1)$ whether there is an effective, machine-independent representation thereof which is concise and enjoys good complexity bounds with respect to standard decision problems (such as the equivalence problem); and, in addition, $(2)$ the representation naturally yields a canonical and minimal representative.

Register automata models come in various guises, with some formalisms giving emphasis on succinctness (i.e. ability to write a concise automaton for a given language) and others on efficiency (ability to decide standard problems efficiently). For example, allowing registers to store duplicate data enhances succinctness [8], while ensuring that registers do not contain duplicate elements and cannot be empty yields a polynomial time equivalence checking routine (for deterministic automata) [17].¹¹1(Language) Equivalence is undecidable for non-deterministic RAs [19]. For deterministic ones, the problem is coNP-C if we allow empty registers [22, 16], and PSpace-hard if we cater for duplicate register content [10]. The problem is lowered to PTime by disallowing both features [17]. One could argue that there is an antagonism between the two notions: succinctness requires flexibility in how data are manipulated, whereas efficiency benefits from a restricted set of options. Part of the contribution is to show that this is not the case, and our automata strike a balance between the two notions.

Regular languages (over finite alphabets) are characterized by having finite-index Myhill-Nerode equivalence relations on words. This machine-independent characterization naturally yields a canonical and state-minimal DFA for any regular language. Minimization of register automata presents challenges related to choosing unique representatives for states that are equivalent up to data permutation, and ensuring the operations of the automata are compatible with said choice of representatives. To the best of our knowledge, the first extension of this result to regular data languages was given in [4], using the notion of memorable data (data that need to be stored to recognize a language). Moreover, [4] introduces a class of DRAs where, in each transition, registers are permuted so that their order follows the order of the last appearance of their data. We call this class Lar-DRA. For any regular data language, there exists a canonical Lar-DRA which is both state-minimal and register-minimal. The time complexity of minimizing Lar-DRAs is known to be exponential [4].

Register automata with permutations.

In this paper, we propose a new model of register automata whose equivalence problem is in PTime like the automata in [17] (called herein MRT-automata), which admits canonical and minimal representatives like Lar-automata, and which is exponentially more succinct than both models. The model, called deterministic register automata with permutations (pDRAs), is an extension of MRT-automata with the possibility of applying permutations on the registers. No duplicate values in registers nor empty registers are allowed, although some registers can be dropped (become unavailable) in some states. When reading a datum $d$ from the infinite alphabet $\mathcal{D}$ , a pDRA can test whether it is already stored in some register $r$ , or it is fresh (not stored in any register). In the latter case, the pDRA assigns the fresh datum $d$ to some register. In both cases, a permutation (which depends on the transition) is applied on the registers to permute their content. The content of unavailable registers is deleted.

Example 1.

Let $n\in\mathbb{N}$ and

L_{n}=\{d_{1}d_{2}\dots d_{n}d_{\pi(1)}d_{\pi(2)}\dots d_{\pi(n)}\mid\forall i% \neq j.\,d_{i}\neq d_{j}\land\pi\in\{{\sf id},{\sf rev}\}\}

where ${\sf id}$ is the identity permutation and ${\sf rev}=\{i\mapsto n-i\mid i\in[1,n]\}$ is the order-reversing permutation. It is recognizable by a pDRA (illustrated in Figure 1) with $2n+1$ states and at most $n$ available registers. By setting $q_{0}$ to be accepting and redirecting the last transition towards $q_{0}$ instead of $q^{\prime}_{n}$ , the modified pDRA recognizes $L_{n}^{*}$ . Since $q_{0}$ has no available registers, coming back to $q_{0}$ has the effect of deleting the whole memory.

Outline.

We study language-related properties of pDRAs. After showing that pDRAs are exponentially more succinct than MRT-automata and Lar-DRAs, we look at generalizations of known results in the pDRA setting. We show that pDRAs have membership, emptiness, equivalence and data memorability problems in PTime.

Figure 1: A pDRA recognizing

L_{n}

(see Example 1), with set of registers

\{1,\dots,n\}

. The available registers are indicated below each state. A transition

p\xrightarrow{r,\pi}q

, for

\pi\in\{{\sf id},{\sf rev}\}

, is fired when the current datum equals the content of register

r

, while a transition

p\xrightarrow{r^{\bullet},\pi}q

is fired when the current datum is different from any datum in memory, which results in storing

d

in register

r

. In both cases, the data stored in the registers are permuted according to

\pi

.

We then focus on mimimization and show that pDRAs have minimal and canonical representatives. We prove that pDRAs can be minimized in exponential time. The associated decision problem is in NP and we prove that it is as hard as the graph isomorphism problem.

The barrier in efficient minimization motivates a subclass of pDRAs, that we call pDRAs with fixed register policy. In this model, permutations applied to registers only depend on the number of available registers and the register test. Therefore, any fixed register policy $\varPi$ determines a class of pDRAs, which we denote $\varPi$ -DRAs. Additionally, in a $\varPi$ -DRA, registers are ordered and data are required to be stored in consecutive registers. For instance, the model of [4] coincides with Lar-DRAs, where Lar is a policy ensuring that registers are ordered by last occurrence of their contents.

As an equi-expressive particular case of pDRAs, $\varPi$ -DRAs inherit some of the properties of pDRAs, such as PTime equivalence (improving the exponential upper bound of [4] as a special case). Our next contribution is a Myhill-Nerode congruence for $\varPi$ -DRAs, which yields canonical and minimal $\varPi$ -DRAs for any regular data language. This generalizes the results of [4] with a different approach based on abstract residual languages. In some sense, compared to Lar-DRA, our result shows that the last appearance policy is not necessary to obtain a canonical and minimal model: fixing a register policy is actually sufficient.

We use the results gathered for $\varPi$ -DRAs, either directly or by inheritance from pDRAs, to set up an active learning framework for regular data languages, in the spirit of $L^{*}$ and $L^{\#}$ algorithms for DFAs [1, 25]. We show that any regular data language $L$ can be learned in polynomial time, using polynomially many queries (in the size of the minimal $\varPi$ -DRA for $L$ ) to oracles of three types: equivalence, membership and memorability. The latter asks, given a word and a datum in that word, whether it is memorable or not. Our learning algorithm returns a minimal $\varPi$ -DRA for $L$ , and, since the oracles can be implemented in PTime, this also yields a PTime minimization algorithm for $\varPi$ -DRAs, for any fixed policy $\varPi$ .

In summary, we start by introducing pDRAs. After establishing standard results (beginning of Section 3), we delve into distinctive features of pDRAs and their $\varPi$ -DRA subclass, and develop novel results and algorithms. Our strongest contributions include:

$\blacksquare$

Memorability problem in PTime (Section 3). This is important for efficient minimization and learning.
$\blacksquare$

Canonical representatives for both pDRAs (Section 4) and $\varPi$ -DRAs (Section 5) based on equivalence relations on data words, in the spirit of Myhill-Nerode relations. We are able to construct automata that are minimal both in the number of states and in the number of registers. The minimal pDRA for a language is unique once the order in which data that need to be memorized has been fixed (see Remark 20).
$\blacksquare$

Minimization in ExpTime and GI-hard for pDRAs (Section 4), and in PTime for pDRAs with a constant number of registers and for $\varPi$ -DRAs (Section 6, improving prior exponential bounds [4]).
$\blacksquare$

PTime active learning for $\varPi$ -DRAs (Section 6).

Related work.

A succinct register automata model has been developed in [8], where succinctness is achieved by allowing registers to contain duplicate elements. As mentioned earlier, this comes at the cost of a PSpace-hard equivalence. A different approach to obtain a Myhill-Nerode characterisation of regular data languages was proposed in [6, 7], in terms of deterministic nominal automata, which are equi-expressive as deterministic register automata [5]. Nominal automata are finite-state automata recast within the universe of nominal sets [21], and where finiteness is generalized to orbit-finiteness. That approach offers neat generalizations of standard constructions and algorithms from finite-state automata to nominal automata (e.g. learning with $L^{*}$ [14]). On the other hand, the inner-workings of nominal automata and their algorithms are reliant on nominal-sets reasoning, such as equivalence under data-permutation, which introduces a level of indirection. Libraries dedicated to nominal-sets reasoning are available to that effect, though in terms of theoretical complexity and practical benchmarking, direct custom-made register-automata algorithms are typically faster [18, 3]. To represent nominal automata outside nominal sets, a representation scheme akin to a register discipline is essentially needed. However, as argued in [7, Sec. 6.2], extending finite automata with registers implicitly puts a linear order on the stored data, which is incompatible with minimality in the sense of Myhill-Nerode. Adding register permutations on transitions solves this problem and it turns out that minimal orbit-finite nominal automata (with straight state sets) and minimal pDRAs have same number of states (we thank the anonymous reviewer for pointing this connection out).

The problem of minimisation of systems dealing with data coming from an infinite alphabet has been studied extensively in the setting of $\pi$ -calculus processes and in particular using History-Dependent Automata (HDAs) [15, 20]. Those works can be seen as precursors of the research in register automata and nominal automata. While the focus had been on mobile processes and bisimulation, rather than language acceptance, HDAs brought up central issues inherent in minimising systems with data, namely picking a canonical order of storing data and dealing with symmetries between stored data. Moreover, HDAs first featured transitions with reassignment of registers (names, in HDA terminology), of which the permutations we use in pDRAs can be seen as a special case.

The problem of learning register automata has been previously considered in [12, 11, 2]. The learner in [2] addresses Lar-DRA and works in a passive framework, i.e. without relying on the active interaction with an oracle (but only on positive and negative samples). Within active learning, the state of the art learner in [11] (improving on [12]) addresses the model in [8] and is exponential in both the number of registers and the length of the longest counterexample provided by the teacher. We already mentioned nominal $L^{*}$ (called $\nu L^{*}$ ) for learning nominal automata [14]. This has membership query complexity exponential in the number of states and registers (or longest counterexample, see loc. cit.). Note, however, that the learning algorithms above do not use a memorability oracle.

2 Data word languages and register automata with permutations

In this section we introduce our register automata model. We start with some preliminaries.

Numbers, sets, relations.

For $i,j\in\mathbb{N}$ , we let $[i,j]$ be the set $\{k\in\mathbb{N}\mid i\leq k\wedge k\leq j\}$ . Note that $[i,j]=\varnothing$ when $j<i$ . We let ${\cal P}(X)$ be the powerset of $X$ . Given a set $X$ , a permutation over $X$ is a bijection from $X$ to itself ( $\pi:X\overset{\cong}{\to}X$ ), whereas a partial permutation over $X$ is a bijection $\pi:X_{1}\to X_{2}$ for subsets $X_{1},X_{2}\subseteq X$ (we write $\pi:X\overset{\cong}{\rightharpoonup}X$ ).
For any relation $R\subseteq X\times Y$ , we define its domain as ${\sf dom}(R)=\{x\in X\mid\exists y:(x,y)\in R\}$ and its range as ${\sf rng}(R)=\{y\in Y\mid\exists x:(x,y)\in R\}$ . Given $X^{\prime}\subseteq{\sf dom}(R)$ , we may write $R|_{X^{\prime}}$ for the restriction $R\cap(X^{\prime}\times Y)$ . For any two relations $R_{1}\subseteq X\times Y,R_{2}\subseteq Y\times Z$ , we define their composition as $R_{1};R_{2}=\{(i,j)\mid\exists k.\,(i,k)\in R_{1}\wedge(k,j)\in R_{2}\}$ . For any $R\subseteq X\times Y$ and $i\in X,j\in Y$ , we define the update $R[i\mapsto j]=\{(i,j)\}\cup R\setminus(\{i\}\times X)$ .

Data alphabet, data words and nominal sets.

We fix an infinite alphabet $\mathcal{D}$ of data. Data are ranged over by $d$ and variants, or by $a,b,\dots$ and variants, while sometimes we may simply use numbers $1,2,3,\dots$ . A data word is a finite sequence $w\in\mathcal{D}^{*}$ . We denote by $\epsilon$ the empty word, $|w|$ the length of $w$ , and $w[i]$ the $i$ th datum in $w$ ( $i\in[1,|w|]$ ). We sometimes see $w$ as a function $\hat{w}:[1,|w|]\to\mathcal{D}$ . A data permutation is a bijection $\tau:\mathcal{D}\rightarrow\mathcal{D}$ fixing all but finitely many elements of $\mathcal{D}$ ; we write $\mathrm{Perm}(\mathcal{D})$ for the set of data permutations. Given $d,d^{\prime}\in\mathcal{D}$ , we write $(d\ d^{\prime})$ for the data permutation that swaps $d$ and $d^{\prime}$ , and fixes other data.

A nominal set [21] (over $\cal D$ ) is a set $X$ along with a group action ${\cdot}:\mathrm{Perm}(\mathcal{D})\times X\to X$ , that is, for each data permutation $\tau$ and $x\in X$ , we have $\tau\cdot x\in X$ . We say that a set $S\subseteq\cal D$ is a support of $x\in X$ if, for all permutations $\tau$ , if $\tau(d)=d$ for all $d\in S$ then $\tau\cdot x=x$ . We stipulate that all elements of a nominal set have finite support. Finite support is closed under intersection and, hence, each element $x$ of a nominal set $X$ has least finite support which we denote by $\mathsf{supp}(x)$ . We say that $x$ is equivariant if $\mathsf{supp}(x)=\emptyset$ , i.e. $\tau\cdot x=x$ for all $\tau\in\mathrm{Perm}(\mathcal{D})$ . For a subset $Y$ of a nominal set $X$ and any $\tau$ , we write $\tau\cdot Y$ for the set obtained from $Y$ by applying $\tau$ elementwise on it: $\tau\cdot Y=\{\tau\cdot x\mid x\in Y\}$ .

The set $\mathcal{D}^{*}$ of data words is a nominal set, with action $\tau\cdot w=\hat{w};\tau$ . For any $w\in\mathcal{D}^{*}$ , we can see that $\mathsf{supp}(w)=\{w[i]\mid i\in[1,|w|]\}$ . Note that a set of words $L\subseteq\mathcal{D}^{*}$ has finite support just if there is finite $S\subseteq\mathcal{D}$ such that, for all $\tau$ , if $\tau(d)=d$ for all $d\in S$ then $\tau\cdot L=L$ . We write $\mathsf{supp}(L)$ for the least finite support of $L$ . $L$ is called a data language if it is equivariant, that is, if it has empty support, i.e., it is closed under data permutation. Given $u,v\in\mathcal{D}^{*}$ , we write $u\simeq v$ if there exists $\tau\in\mathrm{Perm}(\mathcal{D})$ such that $u=\tau\cdot v$ .

Registers and register permutations.

In this paper registers are represented by natural numbers ranging over $[1,r]$ for some $r\geq 1$ , and register assignments are partial injective mappings $\rho:[1,r]\rightharpoonup\mathcal{D}$ . We observe that if $\rho_{1},\rho_{2}:[1,r]\rightharpoonup\mathcal{D}$ are two register assignments, then $\rho_{1};\rho_{2}^{-1}$ is a partial permutation over $[1,r]$ which identifies registers holding the same data: $\rho_{1};\rho_{2}^{-1}=\left\{(i,j)\mid i\in{\sf dom}(\rho_{1})\land j\in{\sf dom% }(\rho_{2})\land\rho_{1}(i)=\rho_{2}(j)\right\}$ .

For any $n\in\mathbb{N}$ , we let $\mathcal{S}_{n}$ be the group of permutations on $[1,n]$ . We write $\mathsf{id}$ for the identity permutation on $[1,n]$ . We may write permutations using sequence notation, i.e. as $(\pi(1),\dots,\pi(n))$ , for $\pi\in\mathcal{S}_{n}$ . Given a partial permutation $\tau:[1,n]\rightharpoonup[1,n]$ its canonical extension is the permutation $\hat{\tau}\in\mathcal{S}_{n}$ obtained by setting: $\hat{\tau}(x)=\tau(x)$ if $x\in\mathsf{dom}(\tau)$ ; and $\hat{\tau}(x)=\tau^{-\kappa}(x)$ if $x\in{\sf rng}(\tau)\setminus\mathsf{dom}(\tau)$ and $\kappa=\max\{\kappa\mid\tau^{-\kappa}(x)\text{ is defined}\}$ ( $\tau^{-\kappa}$ stands for the $\kappa$ -fold composition $\tau^{-1};\dots;\tau^{-1}$ ); and, finally, $\hat{\tau}(x)=x$ if $x\in[1,n]\setminus(\mathsf{dom}(\tau)\cup{\sf rng}(\tau))$ . Put otherwise, $\hat{\tau}$ extends $\tau$ by closing maximal paths in $\tau$ and by being the identity everywhere else. In particular, given sequences of pairwise distinct elements $\vec{i},\vec{j}\in[1,n]^{k}$ , for some $k\leq n$ , we write $(\vec{i}\mapsto\vec{j})$ for the canonical extension to $[1,n]$ of the map $\{(i_{l},j_{l})\mid l\in[1,k]\}$ .

Definition 2.

A deterministic permutation register automaton (pDRA) is a tuple ${\mathcal{A}}=\langle r,Q,\mu,q_{0},F,\delta\rangle$ , where:

$\blacksquare$

$[1,r]$ is a set of registers (with $r\geq 0$ );
$\blacksquare$

$Q$ is a finite set of states, $q_{0}\in Q$ is the initial state, and $F\subseteq Q$ are final states;
$\blacksquare$

$\mu:Q\to{\cal P}([1,r])$ is an availability function, such that $\mu(q_{0})=\varnothing$ ;
$\blacksquare$
$\delta=(\delta_{=}\cup\delta_{\neq})$ is a pair of partial transition functions of types:
- –
  
  $\delta_{=}:Q\times R\rightharpoonup Q\times\mathcal{S}_{r}$
- –
  
  $\delta_{\neq}:Q\rightharpoonup Q\times R\times\mathcal{S}_{r}$

We write $p\xrightarrow{k,\pi}q$ for $\delta_{=}(p,k)=(q,\pi)$ , and $p\xrightarrow{k^{\bullet},\pi}q$ for $\delta_{\neq}(p)=(q,k,\pi)$ . Transitions are subject to the following availability condition. Whenever $p\xrightarrow{x,\pi}q$ :

$\blacksquare$

if $x\in[1,r]$ then $x\in\mu(p)$ and $\mu(q)\subseteq\pi(\mu(p))$ ;
$\blacksquare$

if $x=k^{\bullet}$ then $\mu(q)\subseteq\pi(\mu(p)\cup\{k\})$ .

We say that ${\mathcal{A}}$ is complete if $\delta_{\neq}$ is total and, for all $p\in Q$ , it holds that $\mathsf{dom}(\delta_{=}(p,\cdot))=\mu(p)$ .

$\blacktriangleright$ Remark 3.

Let ${\mathcal{A}}$ be a pDRA as in the definition above. The automaton operation relies on states and registers, with each state $p\in Q$ being equipped with a set of registers $\mu(p)\subseteq[1,r]$ that are available at that state. During its operation, the automaton maintains a register assignment, which is an injective map from available registers to data (i.e. $\rho:\mu(p)\xrightarrow{\text{1-1}}\mathcal{D}$ ). Given a state $p\in Q$ and assignment $\rho$ , for any input datum $d$ :

$\blacksquare$

If $d$ is stored in some available register, say $d=\rho(x)$ with $x\in\mu(p)$ , and $\delta_{=}(p,x)=(q,\pi)$ , then the automaton reads $d$ and moves to state $q$ . Its register assignment will be updated by applying the permutation $\pi$ on its registers and discarding those registers that are not available in $q$ (i.e. the content of registers in $[1,r]\setminus\mu(q)$ gets deleted).
$\blacksquare$

If $d$ is a locally fresh datum (i.e. $d\notin{\sf rng}(\rho)$ ) and $\delta_{\neq}(p)=(q,k,\pi)$ , then the automaton reads $d$ and moves to state $q$ . The register assignment is updated by storing $d$ in register $k$ (rewriting the existing contents if $k$ were available), then permuting registers by $\pi$ and finally discarding those registers that are not available in $q$ .

Configurations, runs and languages.

A configuration of a pDRA ${\mathcal{A}}$ is a pair $(p,\rho)$ , where $p\in Q$ and $\rho:\mu(p)\rightarrow\mathcal{D}$ is a register assignment, defined as a (total) mapping from $\mu(p)$ to data. We denote by $\mathbb{C}_{{\mathcal{A}}}$ the set of configurations of ${\mathcal{A}}$ . Transitions between configurations are modeled as a labelled transition system ${\xrightarrow{}_{\mathcal{A}}}\subseteq\mathbb{C}_{\mathcal{A}}\times\mathcal{% D}\times\mathbb{C}_{\mathcal{A}}$ , whose elements are denoted by $\kappa_{1}\xrightarrow{d}_{\mathcal{A}}\kappa_{2}$ , for $\kappa_{i}\in\mathbb{C}_{\mathcal{A}},d\in\mathcal{D}$ . We let $(p,\rho_{1})\xrightarrow{d}_{\mathcal{A}}(q,\rho_{2})$ , when one of the following holds:

$\blacksquare$

$\rho_{1}(k)=d$ , for some $k\in\mu(p)$ , $\delta_{=}(p,k)=(q,\pi)$ and $\rho_{2}=(\pi^{-1};\rho_{1})|_{\mu(q)}$ (Equality)
$\blacksquare$

$d\not\in{\sf rng}(\rho_{1})$ , $\delta_{\neq}(p)=(q,k,\pi)$ , and $\rho_{2}=(\pi^{-1};\rho_{1}[k\mapsto d])|_{\mu(q)}$ (Disequality)

Note that, due to the availability condition, the assignment $\rho_{2}$ is in both cases well defined.

A run of a word $w=d_{1}d_{2}\dots d_{n}\in\mathcal{D}^{*}$ on ${\mathcal{A}}$ is a sequence of transitions $\kappa_{0}\xrightarrow{d_{1}}_{\mathcal{A}}\kappa_{1}\xrightarrow{d_{2}}_{% \mathcal{A}}\dotsc\xrightarrow{d_{n}}_{\mathcal{A}}\kappa_{n}$ where $\kappa_{i}\in\mathbb{C}_{\mathcal{A}}$ and $\kappa_{0}$ is the initial configuration, defined as $\kappa_{0}=(q_{0},\varnothing)$ where $\varnothing$ is the register assignment with empty domain. The pDRA ${\mathcal{A}}$ is deterministic in the sense that there is at most one transition per datum $d\in\mathcal{D}$ : given a configuration $(p,\rho_{1})$ , there is at most one configuration $(q,\rho_{2})$ such that $(p,\rho_{1})\xrightarrow{d}_{{\mathcal{A}}}(q,\rho_{2})$ .

Additionally, we also note that the transitions preserve the “injectivity” of the register assignments. Indeed, the $\delta_{=}$ transitions do not change the data stored in the registers but merely rearrange the storage or delete some values. New data are stored via $\delta_{\neq}$ transitions, which ensures that the new datum $d$ differs from all existing stored values in registers.

If $w$ is a data word and ${\mathcal{A}}$ has a run on $w$ from a configuration $(q,\rho)$ to a configuration $(q^{\prime},\rho^{\prime})$ , then we write $(q,\rho)\xrightarrow{w}_{{\mathcal{A}}}(q^{\prime},\rho^{\prime})$ . The language recognised by $(q,\rho)$ is defined as $\mathcal{L}(q,\rho)=\{w\in\mathcal{D}^{*}\mid(q,\rho)\xrightarrow{w}_{{% \mathcal{A}}}(q^{\prime},\rho^{\prime})\text{ and }q^{\prime}\in F\}$ . Accordingly, the language recognised by ${\mathcal{A}}$ is defined as $\mathcal{L}({\mathcal{A}})=\mathcal{L}(\kappa_{0})$ .

Definition 4.

We call regular data language any data language which is recognized by a pDRA. Given a data word $u$ and a language $L$ , the residual of $L$ by $u$ is the set $u^{-1}L=\{w\mid uw\in L\}$ .

Note that while every data language $L$ is equivariant, $u^{-1}L$ may not necessarily be so, although $\mathsf{supp}(u^{-1}L)\subseteq\mathsf{supp}(u)$ .

MRT-automata [17] are the permutation-free restriction of pDRAs, i.e. only the identity permutation is used. This restriction severely affects succinctness, as witnessed e.g. by the family of languages $L_{n}=\{d_{1}...d_{n}d_{\pi(1)}...d_{\pi(n)}\mid\forall i\neq j.\,d_{i}\neq d_% {j}\land\pi\in\mathcal{S}_{n}\}$ .

Lemma 5.

pDRAs are exponentially more succinct than MRT-automata.

$\blacktriangleright$ Remark 6 (Deterministic).

MRT-automata are equi-expressive as (deterministic) register automata [13], and so pDRAs can recognize any language recognizable by an MRT-automaton or a register automaton. Conversely, having permutations on transitions only brings succinctness, but no additional computational power. In fact, they can be simulated by hardcoding the order of the registers in the state space, at the price of an exponential blow-up. Alternatively, the configuration graphs of pDRAs correspond to nominal automata with straight state sets, which are equivalent to orbit-finite nominal automata and register automata [5].

3 Decision problems for register automata with permutations

We now examine basic decision problems for pDRAs. We shall assume that equality and disequality tests are done in constant time. We start with a result that can be lifted directly from finite-state automata. The membership problem asks, given a pDRA ${\mathcal{A}}$ and data word $w$ , if $w\in\mathcal{L}({\mathcal{A}})$ holds. The non-emptiness problem asks, given a pDRA ${\mathcal{A}}$ , if $\mathcal{L}({\mathcal{A}})\neq\varnothing$ .

Lemma 7.

For pDRAs, membership is in PTime, while non-emptiness is in NLogSpace.

We next look at three related notions of equivalence.

Bisimilarity: Let ${\mathcal{A}}$ be a pDRA, and $\rightarrow_{A}\subseteq\mathbb{C}_{\mathcal{A}}\times\mathcal{D}\times\mathbb% {C}_{\mathcal{A}}$ its associated labelled transition system, as defined in Section 2. A relation $R\subseteq\mathbb{C}_{\mathcal{A}}\times\mathbb{C}_{\mathcal{A}}$ is a bisimulation if for every $(\kappa_{1},\kappa_{2})\in R$ it holds that, for each $i\in\{1,2\}$ and $\kappa_{i}\xrightarrow{d}_{\mathcal{A}}\kappa_{i}^{\prime}$ , there exists $\kappa_{3-i}\xrightarrow{d}_{\mathcal{A}}\kappa_{3-i}^{\prime}$ such that $(\kappa^{\prime}_{1},\kappa^{\prime}_{2})\in R$ . Two configurations $\kappa_{1}$ and $\kappa_{2}$ are bisimilar, denoted by $\kappa_{1}\simeq\kappa_{2}$ , if there exists a bisimulation $R$ where $(\kappa_{1},\kappa_{2})\in R$ . The bisimilarity problem is defined as follows: given two configurations $\kappa_{1},\kappa_{2}$ of a given pDRA $\mathcal{A}$ , decide if $\kappa_{1}\simeq\kappa_{2}$ holds.
Residual equivalence: This problem asks: given two pDRAs $\mathcal{A}_{1}$ and $\mathcal{A}_{2}$ and two configurations $\kappa_{1},\kappa_{2}$ of ${\mathcal{A}}_{1}$ and ${\mathcal{A}}_{2}$ respectively, decide whether $\mathcal{L}_{{\mathcal{A}}_{1}}(\kappa_{1})=\mathcal{L}_{{\mathcal{A}}_{2}}(% \kappa_{2})$ holds.
(Language) equivalence: Finally, for this problem, given two pDRAs $\mathcal{A}_{1}$ and $\mathcal{A}_{2}$ , we need to decide whether $\mathcal{L}({\mathcal{A}}_{1})=\mathcal{L}({\mathcal{A}}_{2})$ holds. This is a special case of residual equivalence.

We note that, as pDRA are deterministic, the residual equivalence problem can be reduced to checking bisimilarity $\kappa_{1}\simeq\kappa_{2}$ in a slight modification of the disjoint union of ${\mathcal{A}}_{1}$ and ${\mathcal{A}}_{2}$ .

Theorem 8.

Bisimilarity, residual equivalence and (language) equivalence are all in PTime for pDRAs.

The methodology we use to prove this bound for bisimilarity is closely related to [17]. It is based on building a candidate bisimulation relation on the fly, and relying on a group-based representation of its equivalence classes to keep the representation in polynomial space, and invoking PTime-subroutines from computational group theory (e.g. for group membership). Then, the bound for residual (and plain) equivalence follows from the observation above relating this problem to bisimilarity in the deterministic setting.

In the remainder of this section we study a notion describing the data that “must be remembered” when processing data words [4].

Memorability: Given a data language $L\subseteq\mathcal{D}^{*}$ , a word $u\in L$ and a datum $d\in\mathcal{D}$ , we say that $d$ is $L$ -memorable in $u$ if $d\in\mathsf{supp}(u^{-1}L)$ . We let ${\sf mem}_{L}(u)$ be the set of all $L$ -memorable data of $u$ , i.e. ${\sf mem}_{L}(u)=\mathsf{supp}(u^{-1}L)$ .
Frugality: We say that a pDRA ${\mathcal{A}}$ is frugal if, for all configurations $(q,\!\rho)$ , ${\sf rng}(\rho)\!=\!\mathsf{supp}(\mathcal{L}(q,\rho))$ .
Minimality: A pDRA $\cal A$ is called state-minimal if there is no $\cal A^{\prime}$ with less states than $\cal A$ such that $L(\mathcal{A})=L(\mathcal{A^{\prime}})$ . It is called minimal if it is complete, frugal and state-minimal.

We can see that frugality essentially requires that all names stored in registers inside any configuration be $\mathcal{L}({\mathcal{A}})$ -memorable (in all words reaching said configuration).

A more useful algorithmically version of memorability is given below. We use it to show that any pDRA recognizing a language $L$ has to store at least the $L$ -memorable data, thereby justifying the term “memorable”. In Section 4 we show that it suffices to store only $L$ -memorable data.

Lemma 9.

The following hold:

1.

Let $d^{\prime}$ be fresh in $u$ . Then $d$ is $L$ -memorable in $u$ iff $u^{-1}L\neq((d\ d^{\prime})\cdot u)^{-1}L$ .
2.

Let $A$ be a pDRA recognizing a language $L$ , and $(q,\rho)$ be the configuration reached by $A$ upon reading a word $u$ . Then ${\sf mem}_{L}(u)\subseteq{\sf rng}(\rho)$ .

Proof.

For 1, by nominal sets reasoning, $d\in\mathsf{supp}(u^{-1}L)$ iff $u^{-1}L\neq((d\ d^{\prime})\cdot u)^{-1}L$ for any fresh $d^{\prime}$ .
For 2, suppose there exists $d\not\in{\sf rng}(\rho)$ such that $d\in{\sf mem}_{L}(u)$ . By 1, $u^{-1}L\neq((d\ d^{\prime})\cdot u)^{-1}L$ . However, as $d^{\prime}$ is fresh in $u$ and $d$ is not stored in $\rho$ , both $u$ and $(d\ d^{\prime})\cdot u$ reach the same configuration $(q,\rho)$ , which implies $u^{-1}L=\mathcal{L}(q,\rho)=((d\ d^{\prime})\cdot u)^{-1}L$ , contradiction. $\hfill\blacktriangleleft$

Example 10.

We can see that ${\sf mem}_{\mathcal{D}^{*}}(w)=\emptyset$ for any $w\in\mathcal{D}^{*}$ . Next, consider the language:

L_{1}=\{abwab\mid a\neq b\in\mathcal{D}\land w\in\mathcal{D}^{*}\}

then, for all $a\in\mathcal{D}$ and $w\in\mathcal{D}^{*}$ , ${\sf mem}_{L_{1}}(\epsilon)={\sf mem}_{L_{1}}(aaw)=\emptyset$ . For all $b\in\mathcal{D}$ such that $a\neq b$ , ${\sf mem}_{L_{1}}(abw)=\{a,b\}$ , because $w$ can always be completed by a suffix $v$ ending by $a b$ (so $abwv\in L_{1}$ ), or by a different ending (so $abwv\not\in L_{1}$ ).
On the other hand, consider the language of all words where the first and last element are repeated and distinct:

L_{2}=\{aawbb\mid a\neq b\in{\mathcal{D}}\land w\in\mathcal{D}^{*}\}

then, for all $a\neq b\in\mathcal{D}$ and $w\in\mathcal{D}^{*}$ , we have ${\sf mem}_{L_{2}}(\epsilon)={\sf mem}_{L_{2}}(abw)=\emptyset$ and ${\sf mem}_{L_{2}}(a)={\sf mem}_{L_{2}}(aa)={\sf mem}_{L_{2}}(aawa)=\{a\}$ . Note that $a$ is $L_{2}$ -memorable in $a a w$ , as, for instance $aawbb\in L_{2}$ , while $aawaa\not\in L_{2}$ . We also have ${\sf mem}_{L_{2}}(aawb)=\{a,b\}$ . Indeed, $a$ is $L_{2}$ -memorable (as above), while for $b$ we have $aawbb\in L_{2}$ whereas $aawbc\not\in L_{2}$ for any $c\neq b$ .

The memorability problem asks, given as input a pDRA ${\mathcal{A}}$ , a word $u$ and a datum $d$ in $u$ , whether $d$ is ${\cal L}({\mathcal{A}})$ -memorable in $u$ .

Theorem 11.

The memorability problem is in PTime for pDRAs. Moreover, any pDRA can be transformed, in polynomial time, into an equivalent frugal pDRA with the same number of states and transitions.

Proof.

For the first claim, following Lemma 9, take any datum $d^{\prime}$ fresh in $u$ . It suffices to compute the configurations $(q,\rho)$ and $(q^{\prime},\rho^{\prime})$ reached by the pDRA on reading $u$ and $(d\ d^{\prime})\cdot u$ respectively, and check if $\mathcal{L}(q,\rho)\neq\mathcal{L}(q^{\prime},\rho^{\prime})$ holds, which is in PTime (cf. Theorem 8).
For the second claim, we only provide a proof sketch. We show that the set of registers which contain the memorable data in a configuration, only depends on the state of the configuration. This allows one to turn any pDRA into a frugal one. $\hfill\blacktriangleleft$

4 Minimal register automata with permutations

In this section, given a data word language $L$ , we define a Myhill-Nerode equivalence relation $\equiv_{L}$ on data words, which is shown to have finite index iff $L$ is recognizable by a pDRA. Based on this equivalence relation, we show how to construct a unique and minimal pDRA. This generalizes the results of [4] to pDRAs.

For any two data words $u,v\in\mathcal{D}^{*}$ , we write $u=_{L}v$ whenever $(u\in L\Leftrightarrow v\in L)$ holds. Moreover, we recall that $u^{-1}L=\{w\in{\cal D}^{*}\mid uw\in L\}$ .

Definition 12 (Data word equivalence relation).

Given a data language $L\subseteq\mathcal{D}^{*}$ and two data words $u, v$ , we say that $u$ and $v$ are $L$ -equivalent, written $u\equiv_{L}v$ (or just $u\equiv v$ if $L$ is clear from the context), whenever there exists a data permutation $\tau$ such that $u^{-1}L=(\tau\cdot v)^{-1}L$ , i.e, for all $w\in\mathcal{D}^{*}$ , $uw=_{L}(\tau\cdot v)w$ .

Proposition 13.

$\equiv_{L}$ is an equivalence relation.

We may write $u\equiv_{L}^{\tau}v$ to emphasize $\tau$ in the definition above, however $u\equiv_{L}^{\tau}v$ is not an equivalence relation in general. We can immediately see the following.

Lemma 14.

If $u\equiv_{L}^{\tau}v$ , then ${\sf mem}_{L}(u)=\tau\cdot{\sf mem}_{L}(v)$ . Hence $|{\sf mem}_{L}(u)|=|{\sf mem}_{L}(v)|$ .

Proof.

Since $u\equiv^{\tau}_{L}v$ , we have $u^{-1}L=(\tau\cdot v)^{-1}L$ . By nominal sets reasoning (see e.g. [21, Ch. 2]), we have $\mathsf{supp}(u^{-1}L)=\mathsf{supp}((\tau\cdot v)^{-1}L)=\mathsf{supp}(\tau% \cdot(v^{-1}L))=\tau\cdot\mathsf{supp}(v^{-1}L)$ . Hence $|{\sf mem}_{L}(u)|=|{\sf mem}_{L}(v)|$ . $\hfill\blacktriangleleft$

We illustrate data word equivalence via a simple albeit non-trivial example. We then focus on the suitability of this relation as a Myhill-Nerode equivalence relation.

Example 15.

Consider the language $L=\{abab\mid a\neq b\in\mathcal{D}\}\cup\{abba\mid a\neq b\in\mathcal{D}\}$ . Let $a\neq b\in\mathcal{D}$ . There are six equivalence classes: $[\epsilon],[a],[aa],[ab],[aba],[abab]$ . Any non-empty word which is not a prefix of some word in $L$ is equivalent to $a a$ . Now, $aba\equiv_{L}^{\tau}abb$ holds for any data permutation $\tau$ which swaps $a$ and $b$ . Indeed, $(aba)^{-1}L=\{b\}$ , and $(\tau\cdot abb)^{-1}L=(baa)^{-1}L=\{b\}$ . Finally, $abab\equiv_{L}^{\sf id}abba$ as they are both in $L$ .

Theorem 16.

A language $L\subseteq\mathcal{D}^{*}$ is recognizable by a pDRA iff $\equiv_{L}$ has finite index.

Proof.

Let ${\mathcal{A}}$ be a pDRA recognizing $L$ . Take any state $q$ and let $u,v\in\mathcal{D}^{*}$ be such that: $(q_{0},\varnothing)\xrightarrow{u}_{\mathcal{A}}(q,\rho_{1})$ and $(q_{0},\varnothing)\xrightarrow{v}_{\mathcal{A}}(q,\rho_{2})$ . Since $\rho_{1},\rho_{2}$ are injective (as by definition of pDRA, there are no duplicates in register assignments), there exists a data permutation $\tau$ such that $\tau|_{{\sf rng}(\rho_{2})}=\rho_{2}^{-1};\rho_{1}$ . Then, from $(q_{0},\varnothing)\xrightarrow{v}_{\mathcal{A}}(q,\rho_{2})$ we get $(q_{0},\varnothing)\xrightarrow{\tau\cdot v}_{\mathcal{A}}(q,\rho_{2};\tau)=(q% ,\rho_{1})$ . Therefore, $u^{-1}L=(\tau\cdot v)^{-1}L$ , i.e. $u\equiv_{L}v$ . This implies that $\equiv_{L}$ has finite index.
The other direction is based on the construction of a canonical automaton explained below. $\hfill\blacktriangleleft$

Construction of a canonical automaton.

Let us assume that $\equiv_{L}$ has finite index. For each equivalence class $c$ of $\equiv_{L}$ , let $u_{c}$ be some representative of $c$ and $\rho_{c}:[1,|{\sf mem}_{L}(u_{c})|]\rightarrow{\sf mem}_{L}(u_{c})$ a register assignment. For any word $u$ , we hereby denote by $[u]$ its $\equiv_{L}$ -equivalence class. We now construct a minimal pDRA ${\mathcal{A}}_{L}$ recognizing $L$ , whose states are the chosen class representatives, with the following invariant (shown in Lemma 18 below): on reading any word $v$ , ${\mathcal{A}}_{L}$ reaches configuration $(u_{c},\rho_{c};\tau^{-1})$ for some $\tau$ such that $u_{c}\equiv_{L}^{\tau}v$ . We construct the minimal pDRA ${\mathcal{A}}_{L}=(Q_{L},\mu_{L},q_{0},F_{L},\delta_{L}=\delta_{=}\uplus\delta% _{\neq})$ as follows:

$\blacksquare$

the number of registers is $r=\max(\{|{\sf mem}_{L}(u)|\mid u\in\mathcal{D}^{*}\})$ (it exists by Lemma 14);
$\blacksquare$

$Q_{L}=\{u_{c}\mid c\in\mathcal{D}^{*}/_{\equiv_{L}}\}$ , $\mu_{L}(u_{c})=[1,|{\sf mem}_{L}(u_{c})|]$ for all $u_{c}\in Q_{L}$ , and $q_{0}=u_{[\epsilon]}$ ;
$\blacksquare$

$F_{L}=\{u_{c}\mid u_{c}\in L\}$ ;
$\blacksquare$

$\delta_{=}$ is defined as follows. Let $u_{c}$ be some representative and $d\in{\sf mem}_{L}(u_{c})$ . After a successful equality test with register $k$ , the automaton goes to state $u_{c^{\prime}}$ where $c^{\prime}=[u_{c}d]$ and $d=\rho_{c}(k)$ . So, we add the transition $\delta_{=}(u_{c},k)=(u_{c^{\prime}},\pi)$ for some $\pi$ that we now construct. The registers need to permuted to account for the fact some data of ${\sf mem}_{L}(u_{c})$ may need to be deleted (if they are not memorable in $u_{c}d$ ), and the fact that $u_{c}d$ and $u_{c^{\prime}}$ have equal residual languages up to some permutation $\tau$ . Let $i=|{\sf mem}_{L}(u_{c})|$ , $j=|{\sf mem}_{L}(u_{c^{\prime}})|$ and $\tau:\mathcal{D}\rightarrow\mathcal{D}$ such that $u_{c^{\prime}}\equiv^{\tau}_{L}u_{c}d$ . Let $\pi^{\prime}=\rho_{c};\tau;\rho_{c^{\prime}}^{-1}$ . Note that $j\leq i$ , ${\sf rng}(\pi^{\prime})=[1,j]$ and $\mathsf{dom}(\pi^{\prime})\subseteq[1,i]$ . Then, $\pi:[1,i]\rightarrow[1,i]$ is defined to be a canonical extension of $\pi^{\prime}$ to $[1,i]$ .
$\blacksquare$

$\delta_{\neq}$ is defined similarly as $\delta_{=}$ , we let $\delta_{\neq}(u_{c})=(u_{c^{\prime}},i+1,\pi)$ where $c^{\prime}=[u_{c}d]$ for $d$ any fresh datum, and $\pi$ is a canonical extension of $\pi^{\prime}$ to $[1,i+1]$ , for $\pi^{\prime}=\rho_{c}[i+1\mapsto d];\tau;\rho_{c^{\prime}}^{-1}$ . So, $d$ is stored in the first available register ( $i+1$ ), before the permutation $\pi$ is applied.

Note that ${\mathcal{A}}_{L}$ is complete. We illustrate this construction with the following example.

Figure 2: Canonical automaton

{\mathcal{A}}_{L}

of Example 17. Only accessible states are shown.

Example 17.

Consider again the language of Example 15, where we have defined representatives for the six equivalence classes of $\equiv_{L}$ . Note that, ${\sf mem}_{L}(\epsilon)={\sf mem}_{L}(abab)=\varnothing$ , ${\sf mem}_{L}(a)=\{a\}$ , ${\sf mem}_{L}(ab)=\{a,b\}$ and ${\sf mem}_{L}(aba)=\{b\}$ . We depict in Figure 2 the canonical automaton obtained for those representatives and chosen assignments $\rho_{[a]}=\{1\mapsto a\}$ , $\rho_{[ab]}=\{1\mapsto a,2\mapsto b\}$ , and $\rho_{[aba]}=\{1\mapsto b\}$ . Looking at the transitions, consider for instance source state $a b$ and test on register $1$ . The transition goes to state $a b a$ . As $aba\equiv_{L}^{\sf id}aba$ , the permutation is a canonical extension on domain $\{1,2\}$ of $\pi^{\prime}=\rho_{[ab]};{\sf id};\rho_{[aba]}^{-1}=\{2\mapsto 1\}$ , i.e. transposition $(2\ 1)$ . Consider now a test on register $2$ . The next state is the representative equivalent to $a b b$ , i.e. $a b a$ , as $aba\equiv_{L}^{\tau}abb$ for any $\tau$ swapping $a$ and $b$ . So, the permutation is $\pi^{\prime}=\rho_{[ab]};\tau;\rho_{[aba]}^{-1}$ , i.e. the identity on $\{1,2\}$ .

We finally prove that $\mathcal{L}({\mathcal{A}}_{L})=L$ and that the automaton is minimal. For this, we use the fact that runs of ${\mathcal{A}}_{L}$ compute the $\equiv_{L}$ -equivalence class of the word they read.

Lemma 18.

1.

For all words $v\in\mathcal{D}^{*}$ , if ${\mathcal{A}}_{L}$ reaches configuration $(u_{c},\rho)$ on reading $v$ then there is some $\tau$ such that: $(1)$ $u_{c}\equiv^{\tau}_{L}v$ , $(2)$ $\rho_{c}=\rho;\tau$ , and $(3)$ ${\sf rng}(\rho)={\sf mem}_{L}(v)$ .
2.

$\mathcal{L}({\mathcal{A}}_{L})=L$ and ${\mathcal{A}}_{L}$ is minimal.

Proof.

For space reasons, we only prove 2 here.It is immediate as a consequence of 1 that $\mathcal{L}({\mathcal{A}}_{L})=L$ . Indeed, $v\in\mathcal{{\mathcal{A}}_{L}}$ iff the state $u_{c}$ reached by ${\mathcal{A}}_{L}$ on $v$ is accepting, iff $u_{c}\in L$ , iff $\epsilon\in u_{c}^{-1}L=(\tau\cdot v)^{-1}L$ for some $\tau$ , iff $\tau\cdot v\in L$ , iff $v\in L$ as $L$ is equivariant.
We prove that ${\mathcal{A}}_{L}$ is minimal. First, ${\mathcal{A}}_{L}$ is complete by definition. It is frugal by 1, as registers contain exactly the memorable data (and it is necessary as proved by Lemma 9). If it is not state minimal, then there exists a pDRA with less states than the index of $\equiv_{L}$ recognizing $L$ . Therefore, there exist two inequivalent words $u\not\equiv_{L}v$ which respectively reach two configurations $(q,\rho_{1})$ and $(q,\rho_{2})$ with the same state $q$ . As shown in the proof of Theorem 16, this implies that $u\equiv_{L}v$ , contradiction. $\hfill\blacktriangleleft$

Minimization of pDRAs.

Minimizing a pDRA ${\mathcal{A}}$ recognizing a language $L$ can be done in EXPTime. Indeed, it suffices to apply the construction of the canonical automaton ${\mathcal{A}}_{L}$ step-by-step, starting from initial state $\epsilon$ , based on a subroutine deciding $\equiv_{L}$ . From a state $u_{c}$ with chosen assignment $\rho_{c}$ created so far and for all data $d\in{\sf rng}(\rho_{c})\cup\text{min}(\mathcal{D}\setminus{\sf rng}(\rho_{c}))$ , the algorithm checks whether $u_{c}d\equiv_{L}u_{c^{\prime}}$ for some existing state $u_{c^{\prime}}$ , otherwise state $u_{c}d$ is created, with $\rho_{[u_{c}d]}$ being arbitrary. Whether $u_{1}\equiv_{L}u_{2}$ holds, given $u_{1},u_{2}$ and ${\mathcal{A}}$ , can be decided in EXPTime: compute the configurations $(q_{i},\rho_{i})$ reached by ${\mathcal{A}}$ on $u_{i}$ , and check if there exists a permutation $\tau:{\sf rng}(\rho_{2})\rightarrow{\sf rng}(\rho_{2})$ such that $L_{\mathcal{A}}(q_{1},\rho_{1})=L_{\mathcal{A}}(q_{2},\rho_{2};\tau)$ . The latter can be done in PTime by Theorem 8. Note that due to the enumeration of register permutations, the minimization procedure is exponential only in the number of registers, which in turn is in PTime for a constant number of registers.

Whether we can do better than exponential time is a hard question. We can show that checking whether a complete and frugal pDRA is not minimal (called pDRA-nonMin problem) is at least as hard as the Graph Isomorphism problem (GI). GI is known to be in NP but not known to be in coNP.

Proposition 19.

pDRA-nonMin is GI-hard.

$\blacktriangleright$ Remark 20 (Canonicity).

As mentioned above, the construction of a canonical automaton can be used to minimize any pDRA. The construction of ${\mathcal{A}}_{L}$ only depends on the choices of the representatives $u_{c}$ and register assignments $\rho_{c}$ for the $L$ -memorable data of those representatives. So, once these choices have been fixed, any pDRA recognizing $L$ can be minimized into a unique pDRA ${\mathcal{A}}_{L}$ . This justifies that ${\mathcal{A}}_{L}$ is called canonical.

A minimal DFA recognizing a regular language is unique up to (automata) isomorphism. An interesting question is how minimal pDRAs recognizing a regular data language $L$ relate to each other, i.e. what is the right notion of isomorphism. It turns out that minimal pDRAs are isomorphic, for a notion of isomorphism which also takes into account permutations of the registers. Due to lack of space, we do not include this notion here. This notion is not simply a bijection which puts a correspondence between states while preserving transitions, as the order in which data are stored in registers also matters. Consider the language $L$ of words with at most two distinct data. When reading a word $d_{1}d_{2}$ , with $d_{1}\neq d_{2}$ , any minimal pDRA recognizing $L$ either stores $(d_{1},d_{2})$ or $(d_{2},d_{1})$ . Our notion of isomorphism identifies configurations up to permutation of their registers, so that minimal pDRAs for $L$ are considered isomorphic, no matter the order in which memorable data are stored.

5 $\varPi$ -DRAs: pDRAs with a Fixed Permutation Policy

We next consider the model obtained by syntactically restricting pDRAs to a fixed register assignment policy. Informally, a policy $\varPi$ fixes permutations depending on the size of the memory and the test on this memory, only. So, it is not state nor transition dependent. In particular, when a pDRA is in some state with a memory $\rho:[1,\dots,i]\rightarrow\mathcal{D}$ , and reads a datum $d=\rho(j)$ for some $1\leq j\leq i$ , then it applies a permutation $\Pi(i,j)\in\mathcal{S}_{i}$ to its registers before possibly deleting some data (this deletion is still transition-dependent) and proceeds to the next state. The gaps created by deletion are always removed by shifting all the data to an initial segment $[1,\dots,i^{\prime}]$ for $i^{\prime}$ the new memory size. Similarly, if $d$ read is fresh, then it is stored in register $i+1$ , and a permutation $\Pi(i,i+1)$ is applied to the registers.

We let a permutation policy be a function $\varPi$ that takes as input the size $i$ of the memory and an index $j\in[1,i+1]$ and returns a permutation $\pi\in\mathcal{S}_{\max(i,j)}$ . Given $j$ and $E\subseteq[1,j]$ we let $\mathsf{del}_{j}(E)\in\mathcal{S}_{j}$ be the permutation that shifts the elements of $E$ to the right, and those not in $E$ to the left, preserving their respective order. For example, $\mathsf{del}_{6}(\{2,4\})$ is the permutation $(1,5,2,6,3,4)$ , and $\mathsf{del}_{6}(\varnothing)$ is the identity. Formally, $\mathsf{del}_{j}(E)(k)=j-(|E|-n)$ if $k$ is the $n$ th (smallest) element of $E$ , otherwise, when $k\not\in E$ , $\mathsf{del}_{j}(E)(k)=k-n$ where $n=|\{e\in E\mid e<k\}|$ . We just write $\mathsf{del}(E)$ when $j$ is clear from the context.

Definition 21.

Let $\varPi$ be a permutation policy. A pDRA ${\mathcal{A}}$ with $r$ registers is said to have fixed permutation policy $\varPi$ , and called a $\varPi$ -DRA , if for each state $q$ there is $i$ s.t. $\mu(q)=[1,i]$ and, for all $q\xrightarrow{x,\pi}p$ there are $j\in\{i,i+1\}$ and $k$ with $x=k^{\bullet}$ or $x=k$ , and s.t.:

$\blacksquare$

if $x=k^{\bullet}$ then $j=k=i+1$ ; and otherwise $x=k$ and $j=i$ ;
$\blacksquare$

$\pi$ is $\pi_{1};\pi_{2}$ (canonically extended to $[1,r]$ ), where $\pi_{1}=\varPi(i,k)$ and $\pi_{2}=\mathsf{del}_{j}(E)$ for some $E\subseteq[1,j]$ with $|E|=j-|\mu(p)|$ .

Note that when applying transition $q\xrightarrow{x,\pi}p$ , all the data stored in registers $E$ are moved to the segment $[|\mu(p)|+1,\dots,i]$ and as a result are deleted. Also note that $\pi$ only depends on $\varPi$ , $|\mu(q)|$ , $x$ and $E$ , so when the permutation policy $\varPi$ is understood from the context, we denote each transition as $q\xrightarrow{x,E}p$ instead of $q\xrightarrow{x,\pi}p$ .

Example 22.

A natural example of $\varPi$ -DRA is when the policy always returns the identity permutation ${\sf id}$ . An ${\sf id}$ -DRA preserves the order in which the data were stored in memory. This can model processes where the memory is a queue of bounded size. It suffices to require that a datum read as input is deleted from memory when it is equal to the first memorized datum, or when it is fresh but the queue is full (otherwise, by definition of ${\sf id}$ -DRA, the fresh datum can be stored in the last register only).

Example 23 (Lar-automata).

The model by Benedikt, Ley, Puppis in [4] is exactly a pDRA with fixed permutation policy Lar (for Last Appearance Record):

\text{\sc Lar}(i,j)=\begin{cases}(1,\dots,j-1,i,j,j+1,\dots,i-1)&\text{ if }j% \in[1,i]\\ {\sf id}&\text{ if }j=i+1\end{cases}

In other words, a Lar-DRA saves (distinct) values in its registers ordering them according to their last appearance. In [4], the authors define a Myhill-Nerode relation for data languages allowing them to construct a minimal and canonical Lar-DRAs for any regular data language, though in exponential time. We extend and improve this by showing that pDRAs with any fixed permutation policy $\varPi$ admit a canonical model and can be minimized in PTime.

For the remainder of this section, let $\varPi$ be some fixed permutation policy. Given an arbitrary pDRA, we can always hardcode the permutations in the states (modulo an exponential blow-up) to turn it into a $\varPi$ -DRA.

Proposition 24.

Given a pDRA $\mathcal{A}$ , one can construct a (possibly exponentially larger) $\varPi$ -DRA $\cal B$ such that $L(\mathcal{A})=L(\mathcal{B})$ . Moreover, if $\mathcal{A}$ is frugal then so is $\cal B$ .

The exponential blow-up is in general unavoidable. We can show that pDRAs are exponentially more succinct than Lar-DRAs. We conjecture the blow-up is unavoidable for any fixed permutation policy $\varPi$ .

Lemma 25.

pDRAs are exponentially more succinct than Lar-DRAs.

Word of memorable data.

We introduce some notions towards an equivalence relation on data words, whose index corresponds to the minimal number of states that suffices for a $\varPi$ -DRA to recognize a data language $L$ . A key ingredient, unlike in pDRAs, is that the order in which $L$ -memorable data are stored by a $\varPi$ -DRA is canonical, i.e., it only depends on $\varPi$ . Thus we can define, for any word $w\in\mathcal{D}^{*}$ , a word ${\sf mem}_{L}^{\varPi}(w)$ which orders the $L$ -memorable data of $w$ . We illustrate it for $\varPi=\text{\sc Lar}$ . Initially, ${\sf mem}_{L}^{\text{\sc Lar}}(\epsilon)=\epsilon$ . Consider a word $w=1234$ where ${\sf mem}_{L}^{\text{\sc Lar}}(w)=1234$ and ${\sf mem}_{L}(w2)=\{2,3\}$ . Then, ${\sf mem}_{L}^{\text{\sc Lar}}(w2)=32$ .

In general, for an arbitrary policy $\varPi$ , ${\sf mem}_{L}^{\varPi}(w)$ can informally be defined as the register assignment reached by any frugal $\varPi$ -DRA recognizing $L$ and reading $w$ (modulo considering infinite state $\varPi$ -DRAs, as $L$ is equivariant but not necessarily regular). Formally, ${\sf mem}_{L}^{\varPi}(\epsilon)=\epsilon$ , and ${\sf mem}_{L}^{\varPi}(wd)$ is built from $m={\sf mem}_{L}^{\varPi}(w)$ and $M={\sf mem}_{L}(wd)$ as follows. First, let $m^{\prime}=md$ if $d$ is not $L$ -memorable in $w$ , and $m^{\prime}=m$ otherwise. Let $k$ be such that $m^{\prime}[k]=d$ . The positions of $m^{\prime}$ are reordered, giving a word $m^{\prime\prime}$ of same length, obtained by applying permutation $\varPi(|m|,k)$ on $m^{\prime}$ . Then, ${\sf mem}_{L}^{\varPi}(wd)$ is obtained by applying on $m^{\prime\prime}$ the erasing morphism which replaces any datum not in $M$ by $\epsilon$ .

Proposition 26.

The word ${\sf mem}_{L}^{\varPi}(w)\in\mathcal{D}^{*}$ is an enumeration of ${\sf mem}_{L}(w)\subseteq\mathcal{D}$ .

Abstract residual.

In the remainder of this section we fix a permutation policy $\varPi$ . Given a sequence of distinct data $\vec{a}\in{\cal D}^{*}$ and any object $X$ with finite support, we define the $\vec{a}$ -abstraction of the latter as $\langle\vec{a}\rangle X=\{\tau\cdot(\vec{a},X)\mid\tau\text{ fixes }\mathsf{% supp}(X)\setminus\{\vec{a}\}\}$ . Given a language $L$ and data word $u$ , we set $u^{-1}_{\varPi}L=\langle{\sf mem}^{\varPi}_{L}(u)\rangle(u^{-1}L)$ . Note that unravelling the last part of the definition above we have $u^{-1}_{\varPi}L=\{\tau\cdot({\sf mem}^{\varPi}_{L}(u),u^{-1}L)\mid\text{any }\tau\}$ .

Example 27.

Recall language $L_{2}$ from Example 10. For all $a,b\in\mathcal{D}$ :

(aa)_{\varPi}^{-1}L_{2}\ =\ \langle a\rangle((aa)^{-1}L_{2})\ =\ \{(d,\{wd^{% \prime}d^{\prime}\mid d\neq d^{\prime}\land w\in\mathcal{D}^{*}\})\mid d\in% \mathcal{D}\}\ =\ (bb)_{\varPi}^{-1}L_{2}

Canonical $\varPi$ -DRA.

Unlike $u^{-1}L$ , it can be shown that $u^{-1}_{\varPi}L$ has empty support, as it abstracts away the memorable data of $u$ . Based on this abstract residual notion, we define the following word equivalence “up-to memorable data”.

Definition 28 ( $\varPi$ -equivalence).

Given a language $L$ and data words $u, v$ , we let $\mathrel{\equiv^{\varPi}_{L}}$ be the word relation defined by $u\mathrel{\equiv^{\varPi}_{L}}v$ if $u^{-1}_{\varPi}L=v^{-1}_{\varPi}L$ .

The following lemma states that $\mathrel{\equiv^{\varPi}_{L}}$ refines $\equiv_{L}$ .

Lemma 29.

If $u\mathrel{\equiv^{\varPi}_{L}}v$ then there is some $\tau$ such that ${\sf mem}^{\varPi}_{L}(u)=\tau\cdot{\sf mem}^{\varPi}_{L}(v)\land u^{-1}L=(% \tau\cdot v)^{-1}L$ and, therefore, $u\equiv_{L}v$ .

$\blacktriangleright$ Remark 30.

The converse of Lemma 29 does not necessarily hold. For example, for $\varPi=\text{\sc Lar}$ , consider the language: $L_{3}=\{abbaab,ababab\mid a\neq b\in{\cal D}\}$ .
Take $a,b\in\mathcal{D}$ such that $a\neq b$ . Then $abba\equiv_{L_{3}}abab$ because $(abba)^{-1}L_{3}=(abab)^{-1}L_{3}=\{ab\}$ . However, $(abab)^{-1}_{\text{\sc Lar}}L_{3}=\{(a^{\prime}b^{\prime},\{a^{\prime}b^{% \prime}\})\mid a^{\prime}\neq b^{\prime}\in\mathcal{D}\}$ and $(abba)^{-1}_{\text{\sc Lar}}L_{3}=\{(b^{\prime}a^{\prime},\{a^{\prime}b^{% \prime}\})\mid a^{\prime}\neq b^{\prime}\in\mathcal{D}\}$ , hence $abba\not\equiv_{L_{3}}^{\text{\sc Lar}}abab$ .

The next lemma formalizes the intuition behind the definition of ${\sf mem}_{L}^{\varPi}(u)$ .

Lemma 31.

Let $\mathcal{A}$ be a frugal $\varPi$ -DRA and $u$ a word such that $\mathcal{A}$ reaches configuration $(q,\rho)$ when reading $u$ . Then, $\rho={\sf mem}^{\varPi}_{L}(u)$ .

Proof.

By Lemma 9 and frugality we have that $\rho$ and ${\sf mem}^{\varPi}_{L}(u)$ contain the same data. Using induction on the length of $u$ , we can show that their data are in the same order. $\hfill\blacktriangleleft$

We finally show a Myhill-Nerode theorem for $\varPi$ -DRAs involving $\mathrel{\equiv^{\varPi}}$ . Note that we can reformulate $\mathrel{\equiv^{\varPi}}$ so that $\equiv^{\text{\sc Lar}}$ coincide with word equivalence as defined in [4]. Thus, Theorem 32 generalizes the result of [4] following an alternative route using abstract residuals.

Theorem 32.

A language $L\subseteq\mathcal{D}^{*}$ is recognizable by a pDRA with fixed permutation policy $\varPi$ iff $\mathrel{\equiv^{\varPi}_{L}}$ has finite index.

Proof.

The R2L direction follows from Theorem 16, Lemma 29 and Proposition 24. In the next section (Theorem 34), we show that $\mathrel{\equiv^{\varPi}_{L}}$ allows one to construct a unique minimal $\Pi$ -DRA for $L$ (up to state renaming). We prove it via an active learning algorithm.
For the L2R direction, suppose $L$ is recognizable by a $\varPi$ -DRA ${\mathcal{A}}$ . By Proposition 24, we can assume that $\mathcal{A}$ is frugal. Given two words $u,u^{\prime}$ suppose they reach configurations $(q,\rho),(q,\rho^{\prime})$ , and say $\rho=\vec{a}$ and $\rho^{\prime}=\vec{a}^{\prime}$ . Let $(\vec{a}\mapsto\vec{a}^{\prime})$ be the data permutation mapping $\vec{a}$ to $\vec{a}^{\prime}$ component-wise. Then, $\rho^{\prime}=(\vec{a}\mapsto\vec{a}^{\prime})\cdot\rho$ and thus $u^{\prime-1}L=(\vec{a}\mapsto\vec{a}^{\prime})\cdot u^{-1}L$ . By Lemma 31, $\vec{a}={\sf mem}^{\varPi}_{L}(u)$ and $\vec{a}^{\prime}={\sf mem}^{\varPi}_{L}(u^{\prime})$ . We thus have that: $\langle{\sf mem}^{\varPi}_{L}(u)\rangle u^{-1}L=(\vec{a}\mapsto\vec{a}^{\prime% })\cdot\langle{\sf mem}^{\varPi}_{L}(u)\rangle u^{-1}L=\langle(\vec{a}\mapsto% \vec{a}^{\prime})\cdot{\sf mem}^{\varPi}_{L}(u)\rangle((\vec{a}\mapsto\vec{a}^% {\prime})\cdot u^{-1}L)=\langle{\sf mem}^{\varPi}_{L}(u^{\prime})\rangle u^{% \prime-1}L$ . $\hfill\blacktriangleleft$

6 Active Learning and Minimization of $\varPi$ -DRAs

Let $L$ be a language recognizable by a $\varPi$ -DRA. We show that it is possible to learn in PTime a canonical and minimal $\varPi$ -DRA for $L$ in the active framework [1, 25], relying on the answers to (polynomially-many) queries from oracles of membership $q_{m}$ , language equivalence $q_{e}$ and data memorability $q_{r}$ (see Section 3 for definition). Since the involved oracles can be implemented in PTime, this yields also a PTime minimization procedure for $\varPi$ -DRAs.

We give some intuition and notation concerning the learning procedure for $\varPi$ -DRAs. We assume that $\mathcal{D}$ has an arbitrary linear order, which we extend to $\mathcal{D}^{*}$ by length-lexicographic order. The $\varPi$ -DRA learner maintains in $B$ the set of known (representatives of) distinct $\equiv^{\varPi}_{L}$ -classes. Initially, $B=\{\epsilon\}$ and each data-word $w=ud$ added to $B$ , where $u\in B$ , has the following property: either $d\in mem_{L}(u)$ or $d$ is a fresh (minimal) value. Therefore $B$ somehow codes a tree-shaped subgraph of the canonical $\varPi$ -DRA for $L$ . Moreover, the learner grows a sample-set $S=(S^{+},S^{-})$ , where $S^{+}\subseteq L$ and $S^{-}\subseteq\mathcal{D}^{*}\setminus L$ are sets of positive and negative samples respectively. Such sample-set $S$ is used to guarantee the invariant $B\subseteq\mathcal{D}^{*}/_{\equiv^{\varPi}_{L}}$ and to forbid the completion of the tree-shaped $\varPi$ -DRA coded by $B$ with the addition of wrong transitions. A transition from $u\in B$ to $v\in B$ on $d\in mem_{L}(u)$ (or fresh) is forbidden by $S$ if $u d$ is apart from $v$ according to $S$ , written $ud\#_{S}v$ , formalized below.

Definition 33 (Apartness).

Two words $u, v$ are said apart according to a sample-set $S=(S^{+}\subseteq L,S^{-}\subseteq\mathcal{D}^{*}\setminus L)$ for $L$ , written $u\#_{S}v$ , if:

$\blacksquare$

either $|mem^{\varPi}_{L}(u)|\neq|mem^{\varPi}_{L}(v)|$ ,
$\blacksquare$

or $\exists u^{\prime},v^{\prime}$ s.t. $mem^{\varPi}_{L}(u)u^{\prime}\simeq mem^{\varPi}_{L}(v)v^{\prime}$ , but $vv^{\prime}\in S^{+}$ iff $uu^{\prime}\in S^{-}$ .

Recall that $u\simeq v$ denotes that $u=\tau\cdot v$ for some $\tau\in\mathrm{Perm}(\mathcal{D})$ , that can be checked in PTime by inspecting whether $|v|=|u|=n$ and $v[i]=v[j]$ iff $u[i]=u[j]$ , for all $1\leq i,j\leq n$ .

Algorithm 1

\varPi

-DRA_Learn.

Description of the (active) learning algorithm.

We describe the learning procedure, given by Algorithm 1. The set $B$ is initialized to $\{\epsilon\}$ , and $S^{+}=S^{-}$ to $\emptyset$ . Then the learner proceeds with the main loop consisting of three main steps.

The first step (lines 3-4) is a while-loop growing $B$ as long as there is $u\in B$ such that any transition out from $u$ on $d$ (where $d\in mem_{L}(u)$ or $d$ is a fresh minimal value), towards any $v\in B$ , is forbidden by $S$ , in which case $u d$ is added to $B$ .

In the second step (line 5), $B$ is used to construct a hypothesis $\varPi$ -DRA. In particular, the tree-shaped $\varPi$ -DRA coded by $B$ is used to build a complete hypothesis $\varPi$ -DRA ${\mathcal{A}}$ as follows. The set of states of ${\mathcal{A}}$ is exactly $B$ . The membership oracle $q_{m}$ is used to set final states: $u\in B$ is final iff $u\in L$ . Let $u\in B$ , $d\in mem_{L}(u)$ and $x$ such that $mem_{L}^{\varPi}(u)[x]=d$ . If $ud\in B$ , then $u d$ is apart from any $z$ in $B$ , and we let $u^{\prime}=ud$ . Otherwise, $u d$ is not apart from some $z\in B$ , and we let $u^{\prime}=z$ (for $z$ chosen arbitrarily). The transition $u\xrightarrow{x,E}z$ is added to ${\mathcal{A}}$ , where $E$ is chosen so that it ensures $(u,{\sf mem}_{L}^{\varPi}(u))\xrightarrow{d}_{\mathcal{A}}(z,{\sf mem}_{L}^{% \varPi}(ud))$ holds in the LTS of configurations. The transitions in $\delta_{\neq}$ for fresh values are similarly set up.

In the third step (lines 6-8), the hypothesis is submitted to the equivalence oracle $q_{e}$ that either accepts it, or provides a counterexample $w$ . If $q_{e}$ accepts the hypothesis, the algorithm terminates and returns ${\mathcal{A}}$ . Otherwise, the algorithm calls a procedure to process the counter-example, ProcessCEX $(w,B,{\mathcal{A}})$ This procedure returns a pair of samples $(w,w^{\prime})$ witnessing that the current hypothesis ${\mathcal{A}}$ contains a wrong transition. Adding $(w,w^{\prime})$ to $S$ will forever forbid the latter wrong transition in future hypothesis. See the paragraph in the sequel for a more detailed description of ProcessCEX and its pseudocode. Hence, if a canonical $\varPi$ -DRA for $L$ contains $n$ states, $m$ transitions and $k$ is the maximum number of available registers in any state, then the $\varPi$ -DRA learner will perform at most $n^{2}(k+1)-m$ iterations of the main loop, since this is the number of detectable wrong transitions. Finally, Theorem 34 states that $\varPi$ -DRA_Learn is a correct polynomial learner for the class of $\varPi$ -DRAs.

Theorem 34.

Given membership oracle $q_{m}$ , an equivalence oracle $q_{e}$ and memory oracle $q_{r}$ for regular data language $L$ , $\varPi$ -DRA_Learn $(q_{m},q_{e},q_{r})$ learns the unique (up to state renaming) minimal $\varPi$ -DRA $A^{\varPi}_{L}=(R_{L},Q_{L},\mu_{L},q_{0},F_{L},\delta_{L})$ for $L$ in time polynomial wrt $|Q_{L}|$ , $|\delta_{L}|$ , $|R_{L}|$ and the size of the longest counterexample returned by $q_{e}$ .

In turn, this yields a polynomial upper bound for minimizing a given $\varPi$ -DRA as the oracles $q_{m},q_{e},q_{r}$ can be implemented in PTime, by Lemma 7, Theorem 8 and Theorem 11.

Corollary 35.

$\varPi$ -DRAs can be minimized in PTime.

Note that the above result is for a fixed policy $\varPi$ . A natural problem is to find, given a $\varPi$ -DRA, a minimal $\varPi^{\prime}$ -DRA amongst all the policies $\varPi^{\prime}$ . However, the GI-hardness proof (Proposition 19) can be slightly modified to show that this latter problem is also GI-hard.

Counter-example processing.

A key ingredient for proving Theorem 34 is the correctness of the procedure ProcessCEX. We informally describe this procedure and state its correctness.

ProcessCEX is a recursive procedure whose first step is decomposing the input counterexample as $w=ur$ , where $u$ is the longest prefix in $B$ . Being $u\in B$ final iff $u\in L$ , $r=dv$ . If $d\notin mem_{L}(u)$ and $d^{*}=min(\mathcal{D}\setminus mem_{L}(u))$ , then $u((d\>d^{*})\cdot v)$ is also a counterexample. Hence, we can eventually find a counterexample $w=udv$ , with $u\in B$ , $ud\notin B$ , $d\in mem_{L}(u)$ or $d=min(\mathcal{D}\setminus mem_{L}(u))$ : The last transition $t=u\xrightarrow{x,E}z$ followed by $\mathcal{A}$ reading $u d$ was added (to the tree-shaped $\varPi$ -DRA coded by $B$ ) by BuildHP. Let $\tau\in Perm(\mathcal{D})$ mapping $mem_{L}^{\varPi}(ud)[i]$ to $mem_{L}^{\varPi}(z)[i]$ , for $1\leq i\leq|mem_{L}(ud)|$ . If $udv\neq_{L}z(\tau\cdot v)$ , then $udv,z(\tau\cdot v)$ witness that $t$ is wrong. Otherwise, $z(\tau\cdot v)$ is a counterexample. Let $u^{\prime}$ be the longest prefix in $B$ of $w^{\prime}=z(\tau\cdot v)=u^{\prime}r^{\prime}$ . Being $z\in B$ , $|r^{\prime}|\leq|\tau(v)|<|r|$ . So, ProcessCEX terminates, returning a pair of words to prevent forever a wrong transition.

Algorithm 2 ProcessCEX Procedure.

Lemma 36.

The procedure ProcessCEX called at Line 7 terminates performing a number of recursive calls linear in the length of the input counterexample $w$ , and returning (an ordered version of) a set of two words $\{vv^{\prime},zz^{\prime}\}$ such that:

$\blacksquare$

$v=ud$ , where $u\in B$ , $d\in mem_{L}(u)\cup min\{b\mid b\not\in mem(u)\}$ but $ud\notin B$
$\blacksquare$

$u\xrightarrow{x,E}z$ is the last transition followed by ${\mathcal{A}}$ upon reading the word $ud=v$
$\blacksquare$

$norm(mem^{\varPi}_{L}(v)v^{\prime})=norm(mem^{\varPi}_{L}(z)z^{\prime})$ but $vv^{\prime}\in L$ iff $zz^{\prime}\notin L$

7 Discussion

Our learning algorithm can be extended from $\varPi$ -DRAs to pDRAs, modulo modifying the notion of apartness by quantifying over all permutations of memorable data. This would entail an exponential blow-up in the maximal number of memorable data. On the other hand, learning a pDRA can be done as follows: learn a $\varPi$ -DRA in PTime, for some $\varPi$ , and minimize it into a pDRA (in exponential time, and PTime if the maximal number of memorable data is constant). An interesting question is whether enumerating the permutations is avoidable (for minimizing pDRAs or learning them).

We have shown that minimizing pDRAs is hard for the graph isomorphism problem. It is also in NP, as pDRA equivalence can be checked in PTime. We leave this gap open.

An interesting direction is to find a register automaton model which admits polynomial time active learning without relying on a memorability oracle. All known active learning frameworks for register automata which do not rely on memorability oracles have exponential time complexity [12, 11, 14]. On the other hand, $\varPi$ -DRAs can be minimized in PTime, can be checked for equivalence in PTime, have PTime memorability problem and admit passive learning in polynomial time and data (for $\varPi=\text{\sc Lar}$ ) [2]. The latter can be easily generalized to any fixed policy. This makes $\varPi$ -DRAs a good candidate to actively learn regular data languages in polynomial time, without memorability oracles.

References

[1] Dana Angluin. Learning regular sets from queries and counterexamples. Information and Computation, 75(2):87–106, 1987. doi:10.1016/0890-5401(87)90052-6.
[2] Mrudula Balachander, Emmanuel Filiot, and Raffaella Gentilini. Passive Learning of Regular Data Languages in Polynomial Time and Data. In Rupak Majumdar and Alexandra Silva, editors, 35th International Conference on Concurrency Theory (CONCUR 2024), volume 311 of Leibniz International Proceedings in Informatics (LIPIcs), pages 10:1–10:21, Dagstuhl, Germany, 2024. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.CONCUR.2024.10.
[3] M. H. Bandukara and Nikos Tzevelekos. On-the-fly bisimulation equivalence checking for fresh-register automata. J. Syst. Archit., 145:103010, 2023. doi:10.1016/J.SYSARC.2023.103010.
[4] Michael Benedikt, Clemens Ley, and Gabriele Puppis. What you must remember when processing data words. In Alberto H. F. Laender and Laks V. S. Lakshmanan, editors, Proceedings of the 4th Alberto Mendelzon International Workshop on Foundations of Data Management, Buenos Aires, Argentina, May 17-20, 2010, volume 619 of CEUR Workshop Proceedings. CEUR-WS.org, 2010. URL: https://ceur-ws.org/Vol-619/paper11.pdf.
[5] Mikołaj Bojańczyk. Slightly infinite sets. Draft version, September 2019. URL: https://www.mimuw.edu.pl/˜bojan/paper/atom-book.
[6] Mikołaj Bojańczyk, Bartek Klin, and Sławomir Lasota. Automata with group actions. In Proceedings of the 26th Annual IEEE Symposium on Logic in Computer Science, LICS 2011, June 21-24, 2011, Toronto, Ontario, Canada, pages 355–364. IEEE Computer Society, 2011. doi:10.1109/LICS.2011.48.
[7] Mikołaj Bojańczyk, Bartek Klin, and Sławomir Lasota. Automata theory in nominal sets. Log. Methods Comput. Sci., 10(3), 2014. doi:10.2168/LMCS-10(3:4)2014.
[8] Sofia Cassel, Falk Howar, Bengt Jonsson, Maik Merten, and Bernhard Steffen. A succinct canonical register automaton model. J. Log. Algebraic Methods Program., 84(1):54–66, 2015. doi:10.1016/J.JLAMP.2014.07.004.
[9] Loris D’Antoni. In the maze of data languages. CoRR, abs/1208.5980, 2012. arXiv:1208.5980.
[10] Stéphane Demri and Ranko Lazic. LTL with the freeze quantifier and register automata. ACM Trans. Comput. Log., 10(3):16:1–16:30, 2009. doi:10.1145/1507244.1507246.
[11] Simon Dierl, Paul Fiterau-Brostean, Falk Howar, Bengt Jonsson, Konstantinos Sagonas, and Fredrik Tåquist. Scalable tree-based register automata learning. In Bernd Finkbeiner and Laura Kovács, editors, Tools and Algorithms for the Construction and Analysis of Systems, pages 87–108, Cham, 2024. Springer Nature Switzerland. doi:10.1007/978-3-031-57249-4_5.
[12] Falk Howar, Bernhard Steffen, Bengt Jonsson, and Sofia Cassel. Inferring canonical register automata. In Viktor Kuncak and Andrey Rybalchenko, editors, Verification, Model Checking, and Abstract Interpretation, pages 251–266, Berlin, Heidelberg, 2012. Springer Berlin Heidelberg. doi:10.1007/978-3-642-27940-9_17.
[13] Michael Kaminski and Nissim Francez. Finite-memory automata. Theoretical Computer Science, 134(2):329–363, 1994. doi:10.1016/0304-3975(94)90242-9.
[14] Joshua Moerman, Matteo Sammartino, Alexandra Silva, Bartek Klin, and Michal Szynwelski. Learning nominal automata. In Giuseppe Castagna and Andrew D. Gordon, editors, Proceedings of the 44th ACM SIGPLAN Symposium on Principles of Programming Languages, POPL 2017, Paris, France, January 18-20, 2017, pages 613–625. ACM, 2017. doi:10.1145/3009837.3009879.
[15] Ugo Montanari and Marco Pistore. An introduction to history dependent automata. In Andrew D. Gordon, Andrew M. Pitts, and Carolyn L. Talcott, editors, Second Workshop on Higher-Order Operational Techniques in Semantics, HOOTS 1997, Stanford, CA, USA, December 8-12, 1997, volume 10 of Electronic Notes in Theoretical Computer Science, pages 170–188. Elsevier, 1997. doi:10.1016/S1571-0661(05)80696-6.
[16] Andrzej S. Murawski, Steven J. Ramsay, and Nikos Tzevelekos. Bisimilarity in fresh-register automata. In 30th Annual ACM/IEEE Symposium on Logic in Computer Science, LICS 2015, Kyoto, Japan, July 6-10, 2015, pages 156–167. IEEE Computer Society, 2015. doi:10.1109/LICS.2015.24.
[17] Andrzej S. Murawski, Steven J. Ramsay, and Nikos Tzevelekos. Polynomial-Time Equivalence Testing for Deterministic Fresh-Register Automata. In Igor Potapov, Paul Spirakis, and James Worrell, editors, 43rd International Symposium on Mathematical Foundations of Computer Science (MFCS 2018), volume 117 of Leibniz International Proceedings in Informatics (LIPIcs), pages 72:1–72:14, Dagstuhl, Germany, 2018. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.MFCS.2018.72.
[18] Andrzej S. Murawski, Steven J. Ramsay, and Nikos Tzevelekos. DEQ: equivalence checker for deterministic register automata. In Yu-Fang Chen, Chih-Hong Cheng, and Javier Esparza, editors, Automated Technology for Verification and Analysis - 17th International Symposium, ATVA 2019, Taipei, Taiwan, October 28-31, 2019, Proceedings, volume 11781 of Lecture Notes in Computer Science, pages 350–356. Springer, 2019. doi:10.1007/978-3-030-31784-3_20.
[19] Frank Neven, Thomas Schwentick, and Victor Vianu. Finite state machines for strings over infinite alphabets. ACM Trans. Comput. Log., 5(3):403–435, 2004. doi:10.1145/1013560.1013562.
[20] Marco Pistore. History dependent automata. PhD thesis, University of Pisa, March 1999.
[21] Andrew M. Pitts. Nominal Sets: Names and Symmetry in Computer Science. Cambridge Tracts in Theoretical Computer Science. Cambridge University Press, 2013.
[22] Hiroshi Sakamoto and Daisuke Ikeda. Intractability of decision problems for finite-memory automata. Theor. Comput. Sci., 231(2):297–308, 2000. doi:10.1016/S0304-3975(99)00105-X.
[23] Thomas Schwentick. Automata for XML - A survey. J. Comput. Syst. Sci., 73(3):289–315, 2007. doi:10.1016/J.JCSS.2006.10.003.
[24] Luc Segoufin. Automata and logics for words and trees over an infinite alphabet. In Zoltán Ésik, editor, Computer Science Logic, 20th International Workshop, CSL 2006, 15th Annual Conference of the EACSL, Szeged, Hungary, September 25-29, 2006, Proceedings, volume 4207 of Lecture Notes in Computer Science, pages 41–57. Springer, 2006. doi:10.1007/11874683_3.
[25] Frits Vaandrager, Bharat Garhewal, Jurriaan Rot, and Thorsten Wißmann. A new approach for active automata learning based on apartness. In Dana Fisman and Grigore Rosu, editors, Tools and Algorithms for the Construction and Analysis of Systems, pages 223–243, Cham, 2022. Springer International Publishing. doi:10.1007/978-3-030-99524-9_12.

[bib.bib1] [1] Dana Angluin. Learning regular sets from queries and counterexamples. Information and Computation, 75(2):87–106, 1987. doi:10.1016/0890-5401(87)90052-6.

[bib.bib2] [2] Mrudula Balachander, Emmanuel Filiot, and Raffaella Gentilini. Passive Learning of Regular Data Languages in Polynomial Time and Data. In Rupak Majumdar and Alexandra Silva, editors, 35th International Conference on Concurrency Theory (CONCUR 2024), volume 311 of Leibniz International Proceedings in Informatics (LIPIcs), pages 10:1–10:21, Dagstuhl, Germany, 2024. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.CONCUR.2024.10.

[bib.bib3] [3] M. H. Bandukara and Nikos Tzevelekos. On-the-fly bisimulation equivalence checking for fresh-register automata. J. Syst. Archit., 145:103010, 2023. doi:10.1016/J.SYSARC.2023.103010.

[bib.bib4] [4] Michael Benedikt, Clemens Ley, and Gabriele Puppis. What you must remember when processing data words. In Alberto H. F. Laender and Laks V. S. Lakshmanan, editors, Proceedings of the 4th Alberto Mendelzon International Workshop on Foundations of Data Management, Buenos Aires, Argentina, May 17-20, 2010, volume 619 of CEUR Workshop Proceedings. CEUR-WS.org, 2010. URL: https://ceur-ws.org/Vol-619/paper11.pdf.

[bib.bib5] [5] Mikołaj Bojańczyk. Slightly infinite sets. Draft version, September 2019. URL: https://www.mimuw.edu.pl/˜bojan/paper/atom-book.

[bib.bib6] [6] Mikołaj Bojańczyk, Bartek Klin, and Sławomir Lasota. Automata with group actions. In Proceedings of the 26th Annual IEEE Symposium on Logic in Computer Science, LICS 2011, June 21-24, 2011, Toronto, Ontario, Canada, pages 355–364. IEEE Computer Society, 2011. doi:10.1109/LICS.2011.48.

[bib.bib7] [7] Mikołaj Bojańczyk, Bartek Klin, and Sławomir Lasota. Automata theory in nominal sets. Log. Methods Comput. Sci., 10(3), 2014. doi:10.2168/LMCS-10(3:4)2014.

[bib.bib8] [8] Sofia Cassel, Falk Howar, Bengt Jonsson, Maik Merten, and Bernhard Steffen. A succinct canonical register automaton model. J. Log. Algebraic Methods Program., 84(1):54–66, 2015. doi:10.1016/J.JLAMP.2014.07.004.

[bib.bib9] [9] Loris D’Antoni. In the maze of data languages. CoRR, abs/1208.5980, 2012. arXiv:1208.5980.

[bib.bib10] [10] Stéphane Demri and Ranko Lazic. LTL with the freeze quantifier and register automata. ACM Trans. Comput. Log., 10(3):16:1–16:30, 2009. doi:10.1145/1507244.1507246.

[bib.bib11] [11] Simon Dierl, Paul Fiterau-Brostean, Falk Howar, Bengt Jonsson, Konstantinos Sagonas, and Fredrik Tåquist. Scalable tree-based register automata learning. In Bernd Finkbeiner and Laura Kovács, editors, Tools and Algorithms for the Construction and Analysis of Systems, pages 87–108, Cham, 2024. Springer Nature Switzerland. doi:10.1007/978-3-031-57249-4_5.

[bib.bib12] [12] Falk Howar, Bernhard Steffen, Bengt Jonsson, and Sofia Cassel. Inferring canonical register automata. In Viktor Kuncak and Andrey Rybalchenko, editors, Verification, Model Checking, and Abstract Interpretation, pages 251–266, Berlin, Heidelberg, 2012. Springer Berlin Heidelberg. doi:10.1007/978-3-642-27940-9_17.

[bib.bib13] [13] Michael Kaminski and Nissim Francez. Finite-memory automata. Theoretical Computer Science, 134(2):329–363, 1994. doi:10.1016/0304-3975(94)90242-9.

[bib.bib14] [14] Joshua Moerman, Matteo Sammartino, Alexandra Silva, Bartek Klin, and Michal Szynwelski. Learning nominal automata. In Giuseppe Castagna and Andrew D. Gordon, editors, Proceedings of the 44th ACM SIGPLAN Symposium on Principles of Programming Languages, POPL 2017, Paris, France, January 18-20, 2017, pages 613–625. ACM, 2017. doi:10.1145/3009837.3009879.

[bib.bib15] [15] Ugo Montanari and Marco Pistore. An introduction to history dependent automata. In Andrew D. Gordon, Andrew M. Pitts, and Carolyn L. Talcott, editors, Second Workshop on Higher-Order Operational Techniques in Semantics, HOOTS 1997, Stanford, CA, USA, December 8-12, 1997, volume 10 of Electronic Notes in Theoretical Computer Science, pages 170–188. Elsevier, 1997. doi:10.1016/S1571-0661(05)80696-6.

[bib.bib16] [16] Andrzej S. Murawski, Steven J. Ramsay, and Nikos Tzevelekos. Bisimilarity in fresh-register automata. In 30th Annual ACM/IEEE Symposium on Logic in Computer Science, LICS 2015, Kyoto, Japan, July 6-10, 2015, pages 156–167. IEEE Computer Society, 2015. doi:10.1109/LICS.2015.24.

[bib.bib17] [17] Andrzej S. Murawski, Steven J. Ramsay, and Nikos Tzevelekos. Polynomial-Time Equivalence Testing for Deterministic Fresh-Register Automata. In Igor Potapov, Paul Spirakis, and James Worrell, editors, 43rd International Symposium on Mathematical Foundations of Computer Science (MFCS 2018), volume 117 of Leibniz International Proceedings in Informatics (LIPIcs), pages 72:1–72:14, Dagstuhl, Germany, 2018. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.MFCS.2018.72.

[bib.bib18] [18] Andrzej S. Murawski, Steven J. Ramsay, and Nikos Tzevelekos. DEQ: equivalence checker for deterministic register automata. In Yu-Fang Chen, Chih-Hong Cheng, and Javier Esparza, editors, Automated Technology for Verification and Analysis - 17th International Symposium, ATVA 2019, Taipei, Taiwan, October 28-31, 2019, Proceedings, volume 11781 of Lecture Notes in Computer Science, pages 350–356. Springer, 2019. doi:10.1007/978-3-030-31784-3_20.

[bib.bib19] [19] Frank Neven, Thomas Schwentick, and Victor Vianu. Finite state machines for strings over infinite alphabets. ACM Trans. Comput. Log., 5(3):403–435, 2004. doi:10.1145/1013560.1013562.

[bib.bib20] [20] Marco Pistore. History dependent automata. PhD thesis, University of Pisa, March 1999.

[bib.bib21] [21] Andrew M. Pitts. Nominal Sets: Names and Symmetry in Computer Science. Cambridge Tracts in Theoretical Computer Science. Cambridge University Press, 2013.

[bib.bib22] [22] Hiroshi Sakamoto and Daisuke Ikeda. Intractability of decision problems for finite-memory automata. Theor. Comput. Sci., 231(2):297–308, 2000. doi:10.1016/S0304-3975(99)00105-X.

[bib.bib23] [23] Thomas Schwentick. Automata for XML - A survey. J. Comput. Syst. Sci., 73(3):289–315, 2007. doi:10.1016/J.JCSS.2006.10.003.

[bib.bib24] [24] Luc Segoufin. Automata and logics for words and trees over an infinite alphabet. In Zoltán Ésik, editor, Computer Science Logic, 20th International Workshop, CSL 2006, 15th Annual Conference of the EACSL, Szeged, Hungary, September 25-29, 2006, Proceedings, volume 4207 of Lecture Notes in Computer Science, pages 41–57. Springer, 2006. doi:10.1007/11874683_3.

[bib.bib25] [25] Frits Vaandrager, Bharat Garhewal, Jurriaan Rot, and Thorsten Wißmann. A new approach for active automata learning based on apartness. In Dana Fisman and Grigore Rosu, editors, Tools and Algorithms for the Construction and Analysis of Systems, pages 223–243, Cham, 2022. Springer International Publishing. doi:10.1007/978-3-030-99524-9_12.

Register Automata with Permutations

Abstract

Keywords and phrases:

Funding:

Copyright and License:

2012 ACM Subject Classification:

DOI:

Event:

Editors:

Series and Publisher:

1 Introduction

Register automata with permutations.

Example 1.

Outline.

Related work.

2 Data word languages and register automata with permutations

Numbers, sets, relations.

Data alphabet, data words and nominal sets.

Registers and register permutations.

Definition 2.

▶ Remark 3.

Configurations, runs and languages.

Definition 4.

Lemma 5.

▶ Remark 6 (Deterministic).

3 Decision problems for register automata with permutations

Lemma 7.

Theorem 8.

Lemma 9.

Proof.

Example 10.

Theorem 11.

Proof.

4 Minimal register automata with permutations

Definition 12 (Data word equivalence relation).

Proposition 13.

Lemma 14.

Proof.

Example 15.

Theorem 16.

Proof.

Construction of a canonical automaton.

Example 17.

Lemma 18.

Proof.

Minimization of pDRAs.

Proposition 19.

▶ Remark 20 (Canonicity).

5 𝚷-DRAs: pDRAs with a Fixed Permutation Policy

Definition 21.

Example 22.

Example 23 (Lar-automata).

Proposition 24.

Lemma 25.

Word of memorable data.

Proposition 26.

Abstract residual.

Example 27.

Canonical 𝚷-DRA.

Definition 28 (Π-equivalence).

Lemma 29.

▶ Remark 30.

Lemma 31.

Proof.

Theorem 32.

Proof.

6 Active Learning and Minimization of 𝚷-DRAs

Definition 33 (Apartness).

Description of the (active) learning algorithm.

Theorem 34.

Corollary 35.

Counter-example processing.

Lemma 36.

7 Discussion

References

$\blacktriangleright$ Remark 3.

$\blacktriangleright$ Remark 6 (Deterministic).

$\blacktriangleright$ Remark 20 (Canonicity).

5 $\varPi$ -DRAs: pDRAs with a Fixed Permutation Policy

Canonical $\varPi$ -DRA.

Definition 28 ( $\varPi$ -equivalence).

$\blacktriangleright$ Remark 30.

6 Active Learning and Minimization of $\varPi$ -DRAs