Synthesis of Data Word Transducers

Léo Exibard
Aix Marseille Univ, Université de Toulon, CNRS, LIS, Marseille, France
Université libre de Bruxelles, Brussels, Belgium

Emmanuel Filiot
Université libre de Bruxelles, Brussels, Belgium

Pierre-Alain Reynier
Aix Marseille Univ, Université de Toulon, CNRS, LIS, Marseille, France

Abstract
In reactive synthesis, the goal is to automatically generate an implementation from a specification of the reactive and non-terminating input/output behaviours of a system. Specifications are usually modelled as logical formulae or automata over infinite sequences of signals (ω-words), while implementations are represented as transducers. In the classical setting, the set of signals is assumed to be finite. In this paper, we consider data ω-words instead, i.e., words over an infinite alphabet. In this context, we study specifications and implementations respectively given as automata and transducers extended with a finite set of registers. We consider different instances, depending on whether the specification is nondeterministic, universal or deterministic, and depending on whether the number of registers of the implementation is given or not.

In the unbounded setting, we show undecidability for both universal and non-deterministic specifications, while decidability is recovered in the deterministic case. In the bounded setting, undecidability still holds for non-deterministic specifications, but can be recovered by disallowing tests over input data. The generic technique we use to show the latter result allows us to reprove some known result, namely decidability of bounded synthesis for universal specifications.

2012 ACM Subject Classification Theory of computation → Logic and verification; Theory of computation → Automata over infinite objects; Theory of computation → Transducers

Keywords and phrases Register Automata, Synthesis, Data words, Transducers


Funding Léo Exibard: Funded by a FRIA fellowship from the F.R.S.-FNRS.
Emmanuel Filiot: Research associate of F.R.S.-FNRS. He is supported by the ARC Project Transform Fédération Wallonie-Bruxelles and the FNRS CDR J013116F and MIS F451019F projects.
Pierre-Alain Reynier: Partly funded by the DeLTA project (ANR-16-CE40-0007).

Acknowledgements We would like to thank Ayrat Khalimov for his remarks and suggestions, which helped improve the quality of the paper.

1 Introduction

Reactive synthesis is an active research domain whose goal is to design algorithmic methods able to construct a reactive system from a specification of its admissible behaviours. Such systems are notoriously difficult to design correctly, and the main appealing idea of synthesis is to automatically generate systems correct by construction. Reactive systems are non-terminating systems that continuously interact with the environment in which they are executed, through input and output signals. At each time step, the system receives an input signal from a set In and produces an output signal from a set Out. An execution is then modelled as an infinite sequence alternating between input and output signals, i.e., an ω-word in (In,Out)ω. Classically, the sets In and Out are assumed to be finite and reactive.
systems are modelled as (sequential) transducers. Transducers are simple finite-state machines with transitions of type $\text{States} \times \text{In} \rightarrow \text{States} \times \text{Out}$, which, at any state, can process any input signal and deterministically produce some output signal, while possibly moving, again deterministically, to a new state. A specification is then a language $S \subseteq (\text{In}.\text{Out})^\omega$ telling which are the acceptable behaviours of the system. It is also classically represented as an automaton, or as a logical formula then converted into an automaton. Some regular specifications may not be realisable by any transducer, and the realisability problem asks, given a regular specification $S$, whether there exists a transducer $T$ whose behaviours satisfy $S$ (are included in $S$). The synthesis problem asks to construct $T$ if it exists.

A typical example of reactive system is that of a server granting requests from a finite set of clients $C$. Requests are represented as the set of input signals $\text{In} = \{(r, i) \mid i \in C\} \cup \{\text{idle}\}$ (client $i$ requests the resource) and grants by the set of output signals $\text{Out} = \{(g, i) \mid i \in C\} \cup \{\text{idle}\}$ (server grants client $i$’s request). A typical constraint to be imposed on such a system is that every request is eventually granted, which can be represented by the LTL formula $\bigwedge_{i \in C} G((r, i) \rightarrow F(g, i))$. The latter specification is realisable for instance by the transducer which outputs $(g, i)$ whenever it reads $(r, i)$ and idle whenever it reads idle.

It is well-known that the realisability problem is decidable for $\omega$-regular specifications, $\text{ExpTime-c}$ when represented by parity automata [9] and $2\text{ExpTime-c}$ for LTL specifications [16]. Such positive results have triggered a recent and very active research interest in efficient symbolic methods and tools for reactive synthesis (see e.g. [1]). Extensions of this classical setting have been proposed to capture more realistic scenarios [1]. However, only a few works have considered infinite sets of input and output signals. In the previous example, the number of clients is assumed to be finite, and small. To the best of our knowledge, existing synthesis tools do not handle large alphabets, and it is more realistic to consider an unbounded (infinite) set of client identifiers, e.g. $C = \mathbb{N}$. The goal of this paper is to investigate how reactive synthesis can be extended to handle infinite sets of signals.

Data words are infinite sequences $x_1x_2\ldots$ of labelled data, i.e., pairs $(\sigma, d)$ with $\sigma$ a label from a finite alphabet, and $d \in \mathbb{N}$. They can naturally model executions of reactive systems over an infinite set of signals. Among other models, register automata are one of the main extensions of automata recognising languages of data words [10, 18]. They can use a finite set of registers in which to store data that are read, and to compare the current data with the content of some of the registers (in this paper, we allow comparison of equality). Likewise, transducers can be extended to register transducers as a model of reactive systems over data words: a register transducer is equipped with a set of registers, and when reading an input labelled data $(\sigma, d)$, it can test $d$ for equality with the content of some of its registers, and depending on the result of this test, deterministically assign some of its registers to $d$ and output a finite label $\beta$ together with the content of one of its registers. Its executions are then data words alternating between input and output labelled data, so register automata can be used to represent specifications, as languages of such data words.

Contributions. We consider two classical semantics for register automata, non-deterministic and universal, both with a parity acceptance condition, which give two classes of register automata respectively denoted NRA and URA. Since NRA are not closed under complement (already over finite data words), NRA and URA define incomparable classes of specifications. The request-grant specification given previously with an infinite number of clients is expressible by an URA [12]: whenever a request is made by client $i$ (labelled data $(r, i)$), universally trigger a run which stores $i$ in some register and verifies that the labelled data $(g, i)$ eventually occurs in the data word; but no NRA can define it. On the contrary, consider the specification $S_0$:...
“all input data but one are copied on the output, the missing one being replaced by some data which occurred before it”, modelled as the set of data sequences $d_1d_2d_2 \ldots d_1d_{i+1}d_{i+1} \ldots$ for all $i \geq 0$ and $j < i$ (finite labels are irrelevant and not represented). $S_0$ is not definable by any URA (it would require to guess $j$, which can be arbitrarily smaller than $i$), but it is expressible by some NRA making this guess.

However, we show (unsurprisingly) that the realisability problem by register transducers of specifications defined by NRA is undecidable. The same negative result also holds for URA, solving an open question raised in [12]. On the positive side, we show that decidability is recovered for deterministic (parity) register automata (DRA). One of the difficulties of register transducer synthesis is that the number of registers needed to realise the specification is, a priori, unbounded with regards to the number of registers of the specification. We show it is in fact not the case for DRA: any specification expressed as a DRA with $r$ registers is realisable by a register transducer if and only if it is realisable by a transducer with $r$ registers.

A way to obtain decidability is to fix a bound $k$ and to target register transducers with at most $k$ registers. This setting is called bounded synthesis in [12], which establishes that bounded synthesis is decidable in $2\text{ExpTime}$ for URA. We show that unfortunately, bounded synthesis is still undecidable for NRA specifications. To recover decidability for NRA, we disallow equality tests on the input data and add a syntactic requirement which entails that on any accepted word, each output data is the content of some register which has been assigned an input data occurring before. This defines a subclass of NRA that we call (input) test-free NRA (NRA$_{tf}$). NRA$_{tf}$ can express how output data can be obtained from input data (by copying, moving or duplicating them), although they do not have the whole power of register automata on the input nor the output side. Note that the specification $S_0$ given before is NRA$_{tf}$-definable.

To reprove the result of [12], with a rather short proof based on standard results from the theory of register automata, indicating that it might allow to establish decidability for other classes of data specifications. Our results are summarised in Table 1.

<table>
<thead>
<tr>
<th></th>
<th>DRA</th>
<th>NRA</th>
<th>URA</th>
<th>NRA$_{tf}$</th>
</tr>
</thead>
<tbody>
<tr>
<td>Bounded Synthesis</td>
<td>$\text{ExpTime}$ (Thm. 13)</td>
<td>Undecidable ($k \geq 1$) (Thm. 3)</td>
<td>$2\text{ExpTime}$ ([12] and Thm. 12)</td>
<td>$2\text{ExpTime}$ (Thm. 16)</td>
</tr>
<tr>
<td>General Case</td>
<td>$\text{ExpTime}$ (Thm. 6)</td>
<td>Undecidable (Thm. 2)</td>
<td>Undecidable (Thm. 4)</td>
<td>Open</td>
</tr>
</tbody>
</table>

**Table 1** Decidability status of the problems studied.

**Related Work.** As already mentioned, bounded synthesis of register transducers is considered in [12] where it is shown to be decidable for URA. We reprove this result in a shorter way. Our proof bears some similarities with that of [12], but it seems that our formulation benefits more from the use of existing results. The technique is also more generic and we instantiate it to NRA$_{tf}$. NRA$_{tf}$ correspond to the one-way, non-deterministic version of the expressive transducer model of [5], which however does not consider the synthesis problem.

The synthesis problem over infinite alphabets is also considered in [6], in which data represent identifiers and specifications (given as particular automata close to register automata) can depend on equality between identifiers. However, the class of implementations is very expressive: it allows for unbounded memory through a queue data structure. The synthesis problem is shown to be undecidable and a sound but incomplete algorithm is given.
Finally, classical reactive synthesis has strong connections with game theory on finite graphs. Some extension of games to infinite graphs whose vertices are valuations of variables in an infinite data domain have been considered in [8]. Such games are shown to be undecidable and a decidable restriction is proposed, which however does not seem to match our context.

2 Data Words and Register Automata

For a (possibly infinite) set S, we denote by $S^\omega$ the set of infinite words over this alphabet. For $1 \leq i \leq j$, we let $u[i:j] = u_i u_{i+1} \ldots u_j$ and $u[i] = u[i:i]$ the $i$th letter of $u$. For $u, v \in S^\omega$, we define their interleaving $(u, v) = u[1]v[1]u[2]v[2] \ldots$

**Data Words.** Let $\Sigma$ be a finite alphabet and $\mathcal{D}$ a countably infinite set, denoting, all over this paper, a set of elements called data. We also distinguish an (arbitrary) data value $d_0 \in \mathcal{D}$. Given a set $R$, let $\tau^R_0$ be the constant function defined by $\tau^R_0(r) = d_0$ for all $r \in R$. A labelled data (or l-data for short) is a pair $x = (\sigma, d) \in \Sigma \times \mathcal{D}$, where $\sigma$ is the label and $d$ the data. We define the projections $\text{lab}(x) = \sigma$ and $\text{dt}(x) = d$. A data word over $\Sigma$ and $\mathcal{D}$ is an infinite sequence of labelled data, i.e. a word $w \in (\Sigma \times \mathcal{D})^\omega$. We extend the projections $\text{lab}$ and $\text{dt}$ to data words naturally, i.e. $\text{lab}(w) \in \Sigma^\omega$ and $\text{dt}(w) \in \mathcal{D}^\omega$. We denote the set of data words over $\Sigma$ and $\mathcal{D}$ by $\text{DW}(\Sigma, \mathcal{D})$ (DW when clear from the context). A data word language is a subset $L \subseteq \text{DW}(\Sigma, \mathcal{D})$. Note that in this paper, data words are infinite, otherwise they are called finite data words, and we denote by $\text{DW}_f(\Sigma, \mathcal{D})$ the set of finite data words.

**Register Automata.** Register automata are automata recognising data word languages. They were first introduced in [10] as finite-memory automata. Here, we define them in a spirit close to [18], but over infinite words, with a parity acceptance condition.

A register automaton (RA) is a tuple $A = (\Sigma, \mathcal{D}, Q, q_0, \delta, c)$, where:

- $\Sigma$ is a finite alphabet of labels, $\mathcal{D}$ is an infinite alphabet of data
- $Q$ is a finite set of states and $q_0 \in Q$ is the initial state
- $\delta$ is a finite set of registers. We denote $\text{Tst}_R = 2^R$ and $\text{Asgn}_R = 2^R$.\(^2\)
- $c : Q \rightarrow \{1, \ldots, d\}$, where $d \in \mathbb{N}$ is the number of priorities, is the colouring function, used to define the acceptance condition

A transition $(q, \sigma, \text{tst}, \text{asgn}, q')$ is also written $q \xrightarrow{a, \text{tst}, \text{asgn}}_A q'$. We may omit $A$ in the latter notation. Intuitively such transition means that on input $(\sigma, d)$ in state $q$ the automaton: it first checks that $\text{tst}$ is exactly the set of registers containing $d$: for all $r \in \text{tst}$, $d$ is the current content of register $r$ and for all $r \notin \text{tst}$, $d$ is not in register $r$, then it assigns $d$ to all the registers in $\text{asgn}$ ($\text{asgn}$ might be empty), and finally transitions to state $q'$.

$A$ is said to be deterministic (resp. complete) when, given any state, any label and any possible test, at most (resp. at least) one transition can be taken: $\forall q \in Q, \forall \sigma \in \Sigma, \forall \text{tst} \in \text{Tst}_R, \exists \leq 1 \text{asgn} \in \text{Asgn}, \exists \leq 1 q' \in \mathcal{D}$ such that $q \xrightarrow{a, \text{tst}, \text{asgn}}_A q'$ (resp. $\forall q, \forall \sigma, \forall \text{tst}, \exists \geq 1 \text{asgn}, \exists \geq 1 q'$ s.t. $q \xrightarrow{a, \text{tst}, \text{asgn}}_A q'$). Since tests are mutually exclusive, this syntactically ensures that for any state and in any register configuration, the transition to take is determined by the input l-data (respectively that a transition can always be taken, regardless the input l-data). The class of deterministic register automata will be denoted $\text{DRA}$.

---

1 In the terminology of [13], it corresponds to multiple-assignment register automata with registers initially filled, i.e. the class $MF$, with the additional requirement that all are filled with the same data.

2 Those sets are identical but have distinct semantics.
Configurations and Runs. A configuration is a pair \((q, τ) \in Q \times (R \rightarrow \mathcal{D})\). Given \(τ \in \text{Tst}_R\) and \(d \in \mathcal{D}\), we say that \(τ, d\) satisfies \(τ\), denoted \(τ, d \models τ\), if \(τ^{-1}(d) = τ\). Given a transition \(t = p \xrightarrow{σ, asgn} p'\), we say that \((q, τ)\) enables \(t\) on reading \((σ', d)\) if \(q = p\), \(σ' = σ\) and \(τ, d \models τ\). Let \(\text{next}(τ, asgn, d)\) be the configuration \(τ'\) defined by \(τ'(i) = d\) if \(i \in \text{asgn}\), and \(τ'(i) = τ(i)\) otherwise. We extend this notation to configurations as follows: if \(γ = (q, τ)\) enables \(t\) on input \((σ, d)\), the successor configuration of \((q, τ)\) by \(t\) on input \((σ, d)\) is \(\text{next}(γ, asgn, d) = (p', \text{next}(τ, asgn, d))\). We also write \(\text{next}(γ, t, σ, d)\) to denote the successor of \((q, τ)\) by transition \(t\) when \((q, τ)\) enables \(t\) on input \((σ, d)\). The initial configuration is \((q_0, τ_0^D)\). Then, a run over a data word \(σ_1, d_1)(σ_2, d_2)\ldots\) is an infinite sequence of transitions \(t_0t_1\ldots\) such that there exists a sequence of configurations \(γ_0γ_1\ldots = (q_0, τ_0)(q_1, τ_1)\ldots\) such that \(γ_0\) is initial and for all \(i \geq 0\), \(γ_{i+1} = \text{next}(γ_i, t_i, σ_i, d_i)\). To a run \(ρ\), we associate its sequence of states \(\text{states}(ρ) = q_0q_1\ldots\)

Languages Defined by RA. Given a run \(ρ\), we denote, by a slight abuse of notation, \(c(ρ) = \max\{j \mid c(q_j) = j\text{ for infinitely many }q_j \in \text{states}(ρ)\}\) the maximum color that occurs infinitely often in \(ρ\). Then, in the parity acceptance condition, \(ρ\) is accepting whenever \(c(ρ)\) is even. We consider two dual semantics for RA: non-deterministic (N) and universal (U). Given an RA \(A\), depending on whether it is considered nondeterministic or universal, it recognises \(L_N(A) = \{w \mid \text{there exists an accepting run } ρ\text{ on } w\}\) or \(L_U(A) = \{w \mid \text{all runs } ρ\text{ on } w\text{ are accepting}\}\).

We denote by NRA (resp. URA) the class of register automata interpreted with a non-deterministic (resp. universal) parity acceptance condition, and given \(A \in \text{NRA}\) (resp. \(A \in \text{URA}\)), we write \(L(A)\) instead of \(L_N(A)\) (resp. \(L_U(A)\)). We also denote by DRA the class of deterministic parity register automata.

3 Synthesis of Register Transducers

Specifications, Implementations and the Realisability Problem. Let \(Σ_δ\) and \(Σ_o\) be two finite alphabets of labels, and \(\mathcal{D}\) a countable set of data. A relational data word is an element of \(w \in [(Σ_δ \times \mathcal{D}) \times (Σ_o \times \mathcal{D})]^ω\). Such a word is called relational as it defines a pair of data words in \(\text{DW}(Σ_δ, \mathcal{D}) \times \text{DW}(Σ_o, \mathcal{D})\) through the following projections. If \(w = x_1 δ_1 x_2 δ_2 x_3 δ_3 \ldots\), we let \(\text{inp}(w) = x_1 x_2 \ldots\) and \(\text{out}(w) = x_1 δ_1 x_2 δ_2 \ldots\). We denote by \(\text{RW}(Σ_δ, Σ_o, \mathcal{D})\) (just \(\text{RW}\) when clear from the context) the set of relational data words. A specification is simply a language \(S \subseteq \text{RW}(Σ_δ, Σ_o, \mathcal{D})\). An implementation is a total function \(I : (Σ_δ \times \mathcal{D})^ω \rightarrow Σ_o \times \mathcal{D}\).

We associate to \(I\) another function \(f_I : \text{DW}(Σ_δ, \mathcal{D}) \rightarrow \text{DW}(Σ_o, \mathcal{D})\) which, to an input data word \(w_δ = x_1 δ_1 \ldots\) in \(Σ_δ \times \mathcal{D}\), associates the output data word \(f_I(w_δ) = x_1 δ_1 x_2 δ_2 \ldots\) such that \(\forall i \geq 1, x_i δ_i = I(x_i δ_i x_{i-1} δ_{i-1})\). \(I\) also defines a language of relational data words \(L(I) = \{w_δ, f_I(w_δ) \mid w_δ \in \text{DW}(Σ_δ, \mathcal{D})\}\).

We say that \(I\) realises \(S\) when \(L(I) \subseteq S\), and that \(S\) is realisable if there exists an implementation realising it. The realisability problem consists, given a (finite representation of a) specification \(S\), in checking whether \(S\) is realisable. In general, we parameterise this problem by classes of specifications \(S\) and of implementations \(I\), defining the \((S, I)\)-realisability problem, denoted \(\text{Real}(S, I)\). Given a specification \(S \in \mathcal{S}\), it asks whether \(S\) is realisable by some implementation \(I \in \mathcal{I}\). We now introduce the classes \(\mathcal{S}\) and \(\mathcal{I}\) we consider.

Specification Register Automata. In this paper, we consider specifications defined from register automata alternately reading input and output l-data. We assume that the set of states is partitioned into \(Q_i\) (called input states, reading only labels in \(Σ_δ\)) and \(Q_o\) (called output states, reading only labels in \(Σ_o\)), where \(q_0 \in Q_i\) and \(F \subseteq Q_i\), and such that the
transition relation $\delta$ alternates between these two sets, i.e. $\delta \subseteq \cup_{i \in D} (Q_{\alpha} \times \Sigma_{\alpha} \times \text{Tst}_{R} \times \text{Asgn}_{R} \times Q_{\pi})$, where $i = \varnothing$ (resp. $\pi = i$). We denote by DRA (resp. NRA, URA) the class of specifications defined by deterministic (resp. non-deterministic, universal) parity register automata.

**Register Transducers As Implementations.** We consider implementations represented as transducers processing data words. A register transducer is a tuple $T = (\Sigma_{\alpha}, Q_{\alpha}, q_{0}, \delta, R)$ where $Q$ is a finite set of states with initial state $q_{0}$, $R$ is a finite set of registers, and $\delta : Q \times Q_{\alpha} \times \text{Tst}_{R} \rightarrow \text{Asgn}_{R} \times \Sigma_{\alpha} \times R \times Q$ is the (total) transition function (as before, $\text{Tst}_{R} = \text{Asgn}_{R} = 2^{R}$). When processing an l-data $(\sigma_{\alpha}, d)$, $T$ compares $d$ with the content of some of its registers, and depending on the result, moves to another state, stores $d$ in some registers, and outputs some label in $\Sigma_{\alpha}$ along with the content of some register $r \in R$.

Let us formally define the semantics of a register transducer $T$, as an implementation $I_{T}$. First, for a finite input data word $w = (\sigma_{\alpha}^{1}, d_{1}) \ldots (\sigma_{\alpha}^{n}, d_{n})$ in $(\Sigma_{\alpha} \times D)^{*}$, we denote by $(q_{i}, \tau_{i})$ the $i$th configuration reached by $T$ on $w$, where $(q_{0}, \tau_{0})$ is initial and for all $0 < i < n$, $(q_{i}, \tau_{i})$ is the unique configuration such that there exists a transition $\delta(q_{i-1}, \sigma_{\alpha}^{i}, \text{tst}) = (\text{asgn}, q_{\alpha}, r, q_{i})$ such that $\tau_{i-1}, d_{i} \models \text{tst}$ and $\tau_{i} = \text{next}(\tau_{i-1}, d_{i}, \text{asgn})$. We let $(\sigma_{\alpha}^{0}, d_{0}) = (\sigma_{\alpha}, \tau_{0}(r))$ and $I_{T}(w) = (\sigma_{\alpha}^{n}, d_{n})$. Then, we denote $f_{T} = f_{I_{T}}$ and $L(T) = L(I_{T})$. Note that if $T$ is interpreted as a DRA with exactly one transition per output state and whose states are all accepting (i.e. have even maximal parity 2), then $L(I_{T})$ is indeed the language of such register automaton. We denote by $\text{RT}[k]$ the class of implementations defined by register transducers with at most $k$ registers, and by $\text{RT} = \bigcup_{k \geq 0} \text{RT}[k]$ the class of implementations defined by register transducers.

**Synthesis from Data-Free Specifications.** If in the latter definitions of the problem, one considers specifications defined by RA with no registers, and implementations defined by RT with no registers, then the data in data-words can be ignored and we are in the classical reactive synthesis setting, for which important results are known:

- **Theorem 1** ([9]). Given a (data-free) specification $S$ defined by some (register-free) universal or non-deterministic parity automaton, the realisability problem of $S$ by (register-free) transducers is EXPTime-c.

### 4 Unbounded Synthesis

In this section, we consider the unbounded synthesis problem $\text{REAL}$(RA, RT). Thus, we do not fix a priori the number of registers of the implementation. Let us first consider the case of NRA and URA, which are, in our setting, the most natural devices to express data word specifications. By reducing the universality of NRA over finite words (which is undecidable [14]) to our synthesis problems, we show:

- **Theorem 2.** $\text{REAL}$(NRA, RT) is undecidable.

- **Theorem 3.** For all $k \geq 1$, $\text{REAL}$(NRA, RT$[k]$) is undecidable.

Now, we can show that the unbounded synthesis problem is also undecidable for URA, answering a question left open in [12].

- **Theorem 4.** $\text{REAL}$(URA, RT) is undecidable.
Proof. We present a reduction from the emptiness problem of URA over finite words (which is undecidable by a direct reduction from the universality problem of NRA, undecidable by [14]) to our synthesis problem.

First, consider the relation \( S_1 = \{(u \neq v, u \neq w) \mid u \in DW_f, v \in DW, \text{each data of } u \text{ appears infinitely often in } w\} \). \( S_1 \) is recognised by a 1-register URA which, upon reading a data \( d \) in \( u \), stores it in its register and checks that it appears infinitely often in \( w \) by visiting a state with maximal parity 2 every time it sees \( d \) (all other states have parity 1). Note that for all \( k \geq 1 \), \( S_1 \cap \{(u \neq v, u \neq w) \mid u \in DW_f, v, w \in DW \text{ and } |dt(u)| \leq k\} \) is realisable by a \( k \)-register transducer: on reading \( u \), store each distinct data in one register, and after the \( \# \) outputs them in turn in a round-robin fashion. However, \( S_1 \) is not realisable: on reading the \( \# \) separator, any implementation must have all the data of \( dt(u) \) in its registers, but the size of \( dt(u) \) is not bounded (\( u \) can have pairwise distinct data and be of arbitrary length).

Then, let \( A \) be a URA over finite data words. Consider the specification \( S = S_1 \cup S_2 \cup T \), where \( S_2 = \{(u \neq v, u \neq w \# (a, d_0)^* \mid u \in DW_f, v \in DW, w \in L(A)\} \) and \( T = \{(u, w) \mid u \notin DW_f \# DW, w \in DW\} \). \( S \) has total domain, and is recognisable by a URA. Indeed, URA are closed under union, by the same product construction as for the intersection of NRA [10], and each part is URA-recognisable: \( S_1 \) is, as described above, \( S_2 \) is by simulating \( A \) on the output to check \( w \in L(A) \) then looping over \((a, d_0)\), and \( T \) simply checks a regular property.

Now, if \( L(A) \neq \emptyset \), let \( w \in L(A) \), and \( k \) its number of distinct data. Then \( S \) is realisable by a \( k \)-register transducer realising \( S_1 \) when the number of data in \( u \) is lower than or equal to \( k \), and, when it is greater than \( k \), by outputting \( u \neq \hat{w} \# (a, d_0)^* \) where \( \hat{w} \) is a membership-preserving renaming of \( w \) using \( k \) distinct data of \( u \) (this can always be done thanks to the so-called “indistinguishability property” stated in [10]). Conversely, if \( L(A) = \emptyset \), then \( S \) is not realisable. If it were, \( S \cap DW_f \# DW = S_1 \) would be too, as a regular domain restriction, but we have seen above that this is not the case. Thus, \( S \) is realisable iff \( L(A) = \emptyset \). ▷

However, we show that restricting to DRA allows to recover the decidability, modulo one additional assumption, namely that no output transition of the specification is such that \( \text{tst} = \emptyset \) (the output data is different from all register contents). We denote by \( \text{DRA}_\emptyset \) this class of DRA. Such assumption rules out pathological, and to our opinion uninteresting and technical cases stemming from the asymmetry between the class of specifications and implementations. E.g., consider the single-register DRA in Fig. 1a (finite labels are arbitrary and not depicted). It starts by reading one input data \( d \) and stores it in \( r \), asks that the corresponding output data is different from the content \( d \) of \( r \) (with \( \text{tst} = \emptyset \) depicted here \( \neq r \)), then accepts any output over any input (transitions \( \top \) are always takeable). It is not realisable because transducers necessarily output the content of some register (hence producing a data which already appeared). On the other hand, having tests \( \text{tst} = \emptyset \) does not imply unrealisability, as shown by the DRA of Fig. 1b: it starts by reading one data \( d_1 \), asks to copy it on the output, then reads another data \( d_2 \), and requires that the output is either distinct from \( d_1 \) or equal to it, depending on whether \( d_2 \neq d_1 \). It happens that such specification is realisable by the identity.

\[\begin{align*}
\text{(a) An unrealisable DRA.} \\
\text{(b) A similar DRA, surprisingly realisable.}
\end{align*}\]

**Figure 1** Pathological DRA specifications.
To check for realizability of \( \text{DRA}_\omega \)-specifications with \( r \)-registers, we show that it suffices to target transducers with \( r \) registers only.

\section*{Proposition 5} Let \( S \) be a specification defined by a \( \text{DRA}_\omega \) with \( r \) registers. If \( S \) is realisable by a register transducer, then it is realisable by a transducer with \( S \) whenever there exist the following transitions of \( 3 \)

\begin{proof}
Proposition 5. ▶

\text{Definition of } I

\begin{align*}
\text{next}(C, \text{tst}^S, \text{asgn}^S, \text{tst}^I, \text{asgn}^I) &= C\backslash((\text{asgn}^S \times \text{tst}^I) \cup (R^S \times \text{asgn}^I)) \\
&\cup((\text{tst}^S \cup \text{asgn}^S) \times (\text{tst}^I \cup \text{asgn}^I))
\end{align*}

\end{proof}

Constraints. A constraint represents the equality relations between the registers in \( R^S \) and those in \( R^I \) (note that such idea is pervasive in the study of registers automata, e.g. to recognise the projection over finite labels). Thus, a constraint is a subset \( C \subset R^S \times R^I \), which is intended to be a set of equalities between the content of registers in \( R^S \) and in \( R^I \). Then, knowing tests \( \text{tst}^S, \text{tst}^I \) and assignments \( \text{asgn}^S, \text{asgn}^I \) performed by \( S \) and \( I \) respectively allows to update the constraints: we define

\begin{align*}
\text{next}(C, \text{tst}^S, \text{asgn}^S, \text{tst}^I, \text{asgn}^I) &= C\backslash((\text{asgn}^S \times \text{tst}^I) \cup (R^S \times \text{asgn}^I)) \\
&\cup((\text{tst}^S \cup \text{asgn}^S) \times (\text{tst}^I \cup \text{asgn}^I))
\end{align*}

For instance, assume \( R^I = \{r_1, r_2\} \) and \( R^S = \{s_1, s_2\} \), and at some point in a run, we have\(^3\) \( C = \{(s_2, r_1), (s_2, r_2)\} \), i.e. \( s_2 = r_1 = r_2 \) and \( s_1 \neq r_1 \) (inequalities are implicit, since \( C \) is an exhaustive list of equalities). Now, \( S \) reads some data \( d \) which satisfies the tests \( \text{tst}^S = \{s_1\} \) in \( S \) and \( \text{tst}^I = \emptyset \) in \( I \) (such tests are consistent because \( s_1 \neq r_1, r_2 \)), and conducts assignments \( \text{asgn}^S = \emptyset \) and \( \text{asgn}^I = \{r_2\} \). Then, on the one hand, \( s_1 = r_2 \) (both contain \( d \)), and on the other hand \( s_2 = r_1 \) (since the content of those registers did not change). Moreover, \( r_1 \neq r_2 \) since \( r_2 \) has been reassigned and \( s_1 \neq r_1 \) still holds. This is represented by the set of constraints \( C' = \{(s_1, r_2), (s_2, r_1)\} \), and indeed, \( \text{next}(C, \{s_1\}, \emptyset, \emptyset; \{r_2\}) = C' \).

Abstracting the behaviour of \( I \) modifies its language (it somehow simplifies it), but we will see that what we build is still an implementation.

\section*{Definition of \( I' \)} We build \( I' = (\Sigma_3, \Sigma_0, D, Q, q_0, \delta, R^S) \), where \( Q = Q^S \times Q^I \times 2^{R^S \times R^I} \) and \( q_0 = (q_0^S, q_0^I, R^S \times R^I) \); we now define \( \delta \). For each state \( (q_0^S, q_0^I, C) \in Q \), for each input test \( (\sigma_3, \text{tst}^S_3) \in \Sigma_3 \times \text{Tst}^{R^S} \), we construct a transition \( t = (q_3^S, q_3^I, C) \xrightarrow{i} (q_3^S, q_3^I, C') \) whenever there exist the following transitions of \( S \) and \( I \):

\begin{align*}
t_3^S &= q_3^S \xrightarrow{\sigma_3, \text{tst}^S_3, \text{asgn}^S} q_3^S \\
t_0^S &= q_0^S \xrightarrow{\sigma_0, \text{tst}^S_0, \text{asgn}^S} q_0^S \\
t_I^I &= q_I^I \xrightarrow{\sigma_I, \text{tst}^I, \text{asgn}^I, d, \text{asgn}_0} q_I^I
\end{align*}

such that, for some fixed arbitrary order on \( R^S \), we have:

\begin{enumerate}
\item \( \text{tst}^I = \{r \in R^I \mid \exists s \in \text{tst}^S, (s, r) \in C\} \)
\item \( C' = \text{next}(C, \text{tst}^S_3, \text{asgn}^S, \text{tst}^I, \text{asgn}^I) \)
\item \( \text{tst}^S_0 = \{s \in R^S \mid (s, r_0) \in C'\} \)
\item \( s_0 = \min \text{tst}^S_0 \)
\end{enumerate}

\(^3\) For readability, we confuse a register with its content.
Item (i) ensures with the help of constraints that in any reachable configuration of $I'$, there exists at least one input data which satisfies both $\text{tst}^S$ and $\text{tst}^I$, which allows $I'$ to synchronise $S$ with $I$. Note that this does not mean that $I'$ is the synchronous product of $S$ and $I$ on any input: since $I'$ only has the registers of $S$, it cannot discriminate data as subtly as $I$, and might thus adopt a different behaviour. For instance, it can be that upon reading some input data word, at some point, $I$ would store some input data $d$ in some register $r$ that $S$ would not, and use it later on in a test $\text{tst}^I = \{r\}$ to take different actions, while neither $I'$ nor $S$ could discriminate between those choices: on reading $d$, $I'$ simulates $S$ with $\text{tst}^S = \emptyset$ and synchronously simulates in $I$ the transition with input test $\text{tst}^I = \emptyset$. Nevertheless, we show the existence of some relational data word common to $I$ and $I'$ for each run of $I'$ (which is also a run of $S$). This is sufficient to conclude that $I'$ realises $S$, because then each run of $I'$, interpreted as a run of $S$, is accepting. Then, items (iii) and (iv) ensures the same property as item (i) does, but this time on output positions.

We shall see that for a transition $I$ such that $(q^S, q^I, C)$ is accessible on some finite input data word, $t^S_i, u^S_i$ and $t^I_i$ exist and are unique. So, for a run $ρ = t_1 t_2 \ldots$ of $I'$, we define $ρ^S = t_1^S t_2^S t_3^S \ldots$ and $ρ^I = t_1^I t_2^I \ldots$

Proof of Correctness. Let us show that $I'$ is indeed a transducer realising $S$: we show that for all $u \in \text{DW}(Σ_2, D)$, there exists a unique sequence of transitions $ρ$ in $I'$, a unique output data word $u_0 \in \text{DW}(Σ_0, D)$ (we denote $u = ⟨u_0, u_o⟩$) and a $w \in L(I)$ such that:

1. $ρ^I$ is the run of $I$ over $w$
2. $ρ^S$ is the run of $S$ over $w$
3. $ρ$ is the run of $I'$ over $u$
4. $ρ^S$ is the run of $S$ over $u$

Note that the above properties imply $\text{lab}(u) = \text{lab}(w)$, but it can be that $u \neq w$, which is consistent with the observations we made. Let us show that they entail the result we need: let $u \in \text{DW}(Σ_3, D)$ be some input data word. By property 3, $f_{I'}(u)$ exists and is unique, so $I'$ has total domain. Now, by denoting $ρ^S$ the run of $S$ over $u = ⟨u_0, f_{I'}(u_0)⟩$, we know by property 2 that there exists $w \in L(I)$ such that $ρ^S$ is the run of $S$ over $w$. Then, $ρ^S$ is accepting because $I$ realises $S$ so $w \in L(S)$, hence $u \in L(S)$. Thus, $I'$ realises $S$.

The proof of properties 1-4 is rather technical, and can be found in [7].

Such result allows us to reduce unbounded synthesis to bounded synthesis for $\text{DRA}_S$. Bounded synthesis is in $\text{ExpTime}$ for $\text{DRA}$ (Thm. 13) and is the topic of the next section.

**Theorem 6.** $\text{Real}(\text{DRA}_S, \text{RT})$ is decidable in $\text{ExpTime}$.

## 5 Bounded Synthesis: A Generic Approach

In this section, we study the setting where target implementations are register transducers in the class $\text{RT}[k]$, for some $k \geq 0$ that we now fix for the whole section. For the complexity analysis, we assume $k$ is given as input, in unary. Indeed, describing a $k$-register automaton in general requires $O(k)$ bits, and not $O(\log k)$ bits. We prove the decidable cases of the first line of Table 1 (page 3), by reducing the problems to realisability problems for data-free specifications.

**Abstract Actions.** We reduce the problem to a finite alphabet problem. Since we synthesise $k$-register transducers, we take the input and output actions of the transducers as symbols of our finite input and output alphabets. Let $R_k = \{1, \ldots, k\}$ and $\text{Tst}_k = \text{Asgn}_k = 2^{R_k}$. The
finite input actions are \( A^k_i = \Sigma_i \times \text{Tst}_k \) which corresponds to picking a label and a test over the \( k \) registers, and the output actions are \( A^k_o = \Sigma_o \times \text{Asgn}_k \times R_k \), corresponding to picking some output symbol, some assignment and some register whose content is to be output.

An alternating sequence of actions \( \pi = (\sigma_1^1, \text{test}_1)(\sigma_1^2, \text{asgn}_1, r_1) \cdots \in (A^k_i A^k_o)^\omega \) abstracts a set of relational data words of the form \( w = (\sigma_1^1, d_1^1)(\sigma_1^2, d_1^2) \cdots \in \text{RW}(\Sigma_i, \Sigma_o, D) \) via a compatibility relation that we now define. We say that \( w \) is compatible with \( \pi \) if there exists a sequence of register configurations \( \tau_0 \tau_1 \cdots \in (R_k \to D)^\omega \) such that \( \tau_0 = \tau^R_k \) and for all \( i \geq 1 \), \( \tau_i, d_i^i \models \text{test}_i \), \( d_i^0 = \tau_i(r_i) \) and \( \tau_{i+1} = \text{next}(\tau_i, d_i^i, \text{asgn}_i) \). Note that this sequence is unique if it exists. We denote by \( \text{Comp}(\pi) \) the set of relational data words compatible with \( \pi \).

Given a specification \( S \), we let \( W_{S,k} = \{ \pi \mid \text{Comp}(\pi) \subseteq S \} \). The set \( W_{S,k} \) can be seen as a specification over the finite input (resp. output) alphabets \( A^k_i \) (resp. \( A^k_o \)).

**Theorem 7** (Transfer). Let \( S \) be a data word specification. The following are equivalent:

1. \( S \) is realisable by a transducer with \( k \) registers.
2. The (data-free) word specification \( W_{S,k} \) is realisable by a (register-free) finite transducer.

**Proof.** Let \( T \) be a transducer with \( k \) registers realising \( S \). The transducer \( T \) can be seen as a finite transducer \( T' \) over input alphabet \( A^k_i \) and output alphabet \( A^k_o \). Indeed, since the transition function of \( T \) is total, it is also the case of \( T' \) (this is required by the definition of transducer defining implementations).

Let us show that \( W_{S,k} \) is realisable by \( T' \), i.e. \( L(T') \subseteq W_{S,k} \). Take a sequence \( \pi = a_1 e_1 a_2 e_2 \cdots \in L(T') \). We show that \( \text{Comp}(\pi) \subseteq S \). Let \( w \in \text{Comp}(\pi) \). Then, there exists a run \( q_0 q_1 q_2 \cdots \) of \( T' \) on \( \pi \) since \( \pi \in L(T') \). By definition of compatibility for \( w \), there exists a sequence of register configurations \( \tau_0 \tau_1 \cdots \in (R_k \to D)^\omega \) satisfying the conditions in the definition of compatibility. From this we can deduce that \( (q_0, \tau_0)(q_1, \tau_1) \cdots \) is an initial sequence of configurations of \( T \) over \( w \), so \( w \in L(T) \). Finally, \( L(T) \subseteq S \), since \( T \) realises \( S \).

Conversely, suppose that \( W_{S,k} \) is realisable by some finite transducer \( T' \) over the input (output) alphabets \( A^k_i \) (\( A^k_o \)). Again, the transducer \( T \) can be seen as a transducer with \( k \) registers over data words. We show that \( T \) realises \( S \), i.e., \( L(T) \subseteq S \). Let \( w \in L(T) \).

The run of \( T \) over \( w \) induces a sequence of actions \( \pi \) in \( (A^k_i A^k_o)^\omega \) which, by definition of compatibility, satisfies \( w \in \text{Comp}(\pi) \). Moreover, \( \pi \in L(T') \). Hence, since \( T' \) realises \( W_{S,k} \), we get \( \text{Comp}(\pi) \subseteq S \), so \( w \in S \), concluding the proof.

### 5.1 The case of URA specifications

In this section, we show that for any \( S \) a data word specification given as some URA, the language \( W_{S,k} \) is effectively \( \omega \)-regular, entailing the decidability of \( \text{REAL}(\text{URA}, \text{RT}[k]) \), by Theorem 7 and the decidability of (data-free) synthesis. Let us first prove a series of intermediate lemmas.

We define an operation \( \otimes \) between relational data words \( w \in \text{RW}(\Sigma_i, \Sigma_o, D) \) and sequences of actions \( \pi \in (A^k_i A^k_o)^\omega \) as follows: \( w \otimes \pi \in \text{RW}(A^k_i, A^k_o, D) \) is defined only if for all \( i \geq 1 \), \( \text{lab}(w[i]) = \text{lab}(\pi[i]) \) where \( \text{lab}(\pi[i]) \) is the first component of \( \pi[i] \), (a label in \( \Sigma_i \cup \Sigma_o \)), by \( (w \otimes \pi)[i] = (\pi[i], \text{dt}(w[i])) \).

**Lemma 8.** The language \( L_k = \{ w \otimes \pi \mid w \in \text{Comp}(\pi) \} \) is definable by some NRA.

**Proof.** We define an NRA with \( k \) registers which roughly follows the actions it reads on its input. Its set of states is \( \{ q \} \cup \text{Asgn}_k \), with initial state \( q \). In state \( q \), it is only allowed to read labelled data in \( A^k_i \times D \). On reading \((\sigma_1, \text{tst}, d)\), it guesses some assignment \( \text{asgn} \), performs
the test \texttt{tst} and the assignment \texttt{asgn} and goes to state \texttt{asgn}. In any state \texttt{asgn} \in \text{Asgn}_{RT}, it is only allowed to read labelled data of the form \((s_\sigma, \text{asgn}, r, d)\), for which it tests whether \(d\) is equal to the content of \(r\). It does no assignment and moves back to state \(q\). All states are accepting (i.e. have maximal even parity 2). Such NRA has size \(O(2^k)\).

Let \(S\) be a specification defined by some \text{URA} \(A_S\) with set of states \(Q\). The following subset of \(L_k\) is definable by some NRA, where \(\overline{S}\) denotes the complement of \(S\):

\begin{lemma}
The language \(L_{\overline{S}, k} = \{ w \otimes \pi \mid w \in \text{Comp}(\pi) \cap \overline{S}\} \) is definable by some NRA.
\end{lemma}

\begin{proof}
Since \(S\) is definable by the \text{URA} \(A_S\), \(\overline{S}\) is NRA-definable with the same automaton, denoted now \(A_{\overline{S}}\), interpreted as an NRA. Let \(B\) be some NRA defining \(L_k\) (it exists by Lemma 8). It now suffices to take a product of \(A_{\overline{S}}\) and \(B\) to get an NRA defining \(L_{\overline{S}, k}\).
\end{proof}

Given a data word language \(L\), we denote by \(\text{lab}(L) = \{ \text{lab}(w) \mid w \in L\} \) its projection on labels. The language \(W_{S,k}\) is obtained as the complement of the label projection of \(L_{\overline{S}, k}\):

\begin{lemma}
\(W_{S,k} = \text{lab}(L_{\overline{S}, k})\).
\end{lemma}

\begin{proof}
Let \(\pi \in (A_1^+ A_2^+)^\omega\). Then, \(\pi \notin W_{S,k} \iff \text{Comp}(\pi) \not\subseteq S \iff \exists w \in \text{RW}, w \in \text{Comp}(\pi) \cap \overline{S} \iff \exists w \in \text{RW}, w \otimes \pi \in L_{\overline{S}, k} \iff \pi \in \text{lab}(L_{\overline{S}, k})\).
\end{proof}

We are now able to show regularity of \(W_{S,k}\):

\begin{lemma}
Let \(S\) be a data word specification, \(k \geq 0\). If \(S\) is definable by some \text{URA} with \(n\) states and \(r\) registers, then \(W_{S,k}\) is effectively \(\omega\)-regular, definable some deterministic parity automaton with \(O(2^{n^2 16(r+k)^2})\) states and \(O(n 4^{(r+k)^2})\) priorities.
\end{lemma}

\begin{proof}
First, \(L_{\overline{S}, k}\) is definable by some NRA with \(O(2^{k^2} n)\) states and \(O(r + k)\) registers by Lemma 10, obtained as product between the NRA \(A_{\overline{S}}\) and the automaton obtained in Lemma 8, of size \(O(2^{k^2})\). It is known that the projection on the alphabet of labels of a language of data words recognised by some \text{NRA} is effectively regular [10]. The same construction, which is based on extending the state space with register equality types, carries over to \(\omega\)-words, and one obtains a non-deterministic parity automaton with \(O(n 4^{(r+k)^2})\) states and \(d\) priorities recognising \(\text{lab}(L_{\overline{S}, k})\). It can be complemented into a deterministic parity automaton with \(O(2^{n^2 16(r+k)^2})\) states and \(O(n 4^{(r+k)^2})\) priorities using standard constructions [15].
\end{proof}

We are now able to reprove the following result, known from [12]:

\begin{theorem}
For all \(k \geq 0\), \text{Real}(\text{URA}, \text{RT}[k])\) is in \text{2ExpTime}.
\end{theorem}

\begin{proof}
By Lemma 11, we construct a deterministic parity automaton \(P_{S,k}\) for \(W_{S,k}\). Then, according to Theorem 7, it suffices to check whether it is realisable by a (register-free) transducer. This is decidable by Theorem 1. The way to decide it is to see \(P_{S,k}\) as a two-player parity game and check whether the protagonist has a solution. Parity games can be solved in time \(O(m^{\log d})\) [3] where \(m\) is the number of states of the game and \(d\) the number of priorities. Overall, it requires doubly exponential time, more precisely in \(O(2^{n^2 16(r+k)^2})\).
\end{proof}

In [11], it is shown that the complexity of the problem is actually only singly exponential in \(k\), and such analysis extends to our construction. Similarly, when the specification automaton is deterministic, we can show that:

\begin{theorem}
\text{Real}(\text{DRA}, \text{RT}[k])\) is in \text{ExpTime}.
\end{theorem}
The case of test-free NRA specifications

Unfortunately, by Theorem 3, the synthesis problem for specifications expressed as NRA is undecidable, even when the number of registers of the implementation is bounded. And indeed, if we mimic the reasoning of the previous section, Lemma 10 does not allow to conclude because $L_{\Sigma, k}$ is definable by a URA but the string projection of a URA is not $\omega$-regular in general. E.g., consider $L = \{(r, d_1) \dots (r, d_n) | \forall i \neq j, d_i \neq d_j \land \forall 1 \leq i \leq n, \exists j, d_j' = d_i\}$, which consists in a word $w \in r^n$ with pairwise distinct data followed by a word $w \in g^m$ which contains at least all the data of $w$ (it is over finite words for simplicity but can be extended to $\omega$-words). $L$ is recognised by the URA which, on reading $(r, d_i)$, universally triggers a run checking three properties: firstly, once a label $g$ is read, only $g$’s are read, secondly, $(r, d_i)$ does not appear again, and thirdly, $(g, d_i)$ appears at least once. Now, it is readily seen that $\text{lab}(L) = \{r^ng^m | m \geq n\}$, which is not regular. Here, we defined $L$ over finite words for simplicity but it is easily extended to an $\omega$-language which is not $\omega$-regular.

Thus, in this section, we consider a restriction on NRA which do not perform tests on input data. A test-free register automaton is a tuple $A = (\Sigma_\delta, \Sigma_\omega, D, Q, q_0, \delta, R, c)$ such that $\delta \subseteq Q \times \Sigma_\delta \times \text{Asgn}_R \times \Sigma_\omega \times R \times Q$. Such a register automaton reads two labelled data at once. In a configuration $(q, \tau)$, when reading $(\sigma_1, d_i)(\sigma_o, d_o)$, it can fire any transition of the form $(q, \sigma_1, \text{asgn}, r, \sigma_o, q') \in \delta$ such that $\tau(r) = d_o$ and move to configuration $(q', \tau')$ where $\tau' = \text{next}(\tau, \text{asgn}, d_i)$. It is easily seen that a test-free register automaton can be converted into a proper register automaton, justifying its name. Such automata will be interpreted by a non-deterministic parity acceptance condition; we denote the class $\text{NRA}_\text{tf}^4$.

It is not clear whether $W_{S,k}$ is regular for such specifications, but we show that it suffices to consider another set denoted $W_{S,k}^{\text{tf}}$ which is easier to analyse (and can be proven regular), and based on the behaviour of $S$ over input with pairwise distinct data. The intuition behind restricting to such case is that $\text{NRA}_\text{tf}$ cannot conduct test on input data, so they behave the same on an input word whose data are all distinct, and such choice ensures that two equal input data will not ease the task of the implementation. An interesting side-product of this approach is that it implies that we can restrict to test-free implementations. A test-free transducer is a transducer whose transitions do not depend on tests over input data; formally $\delta : Q \times \Sigma_\delta \rightarrow \text{Asgn}_R \times \Sigma_\omega \times R \times Q$. In the following, we let $\text{AllDiff}$ denote the set of relational data words whose input data are pairwise distinct: $\text{AllDiff} = \{w = (\sigma_1^0, d_i^0)(\sigma_o^0, d_o^0) \cdots \in RW | \forall 0 \leq i < i', d_i^0 \neq d_i'^0\}$; by convention $d_i^0 = d_o^0$.

Proposition 14. Let $S$ be a $\text{NRA}_\text{tf}$ specification. The following are equivalent:

(i) $S$ is realisable

(ii) $W_{S,k}^{\text{tf}} = \{\pi \in (A_\pi^0 A_k^0)^\omega | \text{Comp}(\pi) \cap S \cap \text{AllDiff} \neq \emptyset\}$, where $A_\pi^0 = \Sigma_\delta \times \{0\}$, has domain $(A_k^0)^\omega$ and is realisable by a register-free transducer

(iii) $S$ is realisable by a test-free transducer

Proof. (i) $\Rightarrow$ (ii): If $S$ is realisable, then by Theorem 7 $W_{S,k}$ has total domain and is realisable by some transducer $I$. Now, since transducers are closed under regular domain restriction, $W_{S,k}^{\omega} \subseteq W_{S,k}^{\text{tf}} \cap (A_\pi^0 A_k^0)^\omega$ has domain $(A_\pi^0)^\omega$ and is realisable by $I \cap (A_\pi^0 A_k^0)^\omega$. Moreover, $W_{S,k}^{\omega} \subseteq W_{S,k}^{\text{tf}}$. Indeed, if $\text{Comp}(\pi) \subseteq S$, then, since $S$ has total domain and $\pi \in (A_\pi^0 A_k^0)^\omega$, $\text{Comp}(\pi) \cap S \cap \text{AllDiff} \neq \emptyset$. Thus, $W_{S,k}^{\omega}$ also has domain $(A_\pi^0)^\omega$ and is realisable by any transducer realising $W_{S,k}^{\omega}$.

4 The bounded synthesis of URA is already decidable, so we do not consider their test-free restriction.
Theorem 16. Let $\rho : q_0 \xrightarrow{a_1,\text{asgn}_1,r_1,a_1^1} q_1 \xrightarrow{a_2,\text{asgn}_2,r_2,a_2^2} q_2 \ldots$ corresponds the origin function $o_\rho : j \mapsto \max\{i \leq j \mid r_j \in \text{asgn}_i\}$, with the convention $\max\emptyset = 0$.

Now, for an origin function $o : N \setminus \{0\} \to N$ and for a relational data word $w \in RW$, we say $w$ is compatible with the origin function $o$, denoted $w \models o$, whenever for all $j \geq 1$, $dt((\text{out}(w))[j]) = dt((\text{inp}(w))[o(j)])$, with the convention $dt((\text{inp}(w))[0]) = d_0$.

The following lemma shows that actual data values in a word $w$ do not matter with respect to membership in some NRA$_{df}$ only the compatibility with origin functions does:

Lemma 15. Let $w \in RW$ and $\rho$ a sequence of transitions of some NRA$_{df}$. Then,

(i) If $\rho$ is a run over $w$, then $w \models o_\rho$.
(ii) If $\rho$ is a run over $w$ and $w \in \text{ALLDIFF}$, then for all $o : N \setminus \{0\} \to N$, $w \models o \iff o = o_\rho$.
(iii) If $w$ and $\rho$ have the same finite labels and if $w \models o_\rho$, then $\rho$ is a run over $w$.

Proof. (i) and (iii) follow from the semantics of NRA$_{df}$, which do not conduct any test on the input data. The direction (ii) is exactly (i). Now, assume $w \in \text{ALLDIFF}$ admits $\rho$ as a run, and let $o$ such that $w \models o$. Then, let $j \geq 1$ be such that $dt((\text{out}(w))[j]) = dt((\text{inp}(w))[o(j)])$. By (i) we know that $dt((\text{out}(w))[j]) = dt((\text{inp}(w))[o_\rho(j)])$, so $dt((\text{inp}(w))[o(j)]) = dt((\text{inp}(w))[o_\rho(j)])$. Since $w \in \text{ALLDIFF}$, this implies $o(j) = o_\rho(j)$, so, overall, $o = o_\rho$.

Now, assume $W^\text{df}_{S,k}$ is realisable by some transducer $I$. We show that $I$, when ignoring the test inputs, is actually an implementation of $S$. Thus, let $I'$ be the same transducer as $I$ except that all input transitions $(\sigma_i, \varnothing)$ are now simply labelled $\sigma_i$. Note that $I'$, interpreted as a register transducer, is test-free. Let $w \in DW$, and $\pi_k = \text{lab}(w) \times \varnothing^\omega$ be the input action in $A^\omega_k$ with same finite labels as $w$. Let $\overline{\pi} = I(\pi_k)$, and let $w' \in \text{Comp}(\overline{\pi}) \cap S \cap \text{ALLDIFF}$ (such $w'$ exists because $W^\text{df}_{S,k}$ has domain $(A^\omega_k)^\omega$ and $I$ realises $W^\text{df}_{S,k}$). Then, since $\text{lab}(w) = \text{lab}(w')$, they admit the same run $\rho'$ in $I$, so $w', w' \models o_{\rho'}$. Then, $w' \in S$, so it admits an accepting run $\rho^S$ in $S$, which implies $w' \models o_{\rho^S}$. Moreover, $w' \in \text{ALLDIFF}$ so, by Lemma 15 ii, we get $o_{\rho'} = o_{\rho^S}$. Therefore, $w \models o_{\rho^S}$, so, by iii, $w$ admits $\rho^S$ as a run, i.e., $w \in S$. Overall, $L(I) \subseteq S$ meaning that $I$ is a (test-free) implementation of $S$. End of proof of Prop. 14.

Finally, $W^\text{df}_{S,k} = \{\overline{\pi} \in (A^\omega_k A^k_0)^\omega \mid \text{Comp}(\overline{\pi}) \cap S \cap \text{ALLDIFF} \neq \emptyset\}$ is regular. Indeed, $W^\text{df}_{S,k} = \{\overline{\pi} \in (A^\omega_k A^k_0)^\omega \mid \text{Comp}(\overline{\pi}) \cap S^\omega \neq \emptyset\}$, where $S^\omega$ is the same automaton as $S$ except that all transitions $q \xrightarrow{a_1,\text{asgn},r,a_2} q'$ have been replaced with $q \xrightarrow{a_1,\text{asgn},r,a_2} q'$, because, for all $\overline{\pi} \in (A^\omega_k A^k_0)^\omega$, $\text{Comp}(\overline{\pi}) \cap S \cap \text{ALLDIFF} \neq \emptyset \Leftrightarrow \text{Comp}(\overline{\pi}) \cap S^\omega \neq \emptyset$ (the direction is trivial, and the $\Rightarrow$ stems from the fact that an ALLDIFF input only takes $\text{tst} = \emptyset$ transitions).

Then, $L^k_{S,k} = \{w \in RW \land (A^\omega_k A^k_0)^\omega \mid w \in \text{Comp}(\overline{\pi}) \cap S^\omega\}$ is NRA-definable. Indeed, $S$ is NRA-$\text{df}$-definable, so $S^\omega$ is NRA-definable, and by Lemma 8, $L_k = \{w \in \overline{\pi} \mid w \in \text{Comp}(\overline{\pi})\}$ is NRA-definable, so their product recognises $L^k_{S,k}$. Finally, $W^\text{df}_{S,k} = \text{lab}(L^k_{S,k})$, and the projection of a NRA over some finite alphabet is regular [10].

Overall, by Theorems 1 and 7, we get (the complexity analysis is the same as for URA):

Theorem 16. For all $k \geq 0$, $\text{REAL}(\text{NRA}_{df}, \text{RT}[k])$ is decidable and in $2\text{EXP}$-TIME.
6 Synthesis and Uniformisation

In this section, we discuss the relation between realisability and uniformisation of relations: in this paper, if $S$ is realisable by a register transducer, then, in particular, it has universal domain, i.e. $\text{inp}(S) = \text{DW}(\Sigma_i, D)$, otherwise it cannot be that $L(T) \subseteq S$ for $T$ a register transducer, since by definition $\text{inp}(T) = \text{DW}(\Sigma_i, D)$. However, when defining a specification, the user might be interested only in a subset of behaviours (for instance, s/he knows that all input data will be pairwise distinct). In the finite alphabet setting, since the formalisms used to express specifications are closed under complement (whether it is LTL or $\omega$-automata), it suffices to complete the specification by allowing any behaviour on the input not considered. However, since register automata are not closed under complement, such approach is not sufficient here. Thus, it is relevant to generalise the realisability problem to the case where the domain of the specification is not universal. This can be done by equipping register transducers with an acceptance condition. It is also necessary to adapt the notion of realisability; otherwise, any transducer accepting no words realises any specification. A natural way is to consider synthesis as an instance of the uniformisation problem. An (implementation) function $f : I \rightarrow O$ is said to uniformise a (specification) relation $R \subseteq I \times O$ whenever $\text{dom}(f) = \text{dom}(R)$ and for all $i \in \text{dom}(f)$, $(i, f(i)) \in R$. In the context of reactive synthesis, where $f = f_I$ is defined from an implementation $I$ and $R$ is given as a language of relational words, it can be rephrased as $\text{inp}(L(I)) = \text{inp}(R)$ and for all $w_3 \in \text{inp}(L(I)), (w_3, f_I(w_3)) \in R$. Note that such definition coincides with the one of realisability when the class of implementations has universal domain. In the following, we denote by $\text{Unif}(S, I)$ the uniformisation problem from specifications in $S$ to implementations in $I$. Unfortunately, this setting is actually much harder, as shown by the next two theorems:

▶ Theorem 17. Given $S$ a specification represented by a DRA, checking whether $\text{inp}(S) = \text{DW}(\Sigma_i, D)$ is undecidable.

While the uniformisation setting obviously preserves the undecidability results from the synthesis setting, the above result allows to show that the somehow more general uniformisation problem is undecidable. For instance, we can prove:

▶ Theorem 18. For all $k \geq 1$, $\text{Unif}(\text{URA},\text{RT}[k])$ is undecidable.

If the domain is DRA-recognisable, it is possible to reduce the uniformisation problem to realizability, by allowing any behaviour on the complement on the domain (which is DRA-recognisable). However, such property is undecidable as a direct corollary of Theorem 17.

7 Conclusion

In this paper, we have given a picture of the decidability landscape of the synthesis of register transducers from register automata specifications. We studied the parity acceptance condition because of its generality, but our results allow to reduce the synthesis problem for register automata specifications to the one for finite automata while preserving the acceptance condition. We have also introduced and studied test-free NRA, which do not have the ability to test their input, but still have the power of duplicating, removing or copying the input data to form the output. We have shown that they allow to recover decidability in presence of non-determinism, in the bounded case. We leave open the unbounded case, which we conjecture to be decidable. As future work, we want to study synthesis problems for specifications given by logical formulae, for decidable data words logics such as two-variable fragments of FO [2, 17, 4].
References