First-Order Intuitionistic Linear Logic and Hypergraph Languages
Abstract
The Lambek calculus is a substructural logic known to be closely related to formal language theory: on the one hand, it is used for generating formal languages by means of categorial grammars and, on the other hand, it has formal language semantics, with respect to which it is sound and complete. This paper studies a similar relation between first-order intuitionistic linear logic ILL1, along with its multiplicative fragment MILL1, on the one hand and hypergraph grammar theory on the other.
In the first part, we introduce a novel concept of hypergraph first-order logic categorial grammar, which is a generalisation of string MILL1 grammars introduced in Richard Moot’s works. We prove that hypergraph ILL1 grammars generate all recursively enumerable hypergraph languages and that hypergraph MILL1 grammars are as powerful as linear-time hypergraph transformation systems. In addition, we show that the class of languages generated by string MILL1 grammars is closed under intersection and that it includes a non-semilinear language as well as an NP-complete one. This shows how much more powerful string MILL1 grammars are as compared to Lambek categorial grammars.
In the second part, we develop hypergraph language models for MILL1. In such models, formulae of the logic are interpreted as hypergraph languages and multiplicative conjunction is interpreted using parallel composition, which is one of the operations of HR-algebras introduced by Courcelle. We prove completeness of the universal-implicative fragment of MILL1 with respect to these models and thus present a new kind of semantics for a fragment of first-order linear logic.
Keywords and phrases: linear logic, categorial grammar, MILL1 grammar, first-order logic, hypergraph language, graph transformation, language semantics, HR-algebra

Category: Track B: Automata, Logic, Semantics, and Theory of Programming

2012 ACM Subject Classification: Theory of computation → Linear logic; Theory of computation → Grammars and context-free languages; Theory of computation → Rewrite systems

Acknowledgements: I thank Richard Moot and Sergei Slavnov for their attention to my work and for the productive discussions. I am also grateful to the anonymous reviewers for valuable remarks.

Funding: This work was supported by the Russian Science Foundation under grant no. 23-11-00104, https://rscf.ru/en/project/23-11-00104/.

Editors: Keren Censor-Hillel, Fabrizio Grandoni, Joël Ouaknine, and Gabriele Puppis
1 Introduction
There is a strong connection between substructural logics, especially non-commutative ones, and the theory of formal languages and grammars [5, 23]. This connection is two-way. On the one hand, a logic can be used as a derivational mechanism for generating formal languages, which is the essence of categorial grammars. One prominent example is Lambek categorial grammars based on the Lambek calculus L [18]; formulae of the latter are built using multiplicative conjunction "•" and two directed implications "\", "/". In a Lambek categorial grammar, one assigns a finite number of formulae of L to each symbol of an alphabet and chooses a distinguished formula S; then, the grammar accepts a string a_1…a_n if the sequent A_1, …, A_n ⊢ S is derivable in L where, for each i, A_i is one of the formulae assigned to a_i. A famous result by Pentus [25] says that Lambek categorial grammars generate exactly the context-free languages (without the empty word, to be precise).
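As a quick illustration (a standard textbook example, not specific to this paper): assigning np to "John" and np\s to "sleeps", the string "John sleeps" is accepted with the distinguished formula s because the following sequent is derivable:

```latex
\[
\frac{np \vdash np \qquad s \vdash s}
     {np,\; np \backslash s \vdash s}\ (\backslash L)
\]
```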
On the other hand, algebras of formal languages can serve as models for substructural logics. For example, one can define language semantics for the Lambek calculus as follows: a language model is a function w mapping formulas of L to formal languages such that w(A • B) = w(A)·w(B), w(A \ B) = {u : w(A)·{u} ⊆ w(B)}, and w(B / A) = {u : {u}·w(A) ⊆ w(B)}; a sequent A_1, …, A_n ⊢ B is interpreted as the inclusion w(A_1)·…·w(A_n) ⊆ w(B). Another famous result by Pentus [26] is that L is sound and complete w.r.t. language semantics; strong completeness for the fragment of L without "•" had been proved earlier by Buszkowski in [4] using canonical models.
Numerous variants and extensions of the Lambek calculus have been studied, including its nonassociative version [23], its commutative version [37] (i.e. multiplicative intuitionistic linear logic), the multimodal Lambek calculus [19], the displacement calculus [24] etc. These logics have many common properties, which motivates the search for a unifying logic. One such "umbrella" logic is first-order multiplicative intuitionistic linear logic MILL1 [20, 22], which is the multiplicative fragment of first-order intuitionistic linear logic ILL1. The Lambek calculus can be embedded in MILL1 [22]: in the translation, each atom receives a pair of variables marking the string positions it spans, and these variables "fix" the order of formulae. Although MILL1 is a first-order generalisation of L, derivability problems in these logics have the same complexity, namely, they are NP-complete. One can define MILL1 categorial grammars in the same manner as Lambek categorial grammars [20, 34]. The former generalise the latter, hence generate all context-free languages (see the definition of the latter in [14]). Moot proved in [20] that MILL1 grammars generate all multiple context-free languages, hence some non-context-free ones.
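For concreteness, the embedding can be sketched as follows (this is the standard position-variable translation in the style of [20, 22]; the notation ‖A‖^{x,z} for "A spanning positions x to z" is ours):

```latex
\[
\begin{aligned}
\|p\|^{x,z}              &= p(x,z)\\
\|A \bullet B\|^{x,z}    &= \exists y\,\bigl(\|A\|^{x,y} \otimes \|B\|^{y,z}\bigr)\\
\|A \backslash B\|^{y,z} &= \forall x\,\bigl(\|A\|^{x,y} \multimap \|B\|^{x,z}\bigr)\\
\|B / A\|^{x,y}          &= \forall z\,\bigl(\|A\|^{y,z} \multimap \|B\|^{x,z}\bigr)
\end{aligned}
\]
```

A Lambek sequent $A_1, \ldots, A_n \vdash B$ then corresponds to the MILL1 sequent $\|A_1\|^{x_0,x_1}, \ldots, \|A_n\|^{x_{n-1},x_n} \vdash \|B\|^{x_0,x_n}$.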
It turns out that the interplay between propositional substructural logics and formal grammars can be elevated fruitfully to that between first-order substructural logics and hypergraph grammars. This is the subject of this article. Hypergraph grammar approaches generate sets of hypergraphs; usually they are designed as generalisations of grammar formalisms for strings. For example, hyperedge replacement grammar [8] is a formalism that extends context-free grammar. A rule of a hyperedge replacement grammar allows one to replace a hyperedge in a hypergraph by another hypergraph; see an example below.
Note that, in this approach, hyperedges but not nodes are labeled. Some of the nodes, called external, are distinguished in a hypergraph; in the above example, these are the specially marked nodes. External nodes are needed to specify how hyperedge replacement is done. A more general approach, which corresponds to type-0 grammars in the Chomsky hierarchy, is hypergraph transformation systems (the term is taken from [17]), which allow one to replace a subhypergraph of a hypergraph by another hypergraph.
Naturally, a hypergraph can be represented by a linear logic formula. Namely, one can interpret hyperedges as predicates, nodes as variables, and external nodes as free variables. For example, the hypergraph in the box from Figure 1 can be converted into a formula with one atom per hyperedge. This idea underlies the concept of hypergraph first-order categorial grammars, which we introduce in Section 3. Roughly speaking, given a first-order logic such as ILL1, a hypergraph categorial grammar takes a hypergraph, assigns a formula of the logic to each of its hyperedges and nodes, converts the resulting hypergraph into a sequent and checks whether the sequent is derivable. Using first-order linear logic for generating hypergraph languages is a novel idea which has not yet been explored in the literature. In Section 3, we study the expressive power of the grammars thus defined and prove the following.
-
1.
Hypergraph ILL1 grammars are equivalent to hypergraph transformation systems and thus they generate all recursively enumerable hypergraph languages (Theorem 23). This result relates hypergraph ILL1 grammars to the well-studied approach in the field of hypergraph grammars based on the double pushout graph transformation procedure.
-
2.
Hypergraph MILL1 grammars are at least as powerful as linear-time hypergraph transformation systems (Theorem 25). The latter are hypergraph transformation systems where the length of a derivation is bounded by a linear function of the size of the resulting hypergraph. The linear-time bound has been studied for many grammar formalisms [2, 11, 29, 35], but, to the best of our knowledge, this is the first time it is used for graph grammars.
The proofs partially use techniques from [29], where languages generated by grammars over the commutative Lambek calculus are studied. Compared to [29], the proofs in this paper are more technically involved because of complications arising when working with quantifiers and variables in the first-order setting.
In Section 4, using the methods developed for hypergraph grammars, we establish the following properties of the class of languages generated by string MILL1 grammars.
-
1.
Languages generated by string MILL1 grammars are closed under intersection (Theorem 26). Consequently, non-semilinear languages can be generated by string MILL1 grammars.
-
2.
String MILL1 grammars generate an NP-complete language (Theorem 30). Note that the Lambek calculus is NP-complete [27] but Lambek categorial grammars generate only context-free languages, which are in P. The logic MILL1 is NP-complete as well, but string MILL1 grammars are able to generate non-polynomial languages (assuming P ≠ NP), hence they are much more powerful.
The question whether string MILL1 grammars generate only multiple context-free languages was left open by Richard Moot in [20], and I considered Theorem 30 to be the first one answering it. Recently, however, Sergei Slavnov pointed out an alternative answer to this question, using previously known techniques. Namely, in [21], it is shown that hybrid type-logical grammars can be translated into MILL1 grammars; hybrid type-logical grammars generalise abstract categorial grammars, and it is proved in [32] that the latter generate an NP-complete language. Our proof relies on a different technique, namely, on reducing linear-time hypergraph transformation systems, which are a rule-based approach unlike hybrid type-logical grammars. In general, finding a natural rule-based formalism equivalent to MILL1 grammars is an interesting question to study, and Theorem 30 is a step towards the answer.¹

¹ We believe that linear-time hypergraph transformation systems are essentially equivalent to MILL1 grammars in terms of generative power because, as we conjecture, it is possible to simulate axioms and rules by hypergraph transformation rules and hence to convert hypergraph MILL1 grammars into linear-time transformation systems. However, some subtleties related to the definition of hypergraph transformation rules arise when one tries to do so; we discuss them after the proof of Theorem 25.
The second part of the paper (Section 5), not related to the first one, is devoted to developing hypergraph language semantics for MILL1, thus establishing the converse direction of the connection between first-order linear logic and hypergraph languages. Linear logic is considered as a logic for reasoning about resources [10], and language models for the Lambek calculus are one formalisation of this view, with resources being words; this agrees with the linguistic applications of L. In this paper, we shall show that hypergraphs can be treated as "first-order resources." In a hypergraph language model (Definition 33), formulas are interpreted by sets of hypergraphs, and the tensor connective is interpreted using the parallel composition operation. The latter is a "gluing" of hypergraphs; it is well studied in hypergraph grammar theory, being one of the operations in the language of HR-algebras [7]. Hypergraph language models are a particular case of intuitionistic phase semantics (see the definition in [15]) with the trivial closure operator.
Our main result concerning hypergraph language models is soundness of MILL1 and completeness of its universal-implicative fragment w.r.t. them (Theorem 35). The proof is inspired by Buszkowski's [4] but it is more technically involved, again because of the first-order setting. The importance of this result lies in the fact that hypergraph language models are, to the best of our knowledge, one of the few examples of a specific semantics for a fragment of first-order intuitionistic linear logic which, moreover, is grounded in hypergraph language theory.
2 Preliminaries
In Section 2.1, we introduce notions from the field of graph grammars, and, in Section 2.2, we introduce first-order intuitionistic linear logic.
2.1 Hypergraphs & Hypergraph Transformation Systems
There are many paradigms in the field of graph grammars, including node replacement grammars, hyperedge replacement grammars, algebraic approaches (double pushout, single pushout) with a more categorical flavour, definability in monadic second-order logic etc. We shall work with the definition of a hypergraph from the field of hyperedge replacement grammars [8, 9, 12] because it fits first-order linear logic better than the alternatives. In the hypergraphs we shall deal with, only hyperedges are labeled while nodes play an auxiliary role. Some of the nodes are marked as external; informally, they play the role of gluing points. Throughout the paper, we shall explore a natural correspondence between hypergraphs defined thusly and first-order linear logic formulae. In contrast, there would be no such correspondence if we stuck to the definition of a hypergraph where nodes are labeled and edges are not.
We fix a countable set of selectors. In the grammar-logic correspondence we shall develop, selectors will guide variable substitution.
Definition 1.
A typed alphabet is a set A along with a function type assigning to each a ∈ A a finite set type(a) of selectors.
Definition 2.
Let A be a finite typed alphabet of hyperedge labels. A hypergraph H over A is a tuple H = (V, E, lab, att, ext) where V is a finite set of nodes; E is a finite set of hyperedges; lab : E → A is the labeling function; att, the attachment function, assigns to each e ∈ E a function att(e) : type(lab(e)) → V; and ext is a function with a finite domain of selectors and with values in V. Elements of the image of ext are called external nodes. The set of hypergraphs over A is denoted by H(A). Let type(H) denote the domain of ext.
In drawings of hypergraphs, nodes are depicted as black circles and hyperedges are depicted as labeled rectangles. When depicting a hypergraph H, we draw a line labeled s from e to v if att(e)(s) = v. External nodes are represented by selectors in round brackets: if ext(s) = v, then we mark v as (s).
There is a standard issue with distinguishing between concrete and abstract hypergraphs, i.e. between hypergraphs and their isomorphism classes. When one considers a hypergraph language L, it is reasonable to assume that it consists of abstract hypergraphs (or, equivalently, that it is closed under isomorphism); however, when we write H ∈ L, we assume that H is a concrete hypergraph. Following tradition, we often do not distinguish between abstract and concrete hypergraphs to avoid excessive bureaucracy.
Let us fix two selectors, which we denote s and t. If type(lab(e)) = {s, t}, then the hyperedge e is called an edge and it is depicted by an arrow going from att(e)(s) to att(e)(t).
Definition 3.
A string graph induced by a string w = a_1…a_n is defined as follows: V = {v_0, …, v_n}; E = {e_1, …, e_n}; lab(e_i) = a_i, att(e_i)(s) = v_{i−1}, att(e_i)(t) = v_i (for i = 1, …, n); ext(s) = v_0, ext(t) = v_n.
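As a minimal executable sketch of this definition (the dictionary-based representation and the selector names "s", "t" are our own notational choice):

```python
def string_graph(word):
    """String graph of w = a_1...a_n: nodes v0..vn, one edge per letter a_i
    attached to v_(i-1) (selector "s") and v_i (selector "t"); the first and
    last nodes are marked external."""
    n = len(word)
    nodes = [f"v{i}" for i in range(n + 1)]
    # each hyperedge is a pair (label, {selector: attached node})
    edges = [(word[i], {"s": f"v{i}", "t": f"v{i+1}"}) for i in range(n)]
    return {"nodes": nodes, "edges": edges, "ext": {"s": "v0", "t": f"v{n}"}}

g = string_graph("ab")
```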
Definition 4.
Given a label a, the handle of a is the hypergraph consisting of a single hyperedge labeled a together with one node for each selector in type(a); both the attachment function of the hyperedge and the external function map each selector to its node.
Definition 5.
If H_1 and H_2 are hypergraphs with disjoint sets of nodes and hyperedges and with disjoint types, then their disjoint union is the hypergraph where V = V_1 ∪ V_2, E = E_1 ∪ E_2, lab and att agree with those of H_i on E_i for i = 1, 2, and ext is the union of the functions ext_1 and ext_2.
Now, let us define the hyperedge replacement operation.
Definition 6.
Let H be a hypergraph and let R be a binary relation on V. Let ≈ be the smallest equivalence relation on V containing R. Then H/R is the following hypergraph: its nodes are the equivalence classes V/≈; its hyperedges, labeling function and type are those of H; att(e)(s) = [att_H(e)(s)]_≈; ext(s) = [ext_H(s)]_≈.
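The quotient construction can be sketched in code (a dictionary-based representation of our own; union-find computes the smallest equivalence relation containing the given pairs):

```python
def quotient(H, pairs):
    """H factored by the smallest equivalence relation on its nodes that
    contains the given pairs; each class is represented by a chosen node."""
    parent = {v: v for v in H["nodes"]}
    def find(v):                      # union-find with path halving
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v
    for a, b in pairs:
        parent[find(a)] = find(b)
    cls = {v: find(v) for v in H["nodes"]}
    return {"nodes": sorted(set(cls.values())),
            "edges": [(lb, {s: cls[a[s]] for s in a}) for lb, a in H["edges"]],
            "ext": {s: cls[v] for s, v in H["ext"].items()}}

H = {"nodes": ["u", "v", "w"],
     "edges": [("E", {"s": "u", "t": "v"})],
     "ext": {"s": "w"}}
Q = quotient(H, [("v", "w")])   # identify v with w
```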
Definition 7.
Let H and K be two hypergraphs over A; let e ∈ E_H be a hyperedge such that type(lab_H(e)) = type(K). Then the replacement of e by K in H (the result being denoted by H[e/K]) is defined as follows:
-
1.
Remove e from H and add a disjoint copy of K. Formally, let H′ be the hypergraph such that V′ = V_H ∪ V_K, E′ = (E_H ∖ {e}) ∪ E_K, lab′ is the restriction of lab_H ∪ lab_K to E′, att′ is the restriction of att_H ∪ att_K to E′, and ext′ = ext_H.
-
2.
Glue the nodes that are incident to e in H with the external nodes of K. Namely, let H[e/K] = H′/R where R = {(att_H(e)(s), ext_K(s)) : s ∈ type(K)}.
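The two steps above can be sketched as follows (the dictionary representation and helper names are ours; we assume the selectors of the replaced hyperedge match the type of K):

```python
def replace(H, e_index, K):
    """Replace hyperedge number e_index of H by a disjoint copy of K:
    remove the hyperedge, add K with freshly renamed nodes, then glue the
    copy of K's external node for each selector onto the node of H to which
    the removed hyperedge was attached via that selector."""
    _label, att = H["edges"][e_index]         # att: selector -> node of H
    copy = {v: f"k_{v}" for v in K["nodes"]}  # fresh names for K's nodes
    glue = {copy[K["ext"][sel]]: att[sel] for sel in K["ext"]}
    img = lambda v: glue.get(copy[v], copy[v])  # image of a K-node in the result
    new_nodes = [copy[v] for v in K["nodes"] if copy[v] not in glue]
    new_edges = [(lb, {sel: img(a[sel]) for sel in a}) for lb, a in K["edges"]]
    return {"nodes": H["nodes"] + new_nodes,
            "edges": [e for i, e in enumerate(H["edges"]) if i != e_index] + new_edges,
            "ext": H["ext"]}

# replacing the A-labeled edge of a two-edge string graph by the string graph of "ab"
H = {"nodes": ["u0", "u1", "u2"],
     "edges": [("A", {"s": "u0", "t": "u1"}), ("c", {"s": "u1", "t": "u2"})],
     "ext": {"s": "u0", "t": "u2"}}
K = {"nodes": ["v0", "v1", "v2"],
     "edges": [("a", {"s": "v0", "t": "v1"}), ("b", {"s": "v1", "t": "v2"})],
     "ext": {"s": "v0", "t": "v2"}}
R = replace(H, 0, K)   # the result is the string graph of "abc" up to renaming
```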
Using hyperedge replacement, we define hypergraph transformation systems, a formalism which will be used to describe the expressive power of hypergraph categorial grammars. It enables one to replace a subhypergraph of a hypergraph with another hypergraph.
Definition 8.
A hypergraph transformation rule (ht-rule) is of the form L → R where L and R are hypergraphs of the same type such that ext_L and ext_R are injective.
We say that H is transformed into H′ via the rule r = (L → R), and denote this by H ⇒_r H′ (also by H ⇒_P H′ for a set P of rules containing r), if H = H_0[e/L] and H′ = H_0[e/R] for some hypergraph H_0 and hyperedge e of H_0 such that the attachment att(e) is injective.
A hypergraph transformation system (ht-system) is a tuple where the nonterminal and terminal alphabets are typed alphabets, P is a finite set of hypergraph transformation rules and S is a start hypergraph such that ext_S is injective.
The language generated by the ht-system consists of the hypergraphs over the terminal alphabet derivable from S via rules of P.
The definition of a hypergraph transformation rule, although given in a slightly unconventional way through hyperedge replacement, coincides with the standard notion of a graph transformation rule with injective morphisms in the double pushout approach, taken in the corresponding category of hypergraphs; compare it with [17]. Also, our definition is essentially the same as that from [36] (with the only difference being that the cited paper deals with graphs rather than hypergraphs). In that paper, the following proposition is proved.
Proposition 9.
Hypergraph transformation systems generate all recursively enumerable hypergraph languages.
This is a rather expected result, analogous to string rewriting systems generating all recursively enumerable string languages. To prove Proposition 9, it suffices to show how to convert a string representation of a hypergraph into the hypergraph itself by means of ht-rules.
2.2 First-Order Intuitionistic Linear Logic
We assume the reader's familiarity with the basic principles and issues of first-order logic. Let us fix a countable set of variables and a countable set of predicate symbols with arities. Atomic formulae are of the form p(x_1, …, x_n) where p is a predicate symbol of arity n and x_1, …, x_n are variables. Following [16, 20, 22], we do not allow function symbols; note that complex terms would not fit the "variables are nodes, predicates are hyperedges" paradigm. We also do not allow constants because they can easily be simulated by variables.
Formulae of intuitionistic linear logic ILL1 are built from atomic formulae and propositional constants using the multiplicative connectives ⊗ and ⊸, the additive ones, and the exponential one, along with the quantifiers ∀ and ∃. The multiplicative fragment of ILL1, denoted by MILL1, does not have constants and uses only ⊗ and ⊸ (along with the quantifiers). A sequent is a structure of the form Γ ⊢ A where Γ is a multiset of formulae and A is a formula.
Note that we shall sometimes describe multisets using set-builder-like notation; multiplicities are then counted over the indices, i.e. an element occurs in such a multiset as many times as there are index values producing it.
The only axiom is A ⊢ A. The rules of MILL1 are presented below.
Here t is any variable while y is a variable which is not free in the conclusion; A[x := t] denotes the result of replacing all free occurrences of x in A by t. More generally, if σ is a function defining a correct substitution, then Aσ denotes the result of substituting σ(x) for x for each x ∈ FV(A) (FV(A) is the set of free variables of A). Rules for the additive and exponential connectives of ILL1 can be found e.g. in [33, Appendix E].
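For the reader's convenience, the rules of MILL1 in a standard sequent-calculus presentation (we follow the usual formulation of first-order MILL; side conditions are as stated above, with y not free in the conclusion of (∀R) and (∃L)):

```latex
\[
\frac{\Gamma, A, B \vdash C}{\Gamma, A \otimes B \vdash C}\ (\otimes L)
\qquad
\frac{\Gamma \vdash A \quad \Delta \vdash B}{\Gamma, \Delta \vdash A \otimes B}\ (\otimes R)
\qquad
\frac{\Gamma \vdash A \quad \Delta, B \vdash C}{\Gamma, \Delta, A \multimap B \vdash C}\ (\multimap L)
\qquad
\frac{\Gamma, A \vdash B}{\Gamma \vdash A \multimap B}\ (\multimap R)
\]
\[
\frac{\Gamma, A[x := t] \vdash C}{\Gamma, \forall x\, A \vdash C}\ (\forall L)
\qquad
\frac{\Gamma \vdash A[x := y]}{\Gamma \vdash \forall x\, A}\ (\forall R)
\qquad
\frac{\Gamma, A[x := y] \vdash C}{\Gamma, \exists x\, A \vdash C}\ (\exists L)
\qquad
\frac{\Gamma \vdash A[x := t]}{\Gamma \vdash \exists x\, A}\ (\exists R)
\]
```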
The cut rule is admissible in MILL1:
Using it, one can prove that the rules ⊗L, ⊸R, ∀R, and ∃L are invertible, i.e. that, if the conclusion of any of these rules is provable in MILL1, then so is its premise.
3 Hypergraph First-Order Categorial Grammars
The idea of extending the Lambek calculus and categorial grammars to hypergraphs was explored recently in [28], where the hypergraph Lambek calculus was introduced. This is a propositional logic whose formulae are built using two operators analogous to the linear logic connectives ⊸ and ⊗. Formulae of this calculus can be used as labels on hyperedges, so a formula may contain a hypergraph whose hyperedges are labeled by other formulae. Based on the hypergraph Lambek calculus, hypergraph Lambek grammars were defined and their properties were investigated. Although the definition of the hypergraph Lambek calculus is justified in [28], the syntax of this logic is somewhat cumbersome. We are going to show that one can use any first-order logic, such as ILL1, as the underlying logic for hypergraph categorial grammars. The definitions we propose below are much simpler than those from [28]; besides, they enable one to rely on the well-studied apparatus of linear logic.
Let us start with the definition of a string categorial grammar over a first-order logic. This notion appears in [20, 34] for MILL1, but we would like to start with a more general exposition. Let a first-order sequent calculus of interest be fixed, and let Fm denote the set of its formulas. Let s and t be two fixed variables (note that earlier we used them as selectors).
Definition 10.
A string grammar is a tuple consisting of a finite alphabet Σ, a distinguished formula S with FV(S) ⊆ {s, t}, and a finite binary relation ▷ ⊆ Σ × Fm such that a ▷ A implies FV(A) ⊆ {s, t}. The language generated by the grammar is defined as follows: a_1…a_n belongs to it if and only if there are formulas A_1, …, A_n such that a_i ▷ A_i for i = 1, …, n and such that the sequent A_1[s := x_0, t := x_1], …, A_n[s := x_{n−1}, t := x_n] ⊢ S[s := x_0, t := x_n] is derivable, where x_0, …, x_n are distinct variables.
Example 11.
Let , let , and let consist of the pairs , . This grammar accepts the string , because the sequent
is derivable in .
We see that, in Definition 10, the noncommutative structure of a string is simulated by variables. Informally, one could imagine a string graph with the nodes x_0, …, x_n such that, for each i, there is an edge labeled by a_i connecting x_{i−1} to x_i. Based on this observation, let us introduce the central notion of hypergraph first-order categorial grammars. From now on, we consider nodes, selectors and logical variables as objects of the same kind. Besides, if A is a typed alphabet, then we treat each label a ∈ A as a predicate symbol.
Definition 12.
Let us fix a variable and a symbol . A hypergraph grammar is a quadruple where is a -typed alphabet; is a finite binary relation such that implies and implies ; finally, is a formula of such that .
Definition 13.
The language is defined as follows: if and only if and there are functions , such that
-
1.
for , for ;
-
2.
the sequent is derivable in .
Example 14.
Let be a hypergraph grammar with (, ), , , and with consisting of the pairs
-
; ;
-
;
-
; .
Consider the hypergraph with (nodes are enumerated left to right, top to bottom). It belongs to , because the sequent
is derivable in . In this sequent, the first formula corresponds to the -labeled hyperedge, the second one corresponds to the -labeled hyperedge, the third formula corresponds to and the remaining formulae correspond to .
The string graph also belongs to , because the sequent
is derivable in (the nodes of from left to right are ).
Finally, the hypergraph without hyperedges and with nodes is also accepted by , because the following sequent is derivable in :
Let us comment on Definitions 12 and 13. The relation assigns formulae of the logic to hyperedge labels. Besides, it assigns to the distinguished "node symbol" formulae with the fixed free variable (or without free variables). Since, in the theory of hyperedge replacement, it is traditional to consider hypergraphs where only hyperedges are labeled, one might ask why we assign formulas not only to hyperedges but to nodes as well. The answer is that we need to have some control over nodes. If we removed all the parts concerning nodes from Definitions 12 and 13, then hypergraph grammars would completely ignore isolated nodes; this is a minor yet annoying issue. Moreover, each hypergraph language generated by a hypergraph grammar (for, say, MILL1) would be closed under node identification.
Example 15.
Assume that a string graph is accepted by a hypergraph grammar in which nodes do not participate in the grammar formalism. This would mean that there are formulae with free variables assigned to its edges such that the corresponding sequent is derivable. However, derivability is preserved when distinct variables are identified, hence the hypergraph obtained by identifying nodes is also accepted by the grammar. Consequently, hypergraph grammars without node-formula assignment are not able to generate a language consisting only of string graphs, which is quite undesirable.
Another remark concerning Definition 12 is why we need to fix a type in the grammar. This makes the language generated by a grammar consistent in terms of external nodes: all hypergraphs in the language have the same type. Note that ht-systems and hyperedge replacement grammars [8] are consistent in this sense.
Given a hypergraph grammar, one can consider only the string graphs generated by it and thus associate a string language with the grammar.
Definition 16.
The string language generated by a hypergraph grammar is the set .
One expects that, normally, string languages generated by hypergraph grammars should be the same as languages generated by string grammars. For example, for MILL1, the following proposition holds.
Proposition 17.
If a language is generated by a string grammar, then there is a hypergraph grammar such that . Conversely, if is a hypergraph grammar, then is generated by some string grammar.
This proposition is almost trivial; the only minor technicality is that hypergraph grammars assign formulae to nodes while string grammars are unable to do so. This technicality is why we need to exclude the empty word in the second part of the proposition. See the proof of this proposition in [30, Appendix A.1].
3.1 Hypergraph Transformation Rules as MILL1 Formulae
Our main goal now is to describe the relation between hypergraph first-order linear logic grammars and hypergraph transformation systems. We start by showing how a ht-rule is encoded by a formula. Let us fix a unary predicate N; informally, N(x) is understood as "x is a node". Given a hypergraph, let us treat its hyperedge labels as predicate symbols. Namely, for each label we assume that the selectors of its type are enumerated in a fixed order; the atom associated with a hyperedge applies the predicate given by its label to the attached nodes, listed in this order. The arity of a label a is thus the cardinality of type(a).
Definition 18.
The diagram of a hypergraph is the multiset
Let where is the list of nodes in .
Example 19.
Let be a hypergraph such that . Then
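As a self-contained illustration (the dictionary representation, the fixed ordering of selectors, and the inclusion of one atom per node via the node predicate, rendered here as N, are our assumptions), the multiset of atoms read off a hypergraph can be computed as follows:

```python
from collections import Counter

def diagram(H, node_pred="N"):
    """Multiset of atomic formulas of a hypergraph: one atom
    lab(e)(att(e)(s_1), ..., att(e)(s_k)) per hyperedge (selectors taken in
    a fixed order) plus, as we assume here, one atom N(v) per node."""
    atoms = Counter()
    for label, att in H["edges"]:
        atoms[f"{label}({', '.join(att[sel] for sel in sorted(att))})"] += 1
    for v in H["nodes"]:
        atoms[f"{node_pred}({v})"] += 1
    return atoms

H = {"nodes": ["x", "y"],
     "edges": [("E", {"s": "x", "t": "y"}), ("E", {"s": "y", "t": "y"})],
     "ext": {}}
D = diagram(H)
```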
Definition 20.
Given a ht-rule , let
where
-
is the list of nodes in ;
-
is a substitution function defined on as follows: .
The formula is closed. Note that is well defined because is injective.
Example 21.
Consider the ht-rule
Let be the nodes of the left-hand side hypergraph (from left to right) and let be the nodes of the right-hand side hypergraph. Then
The main lemma about is presented below.
Lemma 22.
Let be multisets of ht-rules and let be two hypergraphs with injective . The sequent
(1)
is derivable in MILL1 if and only if there exists a derivation of a hypergraph isomorphic to from which uses each rule from exactly once and may use rules from any number of times.
The proof of this lemma, given in [30, Appendix A.2], is quite technical; its idea is to perform proof search using focusing [1]. Using Lemma 22, we can prove the following theorem.
Theorem 23.
Hypergraph ILL1 grammars generate the class of all recursively enumerable hypergraph languages.
Proof.
Clearly, all languages generated by hypergraph ILL1 grammars are recursively enumerable. To prove the converse, according to Proposition 9, it suffices to show that any ht-system can be converted into a hypergraph ILL1 grammar generating the same language. Define the hypergraph grammar such that
-
1.
where for ;
-
2.
for and ;
-
3.
.
A hypergraph is accepted by if and only if the following sequent is derivable in :
Note that the antecedent of this sequent is the diagram of and that the succedent equals . By invertibility of the rules and , the above sequent is equiderivable with the following one:
By Lemma 22, derivability of the latter sequent is equivalent to the fact that the hypergraph is derivable from the start hypergraph using the rules of the ht-system, i.e. that it belongs to the language of the ht-system. Thus, the two languages coincide.
This result justifies the soundness of Definition 12: if hypergraph grammars based on ILL1, which includes the powerful exponential modality, were not Turing-complete, this would indicate that we do not have enough control in the grammar formalism. For example, if we did not include the node-formula assignment in Definition 12, then Theorem 23 would be false.
3.2 Hypergraph MILL1 Grammars
We proceed to investigating the expressive power of hypergraph MILL1 grammars. They are clearly less expressive than ht-systems because they generate only languages in NP. Nevertheless, it turns out that they are as powerful as ht-systems in which the length of a derivation of a hypergraph is bounded by a linear function of the size of the hypergraph.
Definition 24.
The size of a hypergraph is the number of nodes and hyperedges in it. A linear-time hypergraph transformation system is a ht-system for which there is a constant c (a time constant) such that each hypergraph H of its language has a derivation with at most c·|H| steps.
Adding a linear-time bound to formal grammars has a long history. Linear-time type-0 Chomsky grammars were studied in [2, 11]; linear-time one-tape Turing machines were studied in [35]; linear-time branching vector addition systems were studied in [29] in the context of commutative Lambek grammars. However, to the best of our knowledge, linear-time graph grammars have not appeared in the literature. Linear-time ht-systems may be considered as the hypergraph counterpart of the linear-time type-0 grammars studied in [2, 11], so they are quite a natural formalism. The main result concerning them is presented below.
Theorem 25.
Each linear-time ht-system can be converted into an equivalent hypergraph MILL1 grammar.
Proof of Theorem 25.
Let be a linear-time ht-system with the time constant . Define the hypergraph grammar such that
-
1.
where for ;
-
2.
for , , , and for ;
-
3.
for , and for .
A hypergraph is accepted by if and only if, for each (and for each ), there exist at most rules from , which we denote by (by resp.), such that the sequent
is derivable in . By invertibility of , this is equivalent to the fact that there exists a multiset of cardinality at most such that all its elements are from and such that the following sequent is derivable in :
By Lemma 22, this is equivalent to the fact that there is a derivation of a hypergraph isomorphic to from that uses each rule from exactly once. Existence of satisfying these properties is equivalent to the fact that there is a derivation of from of length at most , which is equivalent to . Thus, .

The question arises whether the converse holds as well. We are not going to address it in this article because of space-time limitations and also because we are mainly interested in lower bounds for the class of hypergraph MILL1 grammars. Still, we claim that it is possible to convert each hypergraph MILL1 grammar into a linear-time ht-system with non-injective rules, i.e. with rules whose external functions are allowed to be non-injective. This could be done by a straightforward (yet full of tiring technical details) encoding of inference rules by hypergraph transformations. We leave the proof for future work.
Speaking of upper bounds, we noted that all languages generated by hypergraph MILL1 grammars are in NP; besides, as we shall show in Theorem 30, there is an NP-complete language of string graphs generated by a hypergraph MILL1 grammar, so the NP upper bound is tight.
Let us further explore the properties of the class of languages generated by hypergraph MILL1 grammars. This class turns out to be closed under intersection.
Theorem 26.
Languages generated by hypergraph MILL1 grammars are closed under intersection.
Proof.
Let be two hypergraph grammars; we aim to construct a hypergraph grammar generating . Let us assume without loss of generality that ; also, let us assume that predicate symbols used by and are disjoint. Let us denote the set of subformulas of formulas occurring in by (). Note that, if , then (hypergraphs in these languages would have different types); thus, we can assume that (let us denote this set by ).
Define the grammar where and is the smallest relation such that holds whenever and for .
Lemma 27 (splitting lemma).
-
1.
If the sequent is derivable in such that and for some , then all are also from .
-
2.
The sequent such that and are from and , are from is derivable in if and only if the sequents and are derivable in .
Both statements are proved jointly by a straightforward induction on the length of a derivation.
A hypergraph is accepted by if and only if, for , there exist functions and such that for , for , and the following sequent is derivable in :
By invertibility of and the splitting lemma, this is equivalent to derivability of the sequents for , hence is equivalent to the fact that belongs to .

An analogous technique cannot be used for Lambek categorial grammars because the Lambek calculus is non-commutative, and the splitting lemma does not hold for it. In fact, since Lambek categorial grammars generate context-free languages [25], which are not closed under intersection, a similar result does not hold for Lambek categorial grammars.
4 Expressive Power of String MILL1 Grammars
Now, let us apply the results and techniques developed for hypergraph MILL1 grammars to describe the class of languages generated by string MILL1 grammars. First, Proposition 17 and Theorem 25 imply the following lower bound.
Corollary 28.
If is a linear-time ht-system, then is generated by a string MILL1 grammar.
Next, the following result can be proved in the same way as Theorem 26.
Theorem 29.
Languages generated by string MILL1 grammars are closed under intersection.
(Note that we cannot directly infer this theorem from Proposition 17 and Theorem 26 because of the empty string issue.) Since languages generated by string MILL1 grammars contain all context-free languages, they also contain their finite intersections, in particular, the language
It is simple to prove that languages generated by string MILL1 grammars are also closed under letter-to-letter homomorphisms, so the language can be generated by a string MILL1 grammar as well. Thus, string MILL1 grammars generate languages with non-semilinear Parikh images.
Note that, for Lambek categorial grammars, there is an imbalance between expressive power and algorithmic complexity. Namely, on the one hand, Lambek categorial grammars generate exactly the context-free languages, all of which are polynomially parsable; on the other hand, parsing in the Lambek calculus is an NP-complete problem [27]. This is not the case for string MILL1 grammars, as the following theorem shows.
Theorem 30.
String MILL1 grammars generate an NP-complete language.
Proof.
In view of Corollary 28, it suffices to present a linear-time ht-system generating an NP-complete language of string graphs. Let be nonterminal symbols such that , and ; let be terminal symbols with type . Let the start hypergraph be (“” is introduced in Definition 5). The rules are presented below.
1. ;
2. , for ;
3. ;
4. , for ;
5. ;
6. ;
7. .
This ht-system is linear-time because each production except rule 5 (which is applied at most once in any derivation) increases the number of terminal symbols. The ht-system generates hypergraphs of the form
(2)
such that are nonempty strings over the alphabet and such that coincides as a multiset with for some . Consequently, one can reduce the exact cover by 3-sets problem to this language: if is a set and is a collection of 3-element subsets of , then checking whether is the disjoint union of some sets from is equivalent to checking whether the hypergraph (2) is generated by the ht-system.
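To make the reduction target concrete: exact cover by 3-sets (X3C) asks whether a set can be written as a disjoint union of some of the given 3-element subsets. Below is a minimal brute-force decider for this problem; the function name is illustrative, and since X3C is NP-complete, the search is necessarily exponential.

```python
from itertools import combinations

def exact_cover_3sets(universe, triples):
    """Return True iff `universe` is the disjoint union of some of the
    given 3-element subsets (exact cover by 3-sets, X3C)."""
    universe = frozenset(universe)
    if len(universe) % 3 != 0:
        return False
    k = len(universe) // 3
    # Brute force: try every choice of k triples (exponential in general).
    for choice in combinations(triples, k):
        covered = set()
        for t in choice:
            if covered & set(t):       # overlap: not a disjoint union
                break
            covered |= set(t)
        else:
            if covered == universe:
                return True
    return False
```

For instance, {1,…,6} is exactly covered by {1,2,3} and {4,5,6}, but not by any selection from {1,2,3} and {3,4,5}, which overlap.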
Let us present another proof of Theorem 30 that relies on a result by Book [3]. Consider a string rewriting system where is a finite set of rules of the form ( are arbitrary strings) and is the start string. Recall that whenever (and no other pairs of strings are in the relation ); recall also that . Let us convert into the ht-system where the symbols from have the type and . (Let us assume that, in each rule , and are nonempty.) It is straightforward to check that . Finally, [3, Theorem 1] says that there is a linear-time string rewriting system that generates an NP-complete language. Thus, if one constructs from it and then applies Corollary 28, one obtains a string MILL1 grammar generating an NP-complete language.
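The one-step rewriting relation of such a string rewriting system can be sketched as follows; `successors` and `reachable` are illustrative names, not notation from the paper or from [3].

```python
def successors(s, rules):
    """All strings obtainable from s by one application of a rule (u, w),
    i.e. by rewriting one occurrence of u inside s to w."""
    result = set()
    for u, w in rules:
        start = 0
        while (i := s.find(u, start)) != -1:
            result.add(s[:i] + w + s[i + len(u):])
            start = i + 1   # keep scanning for further occurrences of u
    return result

def reachable(start, rules, max_steps):
    """Strings derivable from `start` in at most max_steps rewriting steps."""
    frontier, seen = {start}, {start}
    for _ in range(max_steps):
        frontier = {t for s in frontier for t in successors(s, rules)} - seen
        seen |= frontier
    return seen
```

With the rules S -> aSb and S -> ab, two steps from S already yield the string aabb, matching the usual context-free derivation of a^n b^n.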
One of the reviewers pointed out that string MILL1 grammars generating non-semilinear languages is too powerful for linguistic applications, where it is widely assumed that natural languages are semilinear. The reviewer asked whether restricting to the fragment in which each (free or bound) variable occurs in each formula at most twice would make the languages generated by the corresponding class of grammars semilinear. I am afraid that, even with this restriction, string MILL1 grammars generate some non-semilinear and NP-complete languages. Let us consider the above construction involving string rewriting systems. If, for example, is a string rewriting rule, then
In a formula of the form , each variable occurs at most four times. However, we claim that one could remove the predicates everywhere, which would result in each variable occurring in each formula at most twice. We also claim that Lemma 22 would remain true without predicates in formulas in the case where we consider only string graphs. Thus, we can use Book's result from [3] to generate an NP-complete and a non-semilinear language by a grammar over the restricted fragment of . We do not provide formal proofs but leave them for future work.
5 Hypergraph Language Semantics
Now, let us proceed to model-theoretic investigations into MILL1. Our objective is to generalise language models for the Lambek calculus to MILL1 and to devise hypergraph language models. This will enable one to regard MILL1 as a logic for reasoning about hypergraph resources. The most important question is how the composition of such resources should be defined: given two hypergraphs, how should one understand their multiplicative product?
Language semantics for the Lambek calculus is, algebraically speaking, a mapping of formulae into subsets of a free semigroup of words which interprets the product of two formulae as the elementwise product of their interpretations. What is the hypergraph counterpart of free semigroups? In the field of hyperedge replacement, there are algebras of hypergraphs called HR-algebras [6, 7, 31]. They include the parallel composition operation and source-manipulating operations. Parallel composition is a way of gluing hypergraphs, defined as follows.
Definition 31.
Let and be hypergraphs. Let be the smallest equivalence relation on such that for . Then, parallel composition is the hypergraph such that ; , ; , for ; for .
Informally, is obtained by taking the disjoint union of and and fusing with for . This operation is illustrated in Figure 3. Note that parallel composition is associative and commutative.
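Under the assumption that a hypergraph is represented by its node set, its labelled hyperedges, and a selector-indexed family of external nodes, parallel composition can be sketched as a disjoint union followed by a union-find fusion of the external nodes with a common selector. All names below are illustrative, not the paper's notation.

```python
class Hypergraph:
    """Toy hypergraph: a node set, hyperedges as (label, attachment tuple)
    pairs, and external nodes indexed by selector."""
    def __init__(self, nodes, edges, ext):
        self.nodes = set(nodes)
        self.edges = list(edges)
        self.ext = dict(ext)

def parallel_composition(g, h):
    """Disjoint union of g and h in which external nodes carrying the
    same selector are fused."""
    parent = {}
    def find(x):                      # union-find with path halving
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x
    # Tag nodes with their graph of origin to make the union disjoint,
    # then fuse the external nodes that share a selector.
    for sel in g.ext.keys() & h.ext.keys():
        parent[find(('g', g.ext[sel]))] = find(('h', h.ext[sel]))
    tagged = [('g', g), ('h', h)]
    nodes = {find((t, v)) for t, G in tagged for v in G.nodes}
    edges = [(lab, tuple(find((t, v)) for v in att))
             for t, G in tagged for lab, att in G.edges]
    ext = {sel: find((t, v)) for t, G in tagged for sel, v in G.ext.items()}
    return Hypergraph(nodes, edges, ext)
```

Composing two single-edge hypergraphs whose attachment nodes are all external yields a hypergraph with two parallel hyperedges on the same (fused) nodes, which is the picture behind the name "parallel composition".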
Other operations used in HR-algebras allow one to reassign selectors to external nodes, to make an external node non-external, or to fuse some external nodes. We shall use an operation that unifies all three manipulations; we call it substitution.
Definition 32.
Given a hypergraph and a partial function , let be the smallest equivalence relation such that whenever . Then is the hypergraph such that ; ; ; , for ; .
One can compare the substitution operation with the operations of source renaming and source fusion from [31] and verify that the former is interdefinable with the latter (in the presence of parallel composition). Hence, one can use and as the basic operations of HR-algebras. Conveniently, exactly these operations can also be used to define hypergraph language models, which we do now. Let be the empty hypergraph. From now on, (i.e. selectors and variables are the same objects).
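Substitution along a partial selector map can be sketched in the same toy representation (a dict from selectors to external nodes). This is an illustrative reading of Definition 32, not the paper's exact formalisation: external nodes whose selectors are mapped to the same selector are fused, and selectors outside the domain of the map are dropped, making their nodes internal.

```python
from collections import namedtuple

# Toy representation: ext maps selectors to external nodes.
Hypergraph = namedtuple('Hypergraph', 'nodes edges ext')

def substitute(g, f):
    """Apply a partial selector map f (a dict on selectors) to g."""
    parent = {}
    def find(x):                      # union-find with path halving
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x
    # Group external nodes by their image under f and fuse each group.
    groups = {}
    for sel, v in g.ext.items():
        if sel in f:
            groups.setdefault(f[sel], []).append(v)
    for vs in groups.values():
        for v in vs[1:]:
            parent[find(vs[0])] = find(v)
    nodes = {find(v) for v in g.nodes}
    edges = [(lab, tuple(find(v) for v in att)) for lab, att in g.edges]
    ext = {sel: find(vs[0]) for sel, vs in groups.items()}
    return Hypergraph(nodes, edges, ext)
```

In one call this covers all three manipulations mentioned above: a renaming (f injective), making a node non-external (a selector outside dom(f)), and a fusion (two selectors with the same image).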
Definition 33.
A hypergraph language model is a pair where is a -typed alphabet and is a function mapping formulas of MILL1 to sets of abstract hypergraphs over which satisfies the following conditions:
1. for any total function ;
2. ;
3. ;
4. ;
5. .
A sequent is true in this model if (for , if ).
This semantics can be viewed as an instance of intuitionistic phase semantics [15] with the commutative monoid and with the trivial closure operator .
The first property in Definition 33 relates substitution as a logical operation to substitution as a hypergraph transformation; without it, the two notions would be unrelated, which is undesirable. Besides, this property is used in the soundness proof. Note that, if is a bijection, then (apply property 1 twice). Quantifiers are interpreted as additive conjunction and disjunction, which reflects their behaviour correctly.
Lemma 34.
1. if and only if .
2. If , then for any substitution .
Proof.
1. Trivially follows from the definition of a model.
2.
The main result is soundness of MILL1 and completeness of its universal-implicative fragment w.r.t. hypergraph language models.
Theorem 35.
1. MILL1 is sound w.r.t. hypergraph language models.
2. The universal-implicative fragment of MILL1 is complete w.r.t. hypergraph language models.
Proof.
Soundness can be checked straightforwardly by showing that the conclusion of each rule with true premises is also true. The only nontrivial cases are the rules and . Assume that where does not occur freely in is true in a model . Without loss of generality, let us assume that is empty (otherwise, we can move it to the succedent using Lemma 34). Under this assumption, we are given that . By Lemma 34, for each , . This proves that . The case is dealt with similarly.
Completeness is proved by constructing a canonical model. Before doing so, let us introduce a new notion used in the construction.
Definition 36.
Assume that a countable sequence of variables $x_1, x_2, \ldots$ is fixed. Given a formula , let us read it from left to right and replace the $i$-th occurrence of a free variable from the left by $x_i$. We denote the resulting formula by .
For example, . We shall use formulas of the form as labels of hyperedges in the canonical model with . So, . The idea is that, in the canonical model, variables are represented by nodes, while hyperedge labels do not carry information about variables ( are, informally, placeholders for variables).
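At the token level, the renaming of Definition 36 can be sketched as follows, assuming the fixed variable sequence is x1, x2, … and that the formula has already been tokenised; bound variables are handled simply by excluding them from `free_vars`. The function name is illustrative.

```python
def normalise(tokens, free_vars):
    """Replace the i-th occurrence (counting from the left) of any free
    variable by the placeholder 'x<i>' -- a token-level sketch of the
    renaming in Definition 36."""
    out, i = [], 0
    for t in tokens:
        if t in free_vars:
            i += 1
            out.append('x%d' % i)
        else:
            out.append(t)
    return out
```

For instance, the token sequence of p(y, z, y) becomes p(x1, x2, x3): every occurrence gets its own placeholder, so the resulting label carries no information about which occurrences shared a variable, exactly as intended for hyperedge labels in the canonical model.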
Given a sequent where () and given two finite sets of variables such that , let us define the hypergraph as follows: ; ; for where is a function such that ; ; for (and undefined otherwise).
Let . The proof that is a hypergraph language model is quite technical; it can be found in [30, Appendix A.4]. Now, assume that is true in this model. Then, , i.e. , and is derivable in . By invertibility of , the sequent is derivable.
6 Conclusion
As we have shown, first-order intuitionistic linear logic does have strong connections to hypergraph grammar theory, namely, to hypergraph transformation systems and to HR-algebras. The notion of hypergraph first-order categorial grammars naturally and simply extends the concept of Lambek categorial grammars to hypergraphs. Developing hypergraph MILL1 grammars and relating them to hypergraph transformation systems gave us useful insights into the expressive power of string MILL1 grammars. In turn, the notion of a hypergraph language model revealed a previously unknown connection of first-order intuitionistic linear logic to the apparatus of HR-algebras, the latter having been studied mainly in the context of monadic second-order definability.
Two questions remain open for future work. The first is whether the converse of Theorem 25 holds; more generally, it is desirable to characterise hypergraph MILL1 grammars precisely in terms of hypergraph transformation systems. The second is whether MILL1 is complete w.r.t. hypergraph language models.
References
- [1] Jean-Marc Andreoli. Logic programming with focusing proofs in linear logic. Journal of Logic and Computation, 2(3):297–347, 1992. doi:10.1093/logcom/2.3.297.
- [2] Ronald V. Book. Time-bounded grammars and their languages. Journal of Computer and System Sciences, 5(4):397–429, 1971. doi:10.1016/S0022-0000(71)80025-9.
- [3] Ronald V. Book. On the complexity of formal grammars. Acta Informatica, 9:171–181, 1978. doi:10.1007/BF00289076.
- [4] Wojciech Buszkowski. Compatibility of a categorial grammar with an associated category system. Mathematical Logic Quarterly, 28(14-18):229–238, 1982. doi:10.1002/malq.19820281407.
- [5] Wojciech Buszkowski. Type Logics in Grammar, pages 337–382. Springer Netherlands, Dordrecht, 2003. doi:10.1007/978-94-017-3598-8_12.
- [6] Bruno Courcelle. The monadic second-order logic of graphs. I. Recognizable sets of finite graphs. Information and Computation, 85(1):12–75, 1990. doi:10.1016/0890-5401(90)90043-H.
- [7] Bruno Courcelle and Joost Engelfriet. Graph Structure and Monadic Second-Order Logic: A Language-Theoretic Approach. Encyclopedia of Mathematics and its Applications. Cambridge University Press, 2012.
- [8] Frank Drewes, Hans-Jörg Kreowski, and Annegret Habel. Hyperedge replacement graph grammars. In Grzegorz Rozenberg, editor, Handbook of Graph Grammars and Computing by Graph Transformations, Volume 1: Foundations, pages 95–162. World Scientific, 1997. doi:10.1142/9789812384720_0002.
- [9] Joost Engelfriet. Context-free graph grammars. In Grzegorz Rozenberg and Arto Salomaa, editors, Handbook of Formal Languages, Volume 3: Beyond Words, pages 125–213. Springer, 1997. doi:10.1007/978-3-642-59126-6_3.
- [10] Jean-Yves Girard. Linear logic: A survey. In Friedrich L. Bauer, Wilfried Brauer, and Helmut Schwichtenberg, editors, Logic and Algebra of Specification, pages 63–112, Berlin, Heidelberg, 1993. Springer Berlin Heidelberg. doi:10.1007/978-3-642-58041-3_3.
- [11] Aleksei Gladkii. On complexity of inference in phase-structure grammars. Algebra i Logika. Sem. (in Russian), 3(5-6):29–44, 1964.
- [12] Annegret Habel. Hyperedge Replacement: Grammars and Languages, volume 643 of Lecture Notes in Computer Science. Springer, 1992. doi:10.1007/BFB0013875.
- [13] Annegret Habel, Jürgen Müller, and Detlef Plump. Double-pushout graph transformation revisited. Mathematical Structures in Computer Science, 11(5):637–688, 2001. doi:10.1017/S0960129501003425.
- [14] Laura Kallmeyer. Parsing Beyond Context-Free Grammars. Springer Berlin Heidelberg, 2010. doi:10.1007/978-3-642-14846-0.
- [15] Max I. Kanovich, Mitsuhiro Okada, and Kazushige Terui. Intuitionistic phase semantics is almost classical. Mathematical. Structures in Comp. Sci., 16(1):67–86, 2006. doi:10.1017/S0960129505005062.
- [16] Yuichi Komori. Predicate logics without the structure rules. Studia Logica, 45(4):393–404, 1986. doi:10.1007/bf00370272.
- [17] Barbara König, Dennis Nolte, Julia Padberg, and Arend Rensink. A tutorial on graph transformation. In Reiko Heckel and Gabriele Taentzer, editors, Graph Transformation, Specifications, and Nets - In Memory of Hartmut Ehrig, volume 10800 of Lecture Notes in Computer Science, pages 83–104. Springer, 2018. doi:10.1007/978-3-319-75396-6_5.
- [18] Joachim Lambek. The mathematics of sentence structure. The American Mathematical Monthly, 65(3):154–170, 1958. doi:10.1080/00029890.1958.11989160.
- [19] Michael Moortgat. Multimodal linguistic inference. Journal of Logic, Language and Information, 5(3–4):349–385, 1996. doi:10.1007/bf00159344.
- [20] Richard Moot. Extended Lambek calculi and first-order linear logic. In Claudia Casadio, Bob Coecke, Michael Moortgat, and Philip J. Scott, editors, Categories and Types in Logic, Language, and Physics - Essays Dedicated to Jim Lambek on the Occasion of His 90th Birthday, volume 8222 of Lecture Notes in Computer Science, pages 297–330. Springer, 2014. doi:10.1007/978-3-642-54789-8_17.
- [21] Richard Moot. Hybrid type-logical grammars, first-order linear logic and the descriptive inadequacy of lambda grammars, 2014. arXiv:1405.6678.
- [22] Richard Moot and Mario Piazza. Linguistic applications of first order intuitionistic linear logic. Journal of Logic, Language and Information, 10(2):211–232, 2001. doi:10.1023/a:1008399708659.
- [23] Richard Moot and Christian Retoré. The Logic of Categorial Grammars - A Deductive Account of Natural Language Syntax and Semantics, volume 6850 of Lecture Notes in Computer Science. Springer, 2012. doi:10.1007/978-3-642-31555-8.
- [24] Glyn Morrill, Oriol Valentín, and Mario Fadda. The displacement calculus. Journal of Logic, Language and Information, 20(1):1–48, 2010. doi:10.1007/s10849-010-9129-2.
- [25] Mati Pentus. Lambek grammars are context free. In Proceedings Eighth Annual IEEE Symposium on Logic in Computer Science, pages 429–433, 1993. doi:10.1109/LICS.1993.287565.
- [26] Mati Pentus. Models for the Lambek calculus. Ann. Pure Appl. Log., 75(1-2):179–213, 1995. doi:10.1016/0168-0072(94)00063-9.
- [27] Mati Pentus. Lambek calculus is NP-complete. Theoretical Computer Science, 357(1):186–201, 2006. Clifford Lectures and the Mathematical Foundations of Programming Semantics. doi:10.1016/j.tcs.2006.03.018.
- [28] Tikhon Pshenitsyn. Hypergraph Lambek grammars. Journal of Logical and Algebraic Methods in Programming, 129:100798, 2022. doi:10.1016/j.jlamp.2022.100798.
- [29] Tikhon Pshenitsyn. Commutative Lambek grammars. Journal of Logic, Language and Information, 32(5):887–936, 2023. doi:10.1007/s10849-023-09407-z.
- [30] Tikhon Pshenitsyn. First-order intuitionistic linear logic and hypergraph languages, 2025. doi:10.48550/arXiv.2502.05816.
- [31] Grzegorz Rozenberg. Handbook of Graph Grammars and Computing by Graph Transformation. World Scientific, 1997. doi:10.1142/3303.
- [32] Sylvain Salvati. A note on the complexity of abstract categorial grammars. In Christian Ebert, Gerhard Jäger, and Jens Michaelis, editors, The Mathematics of Language, pages 266–271, Berlin, Heidelberg, 2010. Springer Berlin Heidelberg.
- [33] Harold Schellinx. Some syntactical observations on linear logic. Journal of Logic and Computation, 1(4):537–559, September 1991. doi:10.1093/logcom/1.4.537.
- [34] Sergey Slavnov. Making first order linear logic a generating grammar. Logical Methods in Computer Science, Volume 19, Issue 4, 2023. doi:10.46298/lmcs-19(4:11)2023.
- [35] Kohtaro Tadaki, Tomoyuki Yamakami, and Jack C.H. Lin. Theory of one-tape linear-time Turing machines. Theoretical Computer Science, 411(1):22–43, 2010. doi:10.1016/j.tcs.2009.08.031.
- [36] Tadahiro Uesu. A system of graph grammars which generates all recursively enumerable sets of labelled graphs. Tsukuba Journal of Mathematics, 2:11–26, 1978. doi:10.21099/tkbjm/1496158502.
- [37] Johan van Benthem. Language in Action: Categories, Lambdas and Dynamic Logic. MIT Press, 1995.