A Canonical Form for Universe Levels in Impredicative Type Theory

Géran, Yoan

doi:10.4230/LIPIcs.CSL.2026.39

A Canonical Form for Universe Levels in Impredicative Type Theory

Yoan Géran

Mines Paris PSL, Centre de Recherche en Informatique, France
Université Paris-Saclay, Laboratoire Méthodes Formelles, ENS Paris-Saclay, France
Lab. ICube UMR 7357 CNRS Université de Strasbourg, France

Abstract

The 0-imax-successor algebra, where $\operatorname{imax}\colon\mathbb{N}\times\mathbb{N}\to\mathbb{N}$ is the function defined by $\operatorname{imax}(n,0)=0$ and $\operatorname{imax}(n,S(m))=\operatorname*{max}(n,S(m))$ , is used to represent universe levels in impredicative type theory, in particular with universe polymorphism which introduces level variables, so it is present in proof systems such as Rocq and Lean. In particular, we need to know when two elements of this algebra are equivalent, and we may also want to decide the inequality. In this article, we introduce a canonical form for the terms of this algebra, and we provide a canonization algorithm. It permits deciding level equivalence by checking the canonical form equality, and also permits easily checking if a level is smaller than another one.

Keywords and phrases:

universe levels, canonical form, impredicativity, imax algebra

Copyright and License:

2012 ACM Subject Classification:

Theory of computation

\rightarrow

Type theory ; Theory of computation

\rightarrow

Equational logic and rewriting

Supplementary Material:

Software (Rock formalisation): https://gitlab.crans.org/geran/level_formalisation
archived at

swh:1:dir:79df183835f26e788d568780b46e800bf0d0cfc6

Acknowledgements:

I want to thank my PhD advisors Olivier Hermant and Gilles Dowek for the helpful discussions and comments.

DOI:

10.4230/LIPIcs.CSL.2026.39

Event:

34th EACSL Annual Conference on Computer Science Logic (CSL 2026)

Editors:

Stefano Guerrini and Barbara König

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

The formalization of mathematical theorems and the verification of software lead to the development of many logical systems. Predicate Logic is a quite general theory but does not allow for instance to quantify over predicates, preventing the expression of some propositions. Then, more expressive logic have been introduced through the years. This paper being motivated by the universe polymorphism in impredicative type theory, the introduction will briefly remind the history of these theories, to understand what they bring.

Pure Type Systems

A lot of type theories are based on extensions of Church’s simply-typed $\lambda$ -calculus which does not permit expressing terms over arbitrary types (preventing for instance to talk about all the groups). To address this, Martin-Löf introduced a dependent type theory with a type of all types [25], and later, to avoid paradoxes such as Girard’s one [12, 20], introduced a distinction between small types and large types (which are types containing types, and are also called universes) [27].

In the same years, Girard and Reynolds independently invented System F, an extension of Church’s simply-typed $\lambda$ -calculus with type polymorphism, and even later, Girard presented System $F_{\omega}$ which adds type operators i.e. the ability to quantify over terms to create types.

The Calculus of Constructions [13] introduced by Coquand in his PhD thesis combined features from both Martin-Löf Type Theory and System $F_{\omega}$ . This system allows quantifying on types or terms to build new types and new terms, and it is the pinnacle of the $\lambda$ -cube of Barendregt [4], which classifies type systems depending on the quantification possibilities.

The Calculus of Constructions is an elegant system with strong properties such as normalization and logical consistency. However, quantification over Type is not possible since it makes the system incoherent. This led Coquand to generalize the system with a predicative hierarchy of universes [12], in the same way as predicative Martin-Löf type theory [26]. This new system, which we will call $\text{CC}^{\infty}$ , contains a countable sequence of universes $\text{Type}_{0}\colon\text{Type}_{1}\colon\cdots$ , where $\text{Type}_{0}$ is the universe of the propositions, the indices being referred to as universes levels.

These logical systems are generalized under the name of Pure Type Systems [5, 6].

Definition 1.

A Pure Type System (PTS) is defined by a set of sorts $\mathcal{S}$ (that corresponds to universes), a set of axioms $\mathcal{A}\subseteq\mathcal{S}^{2}$ and a set of rules $\mathcal{R}\subseteq\mathcal{S}^{3}$ .

$\mathcal{A}$ describes the sorts typing ( $s_{1}$ has the type $s_{2}$ when $(s_{1},s_{2})\in\mathcal{A}$ ), and $\mathcal{R}$ describes the possible quantifications and their typing rules. The terms are the following, where $s\in\mathcal{S}$ and $x$ ranges an infinite set of variables.

t\Coloneqq s\mid x\mid\Pi\,x\colon t\cdot t\mid\lambda\,x\colon t\cdot t\mid t% \,t.

Both $\text{CC}^{\infty}$ and predicative Martin-Löf type theory have a set of sorts indexed over the natural numbers, with for all $i\in\mathbb{N}$ , $\text{Type}_{i}\colon\text{Type}_{i+1}$ . Their difference resides in their set of rules. These rules permit to type products, so we recall the typing rule for products.

Impredicativity

With the aim of building a consistent system, paradoxes such as Girard’s one should be avoided. When Coquand analysed it, he found that a product from Type to Type could not live in Type: it should live in a greater type (hence the distinction between small and large types).

With an infinite hierarchy of universes, this principle remains: a product from $\text{Type}_{i}$ to $\text{Type}_{j}$ should live in a greater universe. Therefore, in the predicative Martin-Löf type theory, the set of rules is $\left\{\text{Type}_{i},\text{Type}_{j},\text{Type}_{\operatorname*{max}\left(i% ,j\right)}\right\}$ . The choice of $\text{CC}^{\infty}$ is different (and does not break the consistency either): a product from $\text{Type}_{i}$ to $\text{Type}_{0}$ lives in $\text{Type}_{0}$ , so it follows the rules $\left\{\text{Type}_{i},\text{Type}_{j},\text{Type}_{\operatorname{imax}\left(i% ,j\right)}\right\}$ where $\operatorname{imax}\colon\mathbb{N}\to\mathbb{N}\to\mathbb{N}$ is defined for all $i,j\in\mathbb{N}$ by $\operatorname{imax}\left(i,0\right)=0$ and $\operatorname{imax}\left(i,j+1\right)=\operatorname*{max}\left(i,j+1\right)$ .

This corresponds to the so-called impredicativity of Prop (hence the name $\operatorname{imax}$ for impredicative max) which notably permits to say that we can quantify over all the propositions and still get a new proposition. Accepting or rejecting impredicativity is a philosophical questioning: some consider that $\Pi\,P\colon\text{Prop}\cdot P\to P$ should be a proposition (which is of course true), while others think that a proposition cannot be created by quantifying over all the propositions.

Universe Polymorphism

A PTS can be enriched with universe polymorphism which allows the user to quantify over universes [28, 23, 14]. For instance, it permits declaring simultaneously the identity for all the types of any universes with $\lambda\,s\colon\mathcal{S}\cdot\lambda\,A\colon s\cdot\lambda\,x\colon A\cdot x$ . This feature adds universe variables to the language of a PTS. In the case of $\text{CC}^{\infty}$ , it is equivalent to extend the syntax of the levels with level variables.

Definition 2 (Levels).

A level is a term of the grammar

\ell\coloneqq 0\mid S(\ell)\mid\operatorname*{max}\left(\ell,\ell\right)\mid% \operatorname{imax}\left(\ell,\ell\right)\mid x

where $x$ is an element of a countable set of variables $\mathbf{X}$ . We denote by $\bm{\mathrm{L}}$ the set of levels, and we say that a level is ground if it does not contain any variable.

Definition 3.

We call $\text{CC}^{\infty}_{\forall}$ the extension of $\text{CC}^{\infty}$ with prenex universe polymorphism.

The universe polymorphic identity of $\text{CC}^{\infty}_{\forall}$ is the term

\operatorname{id}\Coloneqq\lambda\,i\colon\bm{\mathrm{L}}\cdot\lambda\,A\colon% \text{Type}_{i}\cdot\lambda\,x\colon A\cdot x.

We can use it by instantiating the level variable. For instance, $\operatorname{id}\,1\,\text{Prop}$ is the identity of Prop while $\operatorname{id}\,2\,\text{Type}_{1}$ is $\text{Type}_{1}$ ’s one. This instantiation is done throughout substitution functions, which replace a level variable by a level, and valuation functions which replace level variables by integers.

Definition 4 (Valuation).

A function $\sigma\colon\mathbf{X}\to\mathbb{N}$ is called a valuation. For all valuations $\sigma$ , we define inductively the value of a level $\ell$ over $\sigma$ , denoted as $\left\llbracket\ell\right\rrbracket_{\sigma}$ , with

	$\displaystyle\left\llbracket 0\right\rrbracket_{\sigma}=0\qquad\left\llbracket S% (\ell)\right\rrbracket_{\sigma}=S(\left\llbracket\ell\right\rrbracket_{\sigma}% )\qquad\left\llbracket x\right\rrbracket_{\sigma}=\sigma(x)$
	$\displaystyle\left\llbracket\operatorname{max}\left(\ell_{1},\ell_{2}\right)% \right\rrbracket_{\sigma}=\operatorname{max}\left(\vphantom{\mathord{\big(}}% \left\llbracket\ell_{1}\right\rrbracket_{\sigma},\left\llbracket\ell_{2}\right% \rrbracket_{\sigma}\right)\qquad\left\llbracket\operatorname{imax}\left(\ell_{% 1},\ell_{2}\right)\right\rrbracket_{\sigma}=\operatorname{imax}\left(\vphantom% {\mathord{\big(}}\left\llbracket\ell_{1}\right\rrbracket_{\sigma},\left% \llbracket\ell_{2}\right\rrbracket_{\sigma}\right)$

This interpretation through the valuations explains why, even if levels are abstract terms, we defined them with the same symbols $0$ , $s$ , $\operatorname*{max}$ and $\operatorname{imax}$ that are used for the natural numbers. Indeed, the ground levels can clearly be identified as the natural numbers and the levels’ semantic, through the valuations, justifies to use the same symbol and permits to see the valuations as functions that realise levels, turning them into ground ones.

Besides, two levels can also be compared using these valuations. They are equivalent if they give the same ground levels through any valuation.

Definition 5 (Level comparison).

Let $\ell_{1},\ell_{2}\in\bm{\mathrm{L}}$ . We say that $\ell_{1}\leqslant\ell_{2}$ if for all valuations $\sigma$ , $\left\llbracket\ell_{1}\right\rrbracket_{\sigma}\leqslant\left\llbracket\ell_{% 2}\right\rrbracket_{\sigma}$ . In the same way, we say that $\ell_{1}\equiv\ell_{2}$ if for all valuations $\sigma$ , $\left\llbracket\ell_{1}\right\rrbracket_{\sigma}=\left\llbracket\ell_{2}\right% \rrbracket_{\sigma}$ . Hence, $\ell_{1}\equiv\ell_{2}$ if and only if $\ell_{1}\leqslant\ell_{2}$ and $\ell_{2}\leqslant\ell_{1}$ . And we say that $\ell_{1}$ and $\ell_{2}$ are incomparable if neither $\ell_{1}\leqslant\ell_{2}$ nor $\ell_{2}\leqslant\ell_{1}$ .

This equivalence shows that universes such as $\text{Type}_{x}$ and $\text{Type}_{\operatorname*{max}\left(x,x\right)}$ should be identified. However, it is not obvious to check. It is not syntactic, like it was without universe polymorphism, and the $\operatorname{imax}$ function makes it complicated.

The aim of this paper is to address this problem. Since it is a word problem, a direct solution consists in finding a canonical form and an algorithm which computes the canonical form of a given term. The level equivalence is then checked by syntactic comparison of the canonical form. We follow this path. To do this, we study the 0- $\operatorname{imax}$ -successor algebra, and we provide a canonical form for its terms. Besides, this equivalence is decidable by a reduction to Presburger arithmetic, and so we provide an additional motivation to our work.

Motivation

Our main motivation lies in the interoperability between proof systems. Indeed, it became a big challenge in the research on proof-checking, which aims to avoid the redevelopment of the same proof. Instead of developing translators from each system to another one, logical frameworks propose to define theories in a common language, which makes translation easier.

The $\lambda\Pi$ -calculus modulo rewriting ( $\lambda\Pi/\equiv$ ) [10] is a logical framework that extends $\lambda\Pi$ (the simply-typed $\lambda$ -calculus with dependent types) with higher-order rewrite rules [16, 29] that can be used to define functions, but also types; terms are then identified modulo $\beta$ and these rewrite rules. The computational part of the type theories can then be represented using the expressiveness of rewrite systems.

The Calculus of Constructions and its subtheories can be expressed in $\lambda\Pi/\equiv$ [8], and, in [15], Cousineau and Dowek showed how to express some PTS. Therefore, several systems have been encoded in $\lambda\Pi/\equiv$ : HOL-Light [30, 1], Agda [19], Matita [1], but also parts of Rocq [18, 9]. Besides, since there exist multiple implementations of $\lambda\Pi/\equiv$ such as Dedukti [2], Lambdapi [24], or Kontroli [17], these embeddings have been implemented, leading to effective translations [30, 21, 22].

To define $\text{CC}^{\infty}_{\forall}$ in $\lambda\Pi/\equiv$ , we need to define its levels. It can be done with a type nat together with functions max, and imax, and rewrite rules to define them. This permits to express $\text{CC}^{\infty}$ in $\lambda\Pi/\equiv$ , but the equivalence relation that comes with level variables adds some difficulties.

Indeed, for all term $u$ of $\text{CC}^{\infty}_{\forall}$ , let us note $\left|u\right|$ its translation in $\lambda\Pi/\equiv$ , and let us consider a function $f\colon\text{Type}_{i}\to\text{Type}_{j}$ and a term $t\colon\text{Type}_{k}$ where $k\equiv i$ . Since $f\,t$ is well-typed, then $\left|f\,t\right|$ should be well-typed in $\lambda\Pi/\equiv$ . Therefore, $\left|t\right|$ should have the type $\left|\text{Type}_{i}\right|$ , whereas it has the type $\left|\text{Type}_{k}\right|$ . We deduce that $\left|\text{Type}_{i}\right|$ and $\left|\text{Type}_{k}\right|$ should be convertible types, and then that equivalent levels should be convertible terms.

Related Work

The 0- $\operatorname*{max}$ -successor algebra is well-studied, and so, some solutions exist in the predicative case. In [31], Voevodsky represented each level as $\operatorname*{max}\left(n,n_{1}+x_{1},\ldots,n_{k}+x_{k}\right)$ where $n\geqslant\operatorname*{max}\left(n_{1},\ldots,n_{k}\right)$ . Then, if there exists $i\neq j$ such that $x_{j}=x_{i}$ , we simplify the term and keep only $\operatorname*{max}\left(n_{i},n_{j}\right)+x_{i}$ . Therefore, we obtain a minimal representation for the 0- $\operatorname*{max}$ -successor algebra.

In [19], Genestier encoded the universe polymorphism of Agda in $\lambda\Pi/\equiv$ using a similar idea and a representation modulo associativity and commutativity (for the $\operatorname*{max}$ symbol). Blanqui gave another presentation of this algebra in [7], with an encoding without matching modulo associativity and commutativity.

The 0- $\operatorname{imax}$ -successor algebra is less studied. An encoding is proposed in [3], but it does not fully reflect the equalities; for instance, the levels $\operatorname*{max}\left(\operatorname{imax}\left(x,y\right),x\right)$ and $\operatorname*{max}\left(x,y\right)$ are not convertible. Besides, Férey also worked on the encoding of universe polymorphism [18].

Finally, an algorithm to check level inequality, and so level equivalence, is presented in [11], but it does not rely on a canonical form.

Outline and Contributions

We use the same idea presented above in the predicative case: find a subset $E$ of levels such that any level can be represented as $\operatorname*{max}U$ with $U\subset E$ , and such that $\operatorname*{max}U$ is a minimal representation that ensures uniqueness property for any other minimal representation $\operatorname*{max}V$ :

\displaystyle\operatorname*{max}U\equiv\operatorname*{max}V\iff U=V.

In the predicative case, $E=\mathbb{N}\cup\left\{n+x\mathrel{}\middle|\mathrel{}n\in\mathbb{N},x\in% \mathbf{X}\right\}$ ; and the minimal representation consists in having one term $n$ (the maximum of two integers can be simplified) and for all $x\in\mathbf{X}$ at most one term $n+x$ since $\operatorname*{max}\left(n+x,m+x\right)=\operatorname*{max}\left(n,m\right)+x$ . To obtain the canonical representation, we push the successor symbols inside the $\operatorname*{max}$ , and we obtain $\operatorname*{max}U$ with $U\subset E$ . Then, $U$ can be simplified by removing $u$ if there exists $v\in U$ such that $v\neq u$ and $u\leqslant v$ , leading to the minimal representation.

This gives us the intuition that we need. The elements of the subset should be very basic and simple in the sense that it is not equivalent to a maximum of other levels.

Section 2 studies the algebra and introduces $\bm{\mathrm{S_{C}}}$ , a set of basic terms, called canonical sublevels. It turns out that these terms do not have the same expressive power than levels, hence we extend $\bm{\mathrm{L}}$ into $\bm{\mathrm{E}}$ , the extended levels. But, each level can be represented as a maximum of elements of $\bm{\mathrm{S_{C}}}$ ; such a maximum is a representation, they will form the set of representations $\bm{\mathrm{R}}$ .

Then, in Section 3 we introduce $\bm{\mathrm{R_{C}}}$ , the set of minimal representations which are representations whose elements are incomparable, and we show that any representation has a unique minimal representation.

Moreover, we will see that $\bm{\mathrm{R_{C}}}$ is not stable by level substitution, which can generate extended levels. But, in order to have a correct translation of substitution in $\lambda\Pi/\equiv$ , we should be able to identify $\left|u\left\{x\coloneq v\right\}\right|$ with $\left|u\right|\left\{x\coloneq\left|v\right|\right\}$ . That is why we generalize the canonical form to the extended levels in Section 4, showing that all extended levels have a minimal representation: its canonical form. Then , we present the canonization algorithm in Appendix A.

Figure 1 summarises this. Besides, the proofs of the results are given in the full version of the paper, as well as a canonization algorithm. Finally, the existence and uniqueness are also proved in Rocq since we formalised the canonization algorithm for the levels ¹¹1https://gitlab.crans.org/geran/level_formalisation.

	$\displaystyle\bm{\mathrm{L}}:\text{levels}\qquad\bm{\mathrm{E}}:\text{extended% levels}\qquad\bm{\mathrm{S_{C}}}:\text{canonical sublevels}$
	$\displaystyle\bm{\mathrm{R}}:\text{representations (maximum of canonical % sublevels)}$
	$\displaystyle\bm{\mathrm{R_{C}}}:\text{minimal representations (elements are % incomparable)}$

Figure 1: Outline.

2 Level Representation

As said in the introduction, in this section we find a set of sublevels that fit our objective of level representation. Before that, let us introduce a basic notion that improves the readability of the text.

Definition 6.

Let $u$ be a level and $\sigma$ be a valuation. We say that $u$ is active (under the valuation $\sigma$ ) if $\left\llbracket u\right\rrbracket_{\sigma}\neq 0$ . We also say that $\sigma$ activates $u$ .

The valuation $\sigma$ will often be left implicit. For instance, we can say that $\operatorname{imax}\left(u,v\right)$ is $\operatorname*{max}\left(u,v\right)$ if $v$ is active and $0$ otherwise in the sense that for all valuation $\sigma$ , $\left\llbracket\operatorname{imax}\left(u,v\right)\right\rrbracket_{\sigma}$ is $\left\llbracket\operatorname*{max}\left(u,v\right)\right\rrbracket_{\sigma}$ if $v$ is active under $\sigma$ and $0$ otherwise.

2.1 Levels as Maximum

The very first step to simplify the terms is to express any level as a maximum of levels that do not contain any $\operatorname*{max}$ , that is the principle of our idea. The successor can be distributed over $\operatorname*{max}$ since for all $u,v\in\bm{\mathrm{L}}$ , $S(\operatorname*{max}\left(u,v\right))\equiv\operatorname*{max}\left(S(u),S(v)\right)$ , and the next two observations show how to distribute $\operatorname{imax}$ over $\operatorname*{max}$ .

Observation 7.

For all $u,v,w\in\bm{\mathrm{L}}$ ,

\operatorname{imax}\left(u,\operatorname*{max}\left(v,w\right)\right)\equiv% \operatorname*{max}\left(\operatorname{imax}\left(u,v\right),\operatorname{% imax}\left(u,w\right)\right).

Observation 8.

For all $u,v,w\in\bm{\mathrm{L}}$ ,

\operatorname{imax}\left(\operatorname*{max}\left(u,v\right),w\right)\equiv% \operatorname*{max}\left(\operatorname{imax}\left(u,w\right),\operatorname{% imax}\left(v,w\right)\right).

Then, any level can be expressed as a maximum of levels without $\operatorname*{max}$ . Note that for this, we consider that $\operatorname*{max}$ takes a finite set of levels as argument. We obtain the following theorem.

Theorem 9.

For all $t\in\bm{\mathrm{L}}$ , there exists $u_{1},\ldots,u_{n}$ in the grammar

\ell\coloneqq 0\mid S(\ell)\mid\operatorname{imax}\left(\ell,\ell\right)\mid x

such that $t\equiv\operatorname*{max}\left(u_{1},\ldots,u_{n}\right)$ . Moreover, the variables that appear in $u_{1},\ldots,u_{n}$ are exactly the variables that appear in $t$ .

2.2 Simplification of the Levels

We can now focus on levels without maximum. The uniqueness property sought for the representation requires the levels to be very basic, and then we search to simplify them.

The main issue in the grammar of Theorem 9 is $\operatorname{imax}$ because it is asymmetric. We aim to restrict the localisation of the $\operatorname{imax}$ symbol to specific parts of the levels in order to understand and control their influence on the levels semantic.

Firstly, we recall some equivalences that are direct consequences of the semantics of $\operatorname{imax}$ . They permit to deal with $0$ and the successor.

Observation 10.

For all $u,v\in\bm{\mathrm{L}}$ ,

\operatorname{imax}\left(u,0\right)\equiv 0\qquad\operatorname{imax}\left(u,S(% v)\right)\equiv\operatorname*{max}\left(u,S(v)\right)

And we show how to remove $\operatorname{imax}$ symbol in second argument of $\operatorname{imax}$ .

Observation 11.

For all $u,v,w\in\bm{\mathrm{L}}$ ,

\operatorname{imax}\left(u,\operatorname{imax}\left(v,w\right)\right)\equiv% \operatorname*{max}\left(\operatorname{imax}\left(u,w\right),\operatorname{% imax}\left(v,w\right)\right).

Thus, by applying the simplification suggested by observations 10 and 11, we can restrict the second argument of $\operatorname{imax}$ to be a variable. It is more complicated for its first argument. We can simplify the level when it is $0$ .

Observation 12.

For all $v\in\bm{\mathrm{L}}$ , $\operatorname{imax}\left(0,v\right)\equiv v$ .

Moreover, we can distribute $S$ over $\operatorname{imax}$ , but this cannot be done as directly as we distribute the $S$ over $\operatorname*{max}$ , as shown in the next example.

Example 13.

We consider the levels $t_{1}=S(\operatorname{imax}\left(y,x\right))$ and $t_{2}=\operatorname{imax}\left(S(y),S(x)\right)$ . By considering a valuation $\sigma$ such that $\sigma(x)=0$ and $\sigma(y)=1$ , $\left\llbracket t_{1}\right\rrbracket_{\sigma}\neq\left\llbracket t_{2}\right% \rrbracket_{\sigma}$ , therefore $t_{1}\not\equiv t_{2}$ .

Besides, we can observe that $S(\operatorname{imax}\left(u,v\right)$ ) is at least $S(v)$ , but can also be $S(u)$ when $v$ is active. Therefore, the simplification rule has to take into account the two relevant cases depending on the value of $v$ .

Observation 14.

For all $u,v\in\bm{\mathrm{L}}$ ,

S(\operatorname{imax}\left(u,v\right))\equiv\operatorname*{max}\left(S(v),% \operatorname{imax}\left(S(u),v\right)\right).

Finally, observations 11, 10, 12, and 14 lead to this grammar restriction.

Theorem 15.

For all $t\in\bm{\mathrm{L}}$ , there exists $u_{1},\ldots,u_{n}$ in the grammar

\ell\coloneq S^{k+1}(0)\mid S^{k}(x)\mid\operatorname{imax}\left(\ell,x\right)

such that $t\equiv\operatorname*{max}\left(u_{1},\ldots,u_{n}\right)$ . Moreover, the variables that appear in $u_{1},\ldots,u_{n}$ are exactly the variables that appear in $t$ .

$\blacktriangleright$ Remark 16.

For all $t$ in the grammar of Theorem 15, there exists $x_{1},\ldots,x_{n}\in\mathbf{X}$ , and $v=S^{k+1}(0)$ or $v=S^{k}(x)$ such that

t=\operatorname{imax}\left(\operatorname{imax}\left(\operatorname{imax}\left(% \cdots\operatorname{imax}\left(v,x_{1}\right),x_{2})\cdots)\right),x_{n-1}% \right),x_{n}\right).

Such a term $t$ will be denoted by $[v,x_{1},\ldots,x_{n}]$ . Intuitively, this term is the maximum of

$\blacksquare$

$x_{n}$ ,
$\blacksquare$

$x_{n-1}$ if $x_{n}$ is active,
$\blacksquare$

$x_{n-2}$ if $x_{n}$ and $x_{n-1}$ are active,
$\blacksquare$

etc.
$\blacksquare$

$v$ if all the $x_{i}$ are active.

2.3 Introducing New Levels

Here, we continue the simplification process in order to find simple enough terms to reach the uniqueness property. Indeed, the terms of the grammar of Theorem 15 are still not simple enough.

Example 17.

Let us consider $t=\operatorname*{max}\left(\operatorname{imax}\left(x,y\right),x\right)$ . Then, $t\equiv\operatorname*{max}\left(x,y\right)$ .

The problem is the following: if $\operatorname{imax}\left(x,y\right)$ permits to take $x$ into account if $y$ is active, it also takes $y$ into account in all cases. Then, it is redundant with $y$ and lead to the equivalence $\operatorname{imax}\left(x,y\right)\equiv\operatorname*{max}\left(y,% \operatorname{imax}\left(x,y\right)\right)$ . We would like to obtain $\operatorname{imax}\left(x,y\right)=\operatorname*{max}\left(y,t\right)$ with some level $t$ , but $\operatorname{imax}\left(x,y\right)$ cannot be simplified more.

In fact, the second argument of $\operatorname{imax}$ has too many responsibilities since it should be taken into account, but it is also a condition to take into account the first argument.

The second concern leads us to devise the introduction of a term “ $\mathop{\text{if}}y\mathop{\text{then}}x$ ” such that $\left\llbracket\mathop{\text{if}}y\mathop{\text{then}}x\right\rrbracket_{\sigma}$ is $0$ if $\left\llbracket y\right\rrbracket_{\sigma}=0$ and $\left\llbracket x\right\rrbracket_{\sigma}$ otherwise. This permits us to simplify $\operatorname{imax}\left(x,y\right)$ into $\operatorname*{max}\left(y,\mathop{\text{if}}y\mathop{\text{then}}x\right)$ , and since $\mathop{\text{if}}y\mathop{\text{then}}x\leqslant x$ , $\operatorname*{max}\left(y,\mathop{\text{if}}y\mathop{\text{then}}x,x\right)$ can be turned into $\operatorname*{max}\left(y,x\right)$ .

Since, the $\operatorname{imax}$ functions are nested in the grammar of Theorem 15, we may need to have multiple variables as conditions; we generalize this idea of new terms and extend the level’s grammar with two symbols $\operatorname{\mathcal{V}}$ and $\operatorname{\mathcal{C}}$ .

Definition 18 (Extended levels).

An extended level is a term of the grammar

\displaystyle\ell\coloneqq 0

\displaystyle\mid S(\ell)\mid\operatorname*{max}\left(\ell,\ell\right)\mid% \operatorname{imax}\left(\ell,\ell\right)\mid x\mid\operatorname{\mathcal{V}}% \left(\left\{\ell,\ldots,\ell\right\},\ell,k\right)\mid\operatorname{\mathcal{% C}}\left(\left\{\ell,\ldots,\ell\right\},k\right)

where $k\in\mathbb{N}$ . We extend $\left\llbracket\cdot\right\rrbracket_{\sigma}$ and the level comparison to the extended levels with

	$\displaystyle\left\llbracket\operatorname{\mathcal{V}}\left(E,u,k\right)\right% \rrbracket_{\sigma}$	$\displaystyle=\begin{cases}0&\text{if $\exists v\in E,\left\llbracket v\right% \rrbracket_{\sigma}=0$}\\ \left\llbracket u\right\rrbracket_{\sigma}+k&\text{otherwise}.\end{cases}$
	$\displaystyle\left\llbracket\operatorname{\mathcal{C}}(E,k)\right\rrbracket_{\sigma}$	$\displaystyle=\begin{cases}0&\text{if $\exists u\in E,\left\llbracket u\right% \rrbracket_{\sigma}=0$}\\ k&\text{otherwise}.\end{cases}$

We denote by $\bm{\mathrm{E}}$ the set of extended levels. Levels of the form $\operatorname{\mathcal{V}}\left(E,u,k\right)$ or $\operatorname{\mathcal{C}}\left(E,k\right)$ are called sublevels.

The symbols $\operatorname{\mathcal{V}}$ and $\operatorname{\mathcal{C}}$ stand for “variable sublevel” and “constant sublevel” in the sense that their semantic consists in taking into account a non-constant or a constant extended level $u$ when a set of extended levels $E$ only contain active ones.

Definition 19.

We denote by $\bm{\mathrm{S}}$ the set of sublevels. Let $u\in\bm{\mathrm{S}}$ , $u=\operatorname{\mathcal{V}}\left(E,v,k\right)$ or $u=\operatorname{\mathcal{C}}\left(E,k\right)$ . We call $E$ the verification conditions of $u$ denoted by $\operatorname{VC}\left(u\right)$ , and $k$ is its constant part denoted by $\operatorname{\omega}\left(u\right)$ . We also define the variable part of $u$ denoted by $\operatorname{\nu}\left(u\right)$ which is $0$ in the case of a constant sublevel and $v$ in the case of a variable sublevel.

Besides, we say that a verification condition $w\in E$ is checked (by a valuation $\sigma$ ) if $\left\llbracket w\right\rrbracket_{\sigma}\neq 0$ and we say that the verification condition $E$ are checked if for all $w\in E$ , $\left\llbracket w\right\rrbracket_{\sigma}\neq 0$ .

The verification conditions and the parts of a sublevel determine its value and if it is active. Indeed, when $u$ is active, its value is $\left\llbracket\operatorname{\omega}\left(u\right)\right\rrbracket_{\sigma}+% \left\llbracket\operatorname{\nu}\left(u\right)\right\rrbracket_{\sigma}$ . And for $u$ to be active, the verification condition of $u$ should be checked (otherwise the value of $u$ is automatically $0$ ), and on top of that $\left\llbracket\operatorname{\omega}\left(u\right)\right\rrbracket_{\sigma}+% \left\llbracket\operatorname{\nu}\left(u\right)\right\rrbracket_{\sigma}$ (which is then the value of $u$ ) should not be $0$ .

Proposition 20.

For all $u\in\bm{\mathrm{S}}$ and for all valuations $\sigma$ , $\left\llbracket u\right\rrbracket_{\sigma}=\operatorname{\omega}\left(u\right)% +\left\llbracket\operatorname{\nu}\left(u\right)\right\rrbracket_{\sigma}$ if $u$ is active and $0$ otherwise. Moreover, $u$ is active if and only if its verifications conditions are checked and $\operatorname{\omega}\left(u\right)+\left\llbracket\operatorname{\nu}\left(u% \right)\right\rrbracket_{\sigma}\neq 0$ .

Keeping with our idea, we want to show that any level is equivalent to a maximum of sublevels, so we do it for the grammar presented in Theorem 15. Here is the intuition behind the replacement of a nested $\operatorname{imax}$ by a maximum of sublevels. In $[S^{k}(y),x_{1},\ldots,x_{n}]$ ,

$\blacksquare$

$x_{n}$ is always considered, so we take $\operatorname{\mathcal{V}}\left(\left\{\right\},x_{n},0\right)$ ,
$\blacksquare$

$x_{n-1}$ is considered if $x_{n}$ is active, so we take $\operatorname{\mathcal{V}}\left(\left\{x_{n}\right\},x_{n-1},0\right)$ ,
$\blacksquare$

$x_{n-2}$ is considered if $x_{n}$ and $x_{n-1}$ are active, so we take $\operatorname{\mathcal{V}}\left(\left\{x_{n},x_{n-1}\right\},x_{n-2},0\right)$ ,
$\blacksquare$

…
$\blacksquare$

$S^{k}(y)$ is considered if all the $x_{i}$ are active, so we take $\operatorname{\mathcal{V}}\left(\left\{x_{n},\ldots,x_{1}\right\},y,k\right)$ .

The situation is similar in the case $[S^{k}(0),x_{1},\ldots,x_{n}]$ , except that the last taken term is $\operatorname{\mathcal{C}}\left(\left\{x_{n},\ldots,x_{1}\right\},k\right)$ .

Proposition 21.

Let $n\in\mathbb{N}$ , $E=\left\{x_{1},\ldots,x_{n}\right\}\subset\mathbf{X}$ , $k\in\mathbb{N}$ , $x_{0}\in\mathbf{X}$ , and for all $i\in\left\{0,\ldots,n\right\}$ , $u_{i}=\operatorname{\mathcal{V}}\left(\left\{x_{i+1},\ldots,x_{n}\right\},x_{i% },0\right)$ . Then,

	$\displaystyle[S^{k}(x_{0}),x_{1},\ldots,x_{n}]\equiv\operatorname*{max}\left(% \operatorname{\mathcal{V}}\left(E,x_{0},k\right),u_{1},\ldots,u_{n}\right),$
	$\displaystyle[S^{k+1}(0),x_{1},x_{n}]\equiv\operatorname*{max}\left(% \operatorname{\mathcal{C}}\left(E,k+1\right),u_{1},\ldots,u_{n}\right).$

One could note that since the grammar of $\bm{\mathrm{E}}$ is really permissive, for all $u\in\bm{\mathrm{E}}$ , we have the trivial equivalence $u\equiv\operatorname{\mathcal{V}}\left(\emptyset,u,0\right)$ to express $u$ as a sublevel. This shows that the sublevels are at least as expressive as the levels, but this equivalence is a nonsense in terms of level simplification. Proposition 21 is a much stronger and useful result since it states that the verification conditions and the variable part of variable sublevels can be restricted to variables to have a complete representation of levels of $\bm{\mathrm{L}}$ . However, we made the choice of presenting extended levels without this restriction to facilitate the level instantiation (developed in Section 4). Indeed, a variable will then be replaced by any level, and we want to make this substitution transparent in our level representation.

2.4 An Appropriate Set of Sublevels

We have restrained our study to the sublevels whose verification conditions and the variable part (in the case of variable sublevels) are variables. Now, we show that some of them can be obtained as a maximum of other ones. The first restriction is related to the representation of $0$ . Indeed, for all $E\subset\mathbf{X}$ , $\operatorname{\mathcal{C}}\left(E,0\right)\equiv 0$ . Since we already have $0\equiv\operatorname*{max}\left(\emptyset\right)$ , we can remove all these sublevels. The second restriction is a little more subtle and is illustrated with this example.

Example 22.

With $t_{1}=\operatorname{\mathcal{V}}\left(\emptyset,x,0\right)$ and $t_{2}=\operatorname{\mathcal{V}}\left(\left\{x\right\},x,0\right)$ , we have $t_{1}\equiv t_{2}$ since for all valuation $\sigma$ , $\left\llbracket t_{1}\right\rrbracket_{\sigma}=\sigma(x)=\left\llbracket t_{2}% \right\rrbracket_{\sigma}$ .

The issue here is the fact that the variable part of a variable sublevel does not necessarily appear in its first argument. This is the key of the following equivalence.

Proposition 23.

Let $x\in\mathbf{X}$ , $E\subset\mathbf{X}\setminus\left\{x\right\}$ and $k\in\mathbb{N}$ . Then

\operatorname{\mathcal{V}}\left(E,x,k\right)\equiv\operatorname*{max}\left(% \operatorname{\mathcal{V}}\left(E\cup\left\{x\right\},x,k\right),\operatorname% {\mathcal{C}}\left(E,k\right)\right).

If $k=0$ , the sublevel $\operatorname{\mathcal{C}}\left(E,k\right)$ obtained with Proposition 23 is removed accordingly to the first restriction. We end up with the following set of sublevels which permits to express any level.

Definition 24 (Canonical sublevels).

A canonical sublevel is an element of the set

\bm{\mathrm{S_{C}}}=\left\{\operatorname{\mathcal{V}}\left(E,x,k\right)% \mathrel{}\middle|\mathrel{}E\subset\mathbf{X},x\in E\right\}\cup\left\{% \operatorname{\mathcal{C}}\left(E,k\right)\mathrel{}\middle|\mathrel{}E\subset% \mathbf{X},k>0\right\}.

Theorem 25.

Let $t\in\bm{\mathrm{L}}$ . Then there exists a finite $U\subset\bm{\mathrm{S_{C}}}$ such that $t\equiv\operatorname*{max}U$ . Moreover, the variables that appear in the elements of $U$ are exactly the variables that appear in $t$ .

Definition 26.

Let $U$ be a finite subset of $\bm{\mathrm{S_{C}}}$ . We say that $\operatorname*{max}U$ is a representation, and we denote by $\bm{\mathrm{R}}$ the set of representations.

Besides, for all $t\in\bm{\mathrm{E}}$ , we say that $\operatorname*{max}U$ is a representation of $t$ if $t\equiv\operatorname*{max}U$ , and we say that the elements $u$ of $U$ are the elements of the representation.

The canonical sublevels correspond to the set of sublevels that we were searching for. We could try to merge the two types of sublevels by introducing a special variable $\mathbf{1}$ such that for all valuation $\sigma$ , $\sigma(\mathbf{1})=1$ . We can then see $\operatorname{\mathcal{C}}\left(E,k+1\right)$ as $\operatorname{\mathcal{V}}\left(E\cup\left\{\mathbf{1}\right\},\mathbf{1},k\right)$ . This simplifies some results but makes the presentation less clear, and the distinction should still be done in a lot of cases.

$\blacktriangleright$ Remark 27.

Let $u\in\bm{\mathrm{S_{C}}}$ and let $\sigma$ be a valuation. Then $u$ is active if and only if all its verification conditions are checked.

3 A Canonical Form for levels

The previous section defined $\bm{\mathrm{R}}$ , the set of representations, and showed that any level is equivalent to one of its elements. The goal of this one is to show that any level has a minimal representation and that it is unique. This will be the canonical form.

Definition 28 (Minimal representation).

Let $\operatorname*{max}U\in\bm{\mathrm{R}}$ . We say that $\operatorname*{max}U$ is minimal if and only if for all $u,v\in U$ such that $u\neq v$ , $u$ and $v$ are incomparable. We denote by $\bm{\mathrm{R_{C}}}$ the set of the minimal representations.

By Theorem 25, any level has a representation. Since the set of representations is well-founded with the inclusion order, any level has a minimal representation. The challenging part is the uniqueness of minimal representations. To show it, we study the core of the definition of a minimal representation: the sublevel comparison.

3.1 Sublevel Comparison

The sublevels can be easily compared which is quite normal since we chose them to be very basic. To have $u\leqslant v$ , $v$ should be active whenever $u$ is active hence $\operatorname{VC}\left(v\right)\subset\operatorname{VC}\left(u\right)$ . And when they are both active, the value of $v$ should be greater than the one of $u$ .

With two variable sublevels it means that their variable part is the same or else we can set a very large value to $\operatorname{\nu}\left(u\right)$ in order to falsify the inequality. This also explains that we cannot have $\operatorname{\mathcal{V}}\left(E,x,l\right)\leqslant\operatorname{\mathcal{C}% }\left(F,k\right)$ . And if $u$ is a constant sublevel and $v$ a variable sublevel, we should remember that when they are active, $\operatorname{\nu}\left(v\right)$ is active (because it is included in $\operatorname{VC}\left(v\right)$ ) and then $\operatorname{\omega}\left(u\right)\leqslant\operatorname{\omega}\left(v\right% )+1$ suffices.

Theorem 29 (Sublevels comparison).

The following equivalences permit to compare elements of $\bm{\mathrm{S_{C}}}$ .

	$\displaystyle\operatorname{\mathcal{V}}\left(E,x,l\right)\not\leqslant% \operatorname{\mathcal{C}}\left(F,k\right)$		(1)
	$\displaystyle\operatorname{\mathcal{C}}\left(E,l\right)\leqslant\operatorname{% \mathcal{C}}\left(F,k\right)\iff F\subset E\land l\leqslant k$		(2)
	$\displaystyle\operatorname{\mathcal{C}}\left(E,l\right)\leqslant\operatorname{% \mathcal{V}}\left(F,x,k\right)\iff(F\subset E\land l\leqslant k+1)$		(3)
	$\displaystyle\operatorname{\mathcal{V}}\left(E,x,l\right)\leqslant% \operatorname{\mathcal{V}}\left(F,y,k\right)\iff F\subset E\land x=y\land l\leqslant k$		(4)

As a corollary, we get that the sublevel equivalence is a syntactic equality, which is expected to ease uniqueness.

Corollary 30.

Let $t_{1},t_{2}\in\bm{\mathrm{S_{C}}}$ . Then $t_{1}\equiv t_{2}\iff t_{1}=t_{2}$ .

Figure 2 illustrates the comparison of the canonical sublevels on the set of variables $\left\{x,y\right\}$ and with $0$ or $1$ as constant part. We see that we get a greater sublevel by increasing the constant part, reducing the set of verification conditions, or moving to a variable sublevel (in that case, the constant part can be reduced by one).

Figure 2: Sublevel comparison on

\left\{x,y\right\}

.

3.2 The Uniqueness Property

Now, we can show the uniqueness property. We have to show that two equivalent minimal representations $\operatorname*{max}U$ and $\operatorname*{max}V$ have the same sublevels. Here, we show a slightly different proof. It is more elegant, and it emphasises the property that ensures the uniqueness.

We will show that $u\leqslant\operatorname*{max}V$ implies the existence of $v\in V$ such that $u\leqslant v$ . This statement lead to the uniqueness property. Indeed, if $\operatorname*{max}U$ and $\operatorname*{max}V$ are two equivalent minimal representation, then for all $u\in U$ there exists a $v\in V$ such that $u\leqslant v$ , and similarly there exists $u^{\prime}\in U$ such that $v\leqslant u^{\prime}$ . Minimality of $\operatorname*{max}U$ permits to conclude that $u=u^{\prime}$ , and then $u\equiv v$ and finally $u=v$ by comparison of canonical sublevels.

To prove this statement, the idea is to find, for any sublevel $u$ , a valuation $\sigma$ such that the only way to have $\left\llbracket u\right\rrbracket_{\sigma}\leqslant\left\llbracket V\right% \rrbracket_{\sigma}$ is to have $v\in V$ with $u\leqslant v$ . Such a valuation should only activate the verification condition of $u$ . Moreover, $\left\llbracket u\right\rrbracket_{\sigma}$ should be large enough to overtake the other sublevels of the representation. When $u$ is a variable sublevel, it means to set its variable part to a large number.

Definition 31.

Let $u\in\bm{\mathrm{S_{C}}}$ be a constant sublevel and $V\subset\bm{\mathrm{S_{C}}}$ be a set of canonical sublevel. The $V$ -minimal over-valuation of $u$ is the valuation defined by

\sigma(x)=\begin{cases*}0&if $x\notin\operatorname{VC}\left(u\right)$\\ 1&if $x\in\operatorname{VC}\left(u\right)$.\end{cases*}

Definition 32.

Let $u\in\bm{\mathrm{S_{C}}}$ be a variable sublevel and $V\subset\bm{\mathrm{S_{C}}}$ be a set of canonical sublevel. The $V$ -minimal over-valuation of $u$ is the valuation defined by

\sigma(x)=\begin{cases*}0&if $x\notin\operatorname{VC}\left(u\right)$\\ 1&if $x\in\operatorname{VC}\left(u\right)\setminus\operatorname{\nu}\left(u% \right)$\\ 2+\operatorname*{max}\left\{\operatorname{\omega}\left(v\right)\mathrel{}% \middle|\mathrel{}v\in V\right\}&otherwise.\end{cases*}

Proposition 33.

Let $u\in\bm{\mathrm{S_{C}}}$ be a variable sublevel, $V\subset\bm{\mathrm{S_{C}}}$ be a set of canonical sublevel and $\sigma$ be the $V$ -minimal over-valuation of $u$ . We have

\left\llbracket u\right\rrbracket_{\sigma}\leqslant\left\llbracket% \operatorname*{max}V\right\rrbracket_{\sigma}\iff\exists v\in V,u\leqslant v.

This proposition is fundamental for the following theorem.

Theorem 34.

For all $u\in\bm{\mathrm{S_{C}}}$ and $\operatorname*{max}V\in\bm{\mathrm{R}}$ , $u\leqslant\operatorname*{max}V$ if and only if there exists $v\in V$ such that $u\leqslant v$ .

And we obtain that equivalence of minimal representations is set equality.

Proposition 35.

For all $\operatorname*{max}U,\operatorname*{max}V\in\bm{\mathrm{R_{C}}}$ , $\operatorname*{max}U\equiv\operatorname*{max}V\iff U=V$ .

Finally, we obtain the main theorem: the existence and uniqueness of a minimal representation for each level, that is to say a canonical form. First, we show the intuitive property that the minimal representation of a maximum of sublevels is formed with some of them.

Proposition 36.

For all $\operatorname*{max}U\in\bm{\mathrm{R}}$ , there exists a unique $\operatorname*{max}V\in\bm{\mathrm{R_{C}}}$ such that $\operatorname*{max}U\equiv\operatorname*{max}V$ . Moreover, $V$ is a subset of $U$ .

Theorem 37 (Minimal Representation).

For all $t\in\bm{\mathrm{L}}$ , there exists a unique $\operatorname*{max}U\in\bm{\mathrm{R_{C}}}$ such that $t\equiv\operatorname*{max}U$ . We say that $\operatorname*{max}U$ is the canonical form of $t$ .

This theorem states the existence of a canonical form function $c$ for $\bm{\mathrm{L}}$ , $c$ being the function that associates any level to its minimal representation.

$\blacktriangleright$ Remark 38.

The key point of this result is Theorem 34. It should be understood as an independence property. Indeed, if we consider $\operatorname*{max}\left(u_{1},\ldots,u_{n}\right)$ as a linear combination of $u_{1},\ldots,u_{n}$ , then this theorem states that the only way to be smaller than a linear combination is to depend on and be smaller than one of the elements of this combination.

This analogy provides a new point of view on our work: $\bm{\mathrm{S_{C}}}$ is a “linearly independent” family (uniqueness of the minimal representation) which generates all the levels through “linear combinations”.

Moreover, Theorem 34 provides a method to compare two levels, by comparing each sublevel of the minimal representation of the first one to the second one. More generally, two representations are compared in the following way.

Theorem 39.

For all $\operatorname*{max}U,\operatorname*{max}V\in\bm{\mathrm{R}}$ , $\operatorname*{max}U\leqslant\operatorname*{max}V$ if and only if for all $u\in U$ , there exists $v\in V$ such that $u\leqslant v$ .

4 A Canonical Form for Extended Levels

We are now interested in extending the representation theorem to the whole grammar of extended levels in order to have substitution. Indeed, if $u=\operatorname*{max}\left(\operatorname{\mathcal{V}}\left(\left\{x\right\},x,% 0\right)\right)$ (the canonical form of $x$ ), then

\displaystyle v=u\left\{x\coloneq\operatorname*{max}\left(y,1\right)\right\}=% \operatorname*{max}\left(\operatorname{\mathcal{V}}\left(\left\{\operatorname*% {max}\left(y,1\right)\right\},\operatorname*{max}\left(y,1\right),0\right)\right)

is not a representation. Since $\bm{\mathrm{L}}$ is stable by substitution, $u\left\{x\coloneq\operatorname*{max}\left(y,1\right)\right\}$ is a level and then $v$ has a minimal representation (which is $\operatorname*{max}\left(\operatorname{\mathcal{V}}\left(\left\{y\right\},y,0% \right),\operatorname{\mathcal{C}}\left(\emptyset{},1\right)\right)$ ).

Here, we provide a way to find this representation by extending the representation to all extended levels, showing that they have representations.

Theorem 40.

For all $u\in\bm{\mathrm{E}}$ , there exists $\operatorname*{max}V\in\bm{\mathrm{R}}$ such that $u\equiv\operatorname*{max}V$ .

$\blacktriangleright$ Remark 41.

One could think that we do not have to deal with $S$ and $\operatorname{imax}$ because we know that the successor and $\operatorname{imax}$ of representations are representations. But we do not yet have this result ; we only know that it is true for representations of levels (because levels can be represented). Moreover, the development of this section will be helpful to develop a recursive canonization algorithm.

4.1 The Successor

In order to define the successor of a canonical sublevel in terms of representation, we define $\operatorname{inc}\colon\bm{\mathrm{S_{C}}}\to\bm{\mathrm{S_{C}}}$ such that for all $E\subset\mathbf{X},k>0$ , $\operatorname{inc}\left(\operatorname{\mathcal{C}}\left(E,k\right)\right)=% \operatorname{\mathcal{C}}\left(E,k+1\right)$ and for all $E\subset\mathbf{X},x\in E,k>0$ , $\operatorname{inc}\left(\operatorname{\mathcal{V}}\left(E,x,k\right)\right)=% \operatorname{\mathcal{V}}\left(E,x,k+1\right)$ . This function nearly increments a canonical sublevel but does not fulfil that objective when some elements of $E$ are not active (it will give $0$ instead of $1$ ). That’s why we take it in combination with $\operatorname{\mathcal{C}}\left(\emptyset,1\right)$ which is $1$ .

Proposition 42.

For all $u\in\bm{\mathrm{S_{C}}}$ ,

S(u)\equiv\operatorname*{max}\left(\operatorname{inc}\left(u\right),% \operatorname{\mathcal{C}}\left(\emptyset,1\right)\right).

We immediately deduce the following result.

Proposition 43.

Let $\operatorname*{max}U\in\bm{\mathrm{R}}$ . Then,

S(\operatorname*{max}U)\equiv\operatorname*{max}\left\{\operatorname{inc}\left% (u\right)\mathrel{}\middle|\mathrel{}u\in U\right\}\cup\left\{\operatorname{% \mathcal{C}}\left(\emptyset,1\right)\right\}.

4.2 The Impredicative Maximum

Following the equivalences $\operatorname{imax}\left(0,u\right)\equiv u$ and $\operatorname{imax}\left(u,0\right)\equiv 0$ , and observations 7 and 8, we have the following equivalence.

Proposition 44.

Let $\operatorname*{max}U,\operatorname*{max}V\in\bm{\mathrm{R_{C}}}$ . We have

\operatorname{imax}\left(\operatorname*{max}U,\operatorname*{max}V\right)% \equiv\begin{dcases*}\operatorname*{max}\left(\emptyset\right)\equiv 0&if $V=% \emptyset$\\ \operatorname*{max}V&if $U=\emptyset$\\ \operatorname*{max}_{\cramped{\begin{subarray}{c}u\in U\\ v\in V\end{subarray}}}\operatorname{imax}\left(u,v\right)&else.\end{dcases*}

(5)

Then, it is sufficient to show that for all $u,v\in\bm{\mathrm{S_{C}}}$ , $\operatorname{imax}\left(u,v\right)$ has a representation. We will then obtain a representation of $\operatorname{imax}\left(\operatorname*{max}U,\operatorname*{max}V\right)$ by taking the elements of the ones of $\operatorname{imax}\left(u,v\right)$ for all $u\in U$ and $v\in V$ .

Proposition 45.

Let $v\in\bm{\mathrm{S_{C}}}$ , $E\subset\mathbf{X}$ , $x\in E$ and $k\in\mathbb{N}$ . Then,

	$\displaystyle\operatorname{imax}\left(\operatorname{\mathcal{C}}\left(E,k+1% \right),v\right)\equiv\operatorname*{max}\left(\operatorname{\mathcal{C}}\left% (E\cup\operatorname{VC}\left(v\right),k+1\right),v\right)$
	$\displaystyle\operatorname{imax}\left(\operatorname{\mathcal{V}}\left(E,x,k% \right),v\right)\equiv\operatorname*{max}\left(\operatorname{\mathcal{V}}\left% (E\cup\operatorname{VC}\left(v\right),x,k\right),v\right)$

4.3 The Sublevels

We now assume that the verification conditions of a sublevel are representations and that its variable part, in the case of a variable sublevel is a representation as well. First, we show how to transform a sublevel $t$ into a maximum of sublevels whose verifications conditions are variable.

For the sublevel to be active, all its verification conditions should be checked. Since these verification conditions are representations, it means that each of them have an active sublevel. Then, we have to consider all the combinations obtained by taking one sublevel for each verification condition of $t$ . Moreover, a sublevel is active when its verifications conditions are all checked. Then, we have to consider all the combinations of verification conditions of the verification conditions of $t$ .

Definition 46.

For all $t,u_{1},\ldots,u_{n}\in\bm{\mathrm{S_{C}}}$ , we define

P(t,u_{1},\ldots,u_{n})=\begin{dcases*}\operatorname{\mathcal{C}}\left(% \textstyle\bigcup_{0\leqslant i\leqslant n}\operatorname{VC}\left(u_{i}\right)% ,\operatorname{\omega}\left(t\right)\right)&if $t$ is a constant sublevel\\ \operatorname{\mathcal{V}}\left(\textstyle\bigcup_{0\leqslant i\leqslant n}% \operatorname{VC}\left(u_{i}\right),\operatorname{\nu}\left(t\right),% \operatorname{\omega}\left(t\right)\right)&else.\end{dcases*}

The term $P(t,u_{1},\ldots,u_{n})$ means that if $u_{1},\ldots,u_{n}$ are checked, then we can take into account the value of the sublevel that we want to simplify (which is then $\operatorname{\nu}\left(t\right)+\operatorname{\omega}\left(t\right)$ ). Then, we consider such terms when $u_{i},\ldots,u_{n}$ are a combination of verification conditions taken from each verification conditions of $t$ . This leads to sublevels whose verification conditions are variables.

Proposition 47.

Let $\operatorname*{max}U_{1},\ldots,\operatorname*{max}U_{n}\in\bm{\mathrm{R}}$ and $t$ be a sublevel whose verifications conditions are $\operatorname*{max}U_{1},\ldots,\operatorname*{max}U_{n}$ . Then

t\equiv\operatorname*{max}\left\{P(t,u_{1},\ldots,u_{n})\mathrel{}\middle|% \mathrel{}u_{1}\in U_{1},\ldots,u_{n}\in U_{n}\right\}.

$\blacktriangleright$ Remark 48.

It is important to have $\operatorname*{max}U_{1},\ldots,\operatorname*{max}U_{n}\in\bm{\mathrm{R}}$ and not only maximum of (possibly not canonical) sublevels. Indeed, we use the fact that $u\in\bm{\mathrm{S_{C}}}$ is active if and only if all its verification conditions are checked (Remark 27), which is not always true with non-canonical sublevels. Moreover, note that the result still holds if there exists an $i$ such that $U_{i}=\emptyset$ , since both terms are equivalent to $0$ .

The Constant Sublevels

With Proposition 47, we have shown how to transform a constant sublevel $u$ whose verification conditions are representations into a representation. Indeed, in the constant sublevel case, for all $u_{1},\ldots,u_{n}\in\bm{\mathrm{S_{C}}}$ , $P(u_{1},\ldots,u_{n})\in\bm{\mathrm{S_{C}}}$ if $k>0$ (hence we obtain a representation of $t$ ). Besides, if $k=0$ , a representation of $t$ is $\operatorname*{max}\emptyset$ .

The Variable Sublevels

However, in the variable sublevels, it is not the case; Proposition 47 only permits us to obtain variable sublevels where the verification conditions are variables. Besides, the variable part of $P(u_{1},\ldots,u_{n})$ is not necessarily a variable. That is why we have to take care of the variable part of variable sublevels.

In $t=\operatorname{\mathcal{V}}\left(V,U,k\right)$ with $V\in\bm{\mathrm{E}}$ and $U\in\bm{\mathrm{R}}$ , the value of $t$ when it is activated by $\sigma$ is the maximum of the value of $\left\llbracket u\right\rrbracket_{\sigma}+k$ with $u\in U$ . Therefore, $t$ can be easily split into a maximum.

Proposition 49.

Let $\operatorname*{max}U\in\bm{\mathrm{R}},V\subset\bm{\mathrm{E}}$ , and $k\in\mathbb{N}$ . Then,

\operatorname{\mathcal{V}}\left(V,\operatorname*{max}U,k\right)\equiv% \operatorname*{max}\left\{\operatorname{\mathcal{V}}\left(V,u,k\right)\mathrel% {}\middle|\mathrel{}u\in U\right\}.

So, it results in sublevels as variable part of variable sublevels, and we handle these in the next proposition.

To transform the $\operatorname{\mathcal{V}}\left(V,u,k\right)$ obtained with Proposition 49 into a canonical sublevel, we note that its value can be $0$ , or $k$ , or $\operatorname{\nu}\left(u\right)+\operatorname{\omega}\left(u\right)+k$ . Then, we need a canonical sublevel with $\operatorname{\omega}\left(u\right)+k$ as constant part, and $\operatorname{\nu}\left(u\right)$ as variable part (we will get a constant or a variable sublevel depending on the nature of its variable part $\operatorname{\nu}\left(u\right)$ ). Besides, $u$ has to be active to obtain the value $\operatorname{\nu}\left(u\right)+\operatorname{\omega}\left(u\right)+k$ , then the verification conditions of $u$ have to be verification conditions of the targeted sublevel. However, when $u$ is not active, $\operatorname{\mathcal{V}}\left(V,u,k\right)$ is not necessarily evaluated to $0$ since it can also be evaluated to $k$ when $V$ are checked. Therefore, we add a second sublevel $\operatorname{\mathcal{C}}\left(V,k\right)$ to keep this behaviour.

Proposition 50.

Let $V\subset\bm{\mathrm{E}}$ , $u\in\bm{\mathrm{S}}$ , and $k\in\mathbb{N}$ . We note

f(u)=\begin{dcases*}\operatorname{\mathcal{C}}\left(V\cup\operatorname{VC}% \left(u\right),k+\operatorname{\omega}\left(u\right)\right)&if $u$ is a % constant sublevel\\ \operatorname{\mathcal{V}}\left(V\cup\operatorname{VC}\left(u\right)\cup\left% \{u\right\},\operatorname{\nu}\left(u\right),k+\operatorname{\omega}\left(u% \right)\right)&else\end{dcases*}

Then $\operatorname{\mathcal{V}}\left(V,u,k\right)\equiv\operatorname*{max}\left(% \operatorname{\mathcal{C}}\left(V,k\right),f(u)\right).$

Note that when $u\in\bm{\mathrm{S_{C}}}$ , having $\operatorname{VC}\left(u\right)$ as verification conditions is equivalent to having $u$ as verification condition which simplifies $f(u)$ in the case where $u$ is a variable sublevel.

We can apply these two propositions to the $P(u_{1},\ldots,u_{n})$ from Definition 46. For that, we define the following.

Definition 51.

Let $k\in\mathbb{N}$ . For all $v,u_{1},\ldots,u_{n}\in\bm{\mathrm{S_{C}}}$ , we define (by noting $u_{0}=v$ ),

\displaystyle f(v,u_{1},\ldots,u_{n})=\begin{dcases*}\operatorname{\mathcal{C}% }\left(\textstyle\bigcup_{0\leqslant i\leqslant n}\operatorname{VC}\left(u_{i}% \right),k+\operatorname{\omega}\left(v\right)\right)&if $v$ is a constant % sublevel\\ \operatorname{\mathcal{V}}\left(\textstyle\bigcup_{0\leqslant i\leqslant n}% \operatorname{VC}\left(u_{i}\right),\operatorname{\nu}\left(v\right),k+% \operatorname{\omega}\left(v\right)\right)&else\end{dcases*}

and

Q(v,u_{1},\ldots,u_{n})=\operatorname*{max}\left(\operatorname{\mathcal{C}}% \left(\textstyle\bigcup_{1\leqslant i\leqslant n}\operatorname{VC}\left(u_{i}% \right),k\right),f(v,u_{1},\ldots,u_{n})\right).

Here, $f(v,u_{1},\ldots,u_{n})$ , corresponds to the case where all the verification conditions $u_{1},\ldots,u_{n}$ are checked and $v$ is active and $\operatorname{\mathcal{C}}\left(\textstyle\bigcup_{1\leqslant i\leqslant n}% \operatorname{VC}\left(u_{i}\right),k\right)$ corresponds to the case where $u_{1},\ldots,u_{n}$ are checked but $v$ is not active, hence they form $Q(v,u_{1},\ldots,u_{n})$ . Finally, we get the following proposition.

Proposition 52.

Let $k\in\mathbb{N}$ , $\operatorname*{max}U_{1},\ldots,\operatorname*{max}U_{n}\in\bm{\mathrm{R}}$ , $\operatorname*{max}V\in\bm{\mathrm{R}}$ , and let $t$ be the variable sublevel $\operatorname{\mathcal{V}}\left(\left\{\operatorname*{max}U_{1},\ldots,% \operatorname*{max}U_{n}\right\},\operatorname*{max}V,k\right)$ . Then,

t\equiv\operatorname*{max}\left\{Q(v,u_{1},\ldots,u_{n})\mathrel{}\middle|% \mathrel{}u_{1}\in U_{1},\ldots,u_{n}\in U_{n},v\in V\right\}

$\blacktriangleright$ Remark 53.

As for the constant sublevels case, we should take care to consider the sublevel $f(u_{1},\ldots,u_{n})$ only if its constant part is not $0$ . Otherwise, it is equivalent to $0$ , and it has to be removed from the max if you want a canonical form.

Besides, one could think that we should have the same consideration with the sublevel $g(v,u_{1},\ldots,u_{n})$ (whose constant part is $k+\operatorname{\omega}\left(v\right)$ ) when $v$ is a constant sublevel, but since $v\in\bm{\mathrm{S_{C}}}$ (because it is an element of a representation), then $\operatorname{\omega}\left(v\right)>0$ .

After that, the case of the variable sublevel is solved. We have proved all the induction cases, which terminates the proof of Theorem 40.

General Representation Theorem

By Theorem 40 any level has a representation, and therefore a minimal one by Proposition 36.

Theorem 54.

For all $u\in\bm{\mathrm{E}}$ , there exists a unique $v\in\bm{\mathrm{R_{C}}}$ such that $u\equiv v$ .

With Theorem 54, we have shown that $\bm{\mathrm{R_{C}}}$ is as expressive as $\bm{\mathrm{E}}$ whereas $\bm{\mathrm{R_{C}}}\subsetneq\bm{\mathrm{E}}$ . To finish this section, we compare the expressiveness of the different shape of levels.

If $\bm{\mathrm{L}}$ is semantically, and even syntactically, a subset of $\bm{\mathrm{E}}$ , one could note that some extended levels are not equivalent to any level. In fact, even some canonical sublevels are not equivalent to any level. For instance, for all $x\in\mathbf{X}$ , there is no $u\in\bm{\mathrm{L}}$ such that $u\equiv\operatorname{\mathcal{C}}\left(\left\{x\right\},1\right)$ .

In the same way we show that some levels are not expressible with just one canonical level. For instance, for all $x,y\in\mathbf{X}$ , there is no $u\in\bm{\mathrm{S_{C}}}$ such that $u\equiv\operatorname*{max}\left(x,y\right)$ .

We end up with the inclusion Hasse diagram displayed in Figure 3.

Figure 3: Hierarchy of levels.

5 Conclusion

We studied the 0- $\operatorname{imax}$ -successor and introduced a canonical form for its terms, which gives us an easy procedure decision for the equivalence problem. For that, we extended the grammar with new terms called sublevels, and we expressed any term as a maximum of sublevels, what we have called a representation. Since not all representations are actually terms of the algebra, a next step could be to characterize the representations that are. This could lead to an even better understanding of the 0- $\operatorname{imax}$ -successor algebra.

We only provide a naive canonization algorithm since level expressions are usually quite small. However, searching for a better algorithm and a good data structures for representations could be useful.

Finally, this representation can be expressed in $\lambda\Pi/\equiv$ with rewrite rules, which is our initial motivation, and it is used in a Work In Progress translator from Lean to Dedukti showing that it can indeed be used to express $\text{CC}^{\infty}_{\forall}$ in $\lambda\Pi/\equiv$ . The next step here, is to study how the expression of universe polymorphism, thanks to this level representation, behaves well together with other features such as inductive types or cumulativity.

References

[1] Ali Assaf. A framework for defining computational higher-order logics. Theses, École polytechnique, September 2015. URL: https://pastel.archives-ouvertes.fr/tel-01235303.
[2] Ali Assaf, Guillaume Burel, Raphaël Cauderlier, David Delahaye, Gilles Dowek, Catherine Dubois, Frédéric Gilbert, Pierre Halmagrand, Olivier Hermant, and Ronan Saillard. Dedukti : a Logical Framework based on the $\lambda\Pi$ -Calculus Modulo Theory, 2016.
[3] Ali Assaf, Gilles Dowek, Jean-Pierre Jouannaud, and Jiaxiang Liu. Encoding Proofs in Dedukti: the case of Coq proofs. In Proceedings Hammers for Type Theories, Proc. Higher-Order rewriting Workshop, Coimbra, Portugal, July 2016. Easy Chair. URL: https://inria.hal.science/hal-01330980.
[4] Henk Barendregt. Introduction to generalized type systems. Journal of Functional Programming, 1(2):125–154, 1991. doi:10.1017/S0956796800020025.
[5] Henk Barendregt, S. Abramsky, D. Gabbay, T. Maibaum, and Henk (Hendrik) Barendregt. Lambda Calculi with Types, 2000.
[6] Stefano Berardi. Type dependence and Constructive mathematics. PhD thesis, PhD thesis, Dipartimento di Informatica, Torino, Italy, 1990.
[7] Frédéric Blanqui. Encoding Type Universes Without Using Matching Modulo Associativity and Commutativity. In Amy P. Felty, editor, 7th International Conference on Formal Structures for Computation and Deduction (FSCD 2022), volume 228 of Leibniz International Proceedings in Informatics (LIPIcs), pages 24:1–24:14, Dagstuhl, Germany, 2022. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.FSCD.2022.24.
[8] Frédéric Blanqui, Gilles Dowek, Émilie Grienenberger, Gabriel Hondet, and François Thiré. Some Axioms for Mathematics. In Naoki Kobayashi, editor, 6th International Conference on Formal Structures for Computation and Deduction (FSCD 2021), volume 195 of Leibniz International Proceedings in Informatics (LIPIcs), pages 20:1–20:19, Dagstuhl, Germany, 2021. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.FSCD.2021.20.
[9] Mathieu Boespflug and Guillaume Burel. CoqInE: Translating the Calculus of Inductive Constructions into the $\lambda\Pi$ -calculus Modulo. In "Second International Workshop on Proof Exchange for Theorem Proving, 2012.
[10] Mathieu Boespflug, Quentin Carbonneaux, and Olivier Hermant. The $\lambda\Pi$ -calculus Modulo as a Universal Proof Language. CEUR Workshop Proceedings, 878, June 2012. URL: https://ceur-ws.org/Vol-878/paper2.pdf.
[11] Mario Carneiro. The Type Theory of Lean. Master’s thesis, Carnegie Mellon University, 2019. URL: https://github.com/digama0/lean-type-theory/releases.
[12] Thierry Coquand. An Analysis of Girard’s Paradox. In Proceedings of the First Annual IEEE Symposium on Logic in Computer Science (LICS 1986), pages 227–236. IEEE Computer Society Press, June 1986.
[13] Thierry Coquand and Gérard Huet. The calculus of constructions. Information and Computation, 76(2):95–120, 1988. doi:10.1016/0890-5401(88)90005-3.
[14] Judicaël Courant. Explicit Universes for the Calculus of Constructions. In Victor A. Carreño, César A. Muñoz, and Sofiène Tahar, editors, Theorem Proving in Higher Order Logics, pages 115–130, Berlin, Heidelberg, 2002. Springer Berlin Heidelberg. doi:10.1007/3-540-45685-6_9.
[15] Denis Cousineau and Gilles Dowek. Embedding Pure Type Systems in the Lambda-Pi-Calculus Modulo. In Typed Lambda Calculi and Applications, 8th International Conference, TLCA 2007, Paris, France, June 26-28, 2007, Proceedings, pages 102–117, June 2007. doi:10.1007/978-3-540-73228-0_9.
[16] Nachum Dershowitz and Jean-Pierre Jouannaud. Rewrite Systems. In Jan van Leeuwen, editor, Handbook of Theoretical Computer Science, Volume B: Formal Models and Semantics, pages 243–320. Elsevier and MIT Press, 1990. doi:10.1016/b978-0-444-88074-1.50011-1.
[17] Michael Färber. Safe, Fast, Concurrent Proof Checking for the Lambda-Pi Calculus modulo Rewriting. In Proceedings of the 11th ACM SIGPLAN International Conference on Certified Programs and Proofs, CPP 2022, pages 225–238, New York, NY, USA, 2022. Association for Computing Machinery. doi:10.1145/3497775.3503683.
[18] Gaspard Férey. Higher-Order Confluence and Universe Embedding in the Logical Framework. (Confluence d’ordre supérieur et encodage d’univers dans le Logical Framework). PhD thesis, École normale supérieure Paris-Saclay, France, 2021. URL: https://lmf.cnrs.fr/downloads/Perso/Ferey-thesis.pdf.
[19] Guillaume Genestier. Encoding Agda Programs Using Rewriting. In Zena M. Ariola, editor, 5th International Conference on Formal Structures for Computation and Deduction (FSCD 2020), volume 167 of Leibniz International Proceedings in Informatics (LIPIcs), pages 31:1–31:17, Dagstuhl, Germany, 2020. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.FSCD.2020.31.
[20] Girard, Jean-Yves. Interprétation fonctionnelle et élimination des coupures dans l’arithmétique d’ordre supérieur, 1972.
[21] Yoan Géran. Mathématiques inversées de Coq. Master’s thesis, ENS Paris-Saclay, September 2021. URL: https://inria.hal.science/hal-04319183.
[22] Yoan Géran. STT $\forall$ GeoCoq, 2021. URL: https://github.com/Karnaj/sttfa_geocoq_euclid.
[23] Robert Harper and Robert Pollack. Type Checking with Universes. In 2nd International Joint Conference on Theory and Practice of Software Development, TAPSOFT ’89, pages 107–136, NLD, 1991. Elsevier Science Publishers B. V. doi:10.1016/0304-3975(90)90108-T.
[24] Gabriel Hondet and Frédéric Blanqui. The New Rewriting Engine of Dedukti (System Description). In Zena M. Ariola, editor, 5th International Conference on Formal Structures for Computation and Deduction (FSCD 2020), volume 167 of Leibniz International Proceedings in Informatics (LIPIcs), pages 35:1–35:16, Dagstuhl, Germany, 2020. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.FSCD.2020.35.
[25] Per Martin-Löf. A theory of types, 1971. Preprint, Stockholm University.
[26] Per Martin-Löf. An Intuitionistic Theory of Types: Predicative Part. Studies in logic and the foundations of mathematics, 80:73–118, 1975.
[27] Per Martin-Löf. An intuitionistic theory of types. In Giovanni Sambin and Jan M. Smith, editors, Twenty-five years of constructive type theory (Venice, 1995), volume 36 of Oxford Logic Guides, pages 127–172. Oxford University Press, 1998.
[28] Matthieu Sozeau and Nicolas Tabareau. Universe Polymorphism in Coq. In Gerwin Klein and Ruben Gamboa, editors, Interactive Theorem Proving, pages 499–514, Cham, 2014. Springer International Publishing. doi:10.1007/978-3-319-08970-6_32.
[29] Terese. Term rewriting systems, volume 55 of Cambridge tracts in theoretical computer science. Cambridge University Press, 2003.
[30] François Thiré. Sharing a Library between Proof Assistants: Reaching out to the HOL Family. In Frédéric Blanqui and Giselle Reis, editors, Proceedings of the 13th International Workshop on Logical Frameworks and Meta-Languages: Theory and Practice, LFMTP@FSCD 2018, Oxford, UK, 7th July 2018, volume 274 of EPTCS, pages 57–71, 2018. doi:10.4204/EPTCS.274.5.
[31] Vladimir Voevodsky. A universe polymorphic type system, October 2014. An unfinished unreleased manuscript. URL: https://www.math.ias.edu/Voevodsky/files/files-annotated/Dropbox/Unfinished_papers/Type_systems/UPTS_current/Universe_polymorphic_type_sytem.pdf.

Appendix A Computation Algorithm

We design a recursive algorithm suited to the inductive structure of $\bm{\mathrm{E}}$ . It is presented in Algorithm 1 which already contains the code for the base cases $0$ and $x$ for which the canonical form are respectively $\operatorname*{max}\left(\emptyset\right)$ and $\operatorname*{max}\left(\operatorname{\mathcal{V}}\left(\left\{x\right\},x,0% \right)\right)$ .

Algorithm 1 Canonization algorithm.

We are now interested in the code to compute the canonical form in the other cases. The algorithm follows the induction performed in Section 4 to prove Theorem 54 since it gives us a representation for the different shapes of levels. Then, it is sufficient to minimize this representation to stick to Proposition 36. For that, we write Algorithm 2 which inserts a sublevel in an independent set of canonical sublevels.

Algorithm 2 Insertion algorithm.

The Maximum

To compute the canonical form of $\operatorname*{max}\left(u,v\right)$ , we use Algorithm 2 to insert the sublevels of $v$ into the ones of $u$ .

Algorithm 3 Case of the maximum.

The Successor

Thanks to Proposition 42, for all $\operatorname*{max}U\in\bm{\mathrm{R_{C}}}$ , we know a representation of $S(\operatorname*{max}U)$ . To obtain its canonical form, we could use $\operatorname{insert}$ to add its sublevels to an initially empty set. But, we have a simpler operation.

Proposition 55.

Let $\operatorname*{max}U\in\bm{\mathrm{R_{C}}}$ and $E=\left\{\operatorname{inc}\left(u\right)\mathrel{}\middle|\mathrel{}u\in U\right\}$ .

c(S(\operatorname*{max}U))=\begin{cases*}\operatorname*{max}E&if $\exists u\in U% ,\operatorname{VC}\left(u\right)=\emptyset$\\ \operatorname*{max}E\cup\left\{\operatorname{\mathcal{C}}\left(\emptyset,1% \right)\right\}&else\end{cases*}

We implement this strategy in Algorithm 4.

Algorithm 4 Case of the successor.

The Impredicative Maximum

For all $u,v\in\bm{\mathrm{S_{C}}}$ , Proposition 45 expresses $\operatorname{imax}\left(u,v\right)$ as a maximum of canonical sublevels, and for all $\operatorname*{max}U,\operatorname*{max}V\in\bm{\mathrm{R_{C}}}$ , Proposition 44 expresses $\operatorname{imax}\left(\operatorname*{max}U,\operatorname*{max}V\right)$ as a maximum of $\operatorname{imax}\left(u,v\right)$ with $u\in U$ and $v\in V$ (hence $u,v\in\bm{\mathrm{S_{C}}}$ ). Using these two results, we design Algorithm 5.

Algorithm 5 Case of the impredicative maximum.

The Constant Sublevels

The computation of the canonical form of a constant sublevel relies on Proposition 47. Here, we immediately returns $\operatorname*{max}\left(\emptyset\right)$ if some VC is $0$ , and we do not forget the case $k=0$ which results in $0$ .

Algorithm 6 Case of the constant sublevels.

The Variable Sublevels

The case of the variable sublevel is very similar and relies on Proposition 52.

Algorithm 7 Case of the variable sublevels.

Theorem 56 (Correction).

Let $u\in\bm{\mathrm{E}}$ . Then, $\operatorname{normalize}\left(u\right)$ computes $c(u)$ , the canonical form of $u$ .

[bib.bib1] [1] Ali Assaf. A framework for defining computational higher-order logics. Theses, École polytechnique, September 2015. URL: https://pastel.archives-ouvertes.fr/tel-01235303.

[bib.bib2] [2] Ali Assaf, Guillaume Burel, Raphaël Cauderlier, David Delahaye, Gilles Dowek, Catherine Dubois, Frédéric Gilbert, Pierre Halmagrand, Olivier Hermant, and Ronan Saillard. Dedukti : a Logical Framework based on the $\lambda\Pi$ -Calculus Modulo Theory, 2016.

[bib.bib3] [3] Ali Assaf, Gilles Dowek, Jean-Pierre Jouannaud, and Jiaxiang Liu. Encoding Proofs in Dedukti: the case of Coq proofs. In Proceedings Hammers for Type Theories, Proc. Higher-Order rewriting Workshop, Coimbra, Portugal, July 2016. Easy Chair. URL: https://inria.hal.science/hal-01330980.

[bib.bib4] [4] Henk Barendregt. Introduction to generalized type systems. Journal of Functional Programming, 1(2):125–154, 1991. doi:10.1017/S0956796800020025.

[bib.bib5] [5] Henk Barendregt, S. Abramsky, D. Gabbay, T. Maibaum, and Henk (Hendrik) Barendregt. Lambda Calculi with Types, 2000.

[bib.bib6] [6] Stefano Berardi. Type dependence and Constructive mathematics. PhD thesis, PhD thesis, Dipartimento di Informatica, Torino, Italy, 1990.

[bib.bib7] [7] Frédéric Blanqui. Encoding Type Universes Without Using Matching Modulo Associativity and Commutativity. In Amy P. Felty, editor, 7th International Conference on Formal Structures for Computation and Deduction (FSCD 2022), volume 228 of Leibniz International Proceedings in Informatics (LIPIcs), pages 24:1–24:14, Dagstuhl, Germany, 2022. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.FSCD.2022.24.

[bib.bib8] [8] Frédéric Blanqui, Gilles Dowek, Émilie Grienenberger, Gabriel Hondet, and François Thiré. Some Axioms for Mathematics. In Naoki Kobayashi, editor, 6th International Conference on Formal Structures for Computation and Deduction (FSCD 2021), volume 195 of Leibniz International Proceedings in Informatics (LIPIcs), pages 20:1–20:19, Dagstuhl, Germany, 2021. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.FSCD.2021.20.

[bib.bib9] [9] Mathieu Boespflug and Guillaume Burel. CoqInE: Translating the Calculus of Inductive Constructions into the $\lambda\Pi$ -calculus Modulo. In "Second International Workshop on Proof Exchange for Theorem Proving, 2012.

[bib.bib10] [10] Mathieu Boespflug, Quentin Carbonneaux, and Olivier Hermant. The $\lambda\Pi$ -calculus Modulo as a Universal Proof Language. CEUR Workshop Proceedings, 878, June 2012. URL: https://ceur-ws.org/Vol-878/paper2.pdf.

[bib.bib11] [11] Mario Carneiro. The Type Theory of Lean. Master’s thesis, Carnegie Mellon University, 2019. URL: https://github.com/digama0/lean-type-theory/releases.

[bib.bib12] [12] Thierry Coquand. An Analysis of Girard’s Paradox. In Proceedings of the First Annual IEEE Symposium on Logic in Computer Science (LICS 1986), pages 227–236. IEEE Computer Society Press, June 1986.

[bib.bib13] [13] Thierry Coquand and Gérard Huet. The calculus of constructions. Information and Computation, 76(2):95–120, 1988. doi:10.1016/0890-5401(88)90005-3.

[bib.bib14] [14] Judicaël Courant. Explicit Universes for the Calculus of Constructions. In Victor A. Carreño, César A. Muñoz, and Sofiène Tahar, editors, Theorem Proving in Higher Order Logics, pages 115–130, Berlin, Heidelberg, 2002. Springer Berlin Heidelberg. doi:10.1007/3-540-45685-6_9.

[bib.bib15] [15] Denis Cousineau and Gilles Dowek. Embedding Pure Type Systems in the Lambda-Pi-Calculus Modulo. In Typed Lambda Calculi and Applications, 8th International Conference, TLCA 2007, Paris, France, June 26-28, 2007, Proceedings, pages 102–117, June 2007. doi:10.1007/978-3-540-73228-0_9.

[bib.bib16] [16] Nachum Dershowitz and Jean-Pierre Jouannaud. Rewrite Systems. In Jan van Leeuwen, editor, Handbook of Theoretical Computer Science, Volume B: Formal Models and Semantics, pages 243–320. Elsevier and MIT Press, 1990. doi:10.1016/b978-0-444-88074-1.50011-1.

[bib.bib17] [17] Michael Färber. Safe, Fast, Concurrent Proof Checking for the Lambda-Pi Calculus modulo Rewriting. In Proceedings of the 11th ACM SIGPLAN International Conference on Certified Programs and Proofs, CPP 2022, pages 225–238, New York, NY, USA, 2022. Association for Computing Machinery. doi:10.1145/3497775.3503683.

[bib.bib18] [18] Gaspard Férey. Higher-Order Confluence and Universe Embedding in the Logical Framework. (Confluence d’ordre supérieur et encodage d’univers dans le Logical Framework). PhD thesis, École normale supérieure Paris-Saclay, France, 2021. URL: https://lmf.cnrs.fr/downloads/Perso/Ferey-thesis.pdf.

[bib.bib19] [19] Guillaume Genestier. Encoding Agda Programs Using Rewriting. In Zena M. Ariola, editor, 5th International Conference on Formal Structures for Computation and Deduction (FSCD 2020), volume 167 of Leibniz International Proceedings in Informatics (LIPIcs), pages 31:1–31:17, Dagstuhl, Germany, 2020. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.FSCD.2020.31.

[bib.bib20] [20] Girard, Jean-Yves. Interprétation fonctionnelle et élimination des coupures dans l’arithmétique d’ordre supérieur, 1972.

[bib.bib21] [21] Yoan Géran. Mathématiques inversées de Coq. Master’s thesis, ENS Paris-Saclay, September 2021. URL: https://inria.hal.science/hal-04319183.

[bib.bib22] [22] Yoan Géran. STT $\forall$ GeoCoq, 2021. URL: https://github.com/Karnaj/sttfa_geocoq_euclid.

[bib.bib23] [23] Robert Harper and Robert Pollack. Type Checking with Universes. In 2nd International Joint Conference on Theory and Practice of Software Development, TAPSOFT ’89, pages 107–136, NLD, 1991. Elsevier Science Publishers B. V. doi:10.1016/0304-3975(90)90108-T.

[bib.bib24] [24] Gabriel Hondet and Frédéric Blanqui. The New Rewriting Engine of Dedukti (System Description). In Zena M. Ariola, editor, 5th International Conference on Formal Structures for Computation and Deduction (FSCD 2020), volume 167 of Leibniz International Proceedings in Informatics (LIPIcs), pages 35:1–35:16, Dagstuhl, Germany, 2020. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.FSCD.2020.35.

[bib.bib25] [25] Per Martin-Löf. A theory of types, 1971. Preprint, Stockholm University.

[bib.bib26] [26] Per Martin-Löf. An Intuitionistic Theory of Types: Predicative Part. Studies in logic and the foundations of mathematics, 80:73–118, 1975.

[bib.bib27] [27] Per Martin-Löf. An intuitionistic theory of types. In Giovanni Sambin and Jan M. Smith, editors, Twenty-five years of constructive type theory (Venice, 1995), volume 36 of Oxford Logic Guides, pages 127–172. Oxford University Press, 1998.

[bib.bib28] [28] Matthieu Sozeau and Nicolas Tabareau. Universe Polymorphism in Coq. In Gerwin Klein and Ruben Gamboa, editors, Interactive Theorem Proving, pages 499–514, Cham, 2014. Springer International Publishing. doi:10.1007/978-3-319-08970-6_32.

[bib.bib29] [29] Terese. Term rewriting systems, volume 55 of Cambridge tracts in theoretical computer science. Cambridge University Press, 2003.

[bib.bib30] [30] François Thiré. Sharing a Library between Proof Assistants: Reaching out to the HOL Family. In Frédéric Blanqui and Giselle Reis, editors, Proceedings of the 13th International Workshop on Logical Frameworks and Meta-Languages: Theory and Practice, LFMTP@FSCD 2018, Oxford, UK, 7th July 2018, volume 274 of EPTCS, pages 57–71, 2018. doi:10.4204/EPTCS.274.5.

[bib.bib31] [31] Vladimir Voevodsky. A universe polymorphic type system, October 2014. An unfinished unreleased manuscript. URL: https://www.math.ias.edu/Voevodsky/files/files-annotated/Dropbox/Unfinished_papers/Type_systems/UPTS_current/Universe_polymorphic_type_sytem.pdf.

A Canonical Form for Universe Levels in Impredicative Type Theory

Abstract

Keywords and phrases:

Copyright and License:

2012 ACM Subject Classification:

Related Version:

Supplementary Material:

Acknowledgements:

DOI:

Event:

Editors:

Series and Publisher:

1 Introduction

Pure Type Systems

Definition 1.

Impredicativity

Universe Polymorphism

Definition 2 (Levels).

Definition 3.

Definition 4 (Valuation).

Definition 5 (Level comparison).

Motivation

Related Work

Outline and Contributions

2 Level Representation

Definition 6.

2.1 Levels as Maximum

Observation 7.

Observation 8.

Theorem 9.

2.2 Simplification of the Levels

Observation 10.

Observation 11.

Observation 12.

Example 13.

Observation 14.

Theorem 15.

▶ Remark 16.

2.3 Introducing New Levels

Example 17.

Definition 18 (Extended levels).

Definition 19.

Proposition 20.

Proposition 21.

2.4 An Appropriate Set of Sublevels

Example 22.

Proposition 23.

Definition 24 (Canonical sublevels).

Theorem 25.

Definition 26.

▶ Remark 27.

3 A Canonical Form for levels

Definition 28 (Minimal representation).

3.1 Sublevel Comparison

Theorem 29 (Sublevels comparison).

Corollary 30.

3.2 The Uniqueness Property

Definition 31.

Definition 32.

Proposition 33.

Theorem 34.

Proposition 35.

Proposition 36.

Theorem 37 (Minimal Representation).

▶ Remark 38.

Theorem 39.

4 A Canonical Form for Extended Levels

Theorem 40.

▶ Remark 41.

4.1 The Successor

Proposition 42.

Proposition 43.

4.2 The Impredicative Maximum

Proposition 44.

Proposition 45.

4.3 The Sublevels

Definition 46.

Proposition 47.

▶ Remark 48.

The Constant Sublevels

$\blacktriangleright$ Remark 16.

$\blacktriangleright$ Remark 27.

$\blacktriangleright$ Remark 38.

$\blacktriangleright$ Remark 41.

$\blacktriangleright$ Remark 48.

$\blacktriangleright$ Remark 53.