Abstract 1 Introduction 2 Logical Preliminaries 3 Polytree Sequent Systems 4 Cut-Elimination 5 Concluding Remarks References Appendix A Soundness

Taking Bi-Intuitionistic Logic First-Order:
A Proof-Theoretic Investigation via Polytree Sequents

Tim S. Lyon ORCID Technische Universität Dresden, Germany Ian Shillito ORCID The Australian National University, Canberra, Ngunnawal & Ngambri Country, Australia Alwen Tiu ORCID The Australian National University, Canberra, Ngunnawal & Ngambri Country, Australia
Abstract

It is well-known that extending the Hilbert axiomatic system for first-order intuitionistic logic with an exclusion operator, that is dual to implication, collapses the domains of models into a constant domain. This makes it an interesting problem to find a sound and complete proof system for first-order bi-intuitionistic logic with non-constant domains that is also conservative over first-order intuitionistic logic. We solve this problem by presenting the first sound and complete proof system for first-order bi-intuitionistic logic with increasing domains. We formalize our proof system as a polytree sequent calculus (a notational variant of nested sequents), and prove that it enjoys cut-elimination and is conservative over first-order intuitionistic logic. A key feature of our calculus is an explicit eigenvariable context, which allows us to control precisely the scope of free variables in a polytree structure. Semantically this context can be seen as encoding a notion of Scott’s existence predicate for intuitionistic logic. This turns out to be crucial to avoid the collapse of domains and to prove the completeness of our proof system. The explicit consideration of the variable context in a formula sheds light on a previously overlooked dependency between the residuation principle and the existence predicate in the first-order setting, which may help to explain the difficulty in designing a sound and complete proof system for first-order bi-intuitionistic logic.

Keywords and phrases:
Bi-intuitionistic, Cut-elimination, Conservativity, Domain, First-order, Polytree, Proof theory, Reachability, Sequent
Funding:
Tim S. Lyon: European Research Council, Consolidator Grant DeciGUT (771779)
Copyright and License:
[Uncaptioned image] © Tim S. Lyon, Ian Shillito, and Alwen Tiu; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Proof theory
; Theory of computation Modal and temporal logics ; Theory of computation Constructive mathematics ; Theory of computation Automated reasoning
Related Version:
Extended Version: https://arxiv.org/abs/2404.15855 [26]
Editors:
Jörg Endrullis and Sylvain Schmitz

1 Introduction

Propositional bi-intuitionistic logic (𝖡𝖨𝖯), also referred to as Heyting-Brouwer logic [33], is a conservative extension of propositional intuitionistic logic (𝖨𝖯), obtained by adding the binary connective [Uncaptioned image] (referred to as exclusion)111 Also referred to as pseudo-difference [33], subtraction, and co-implication [13]. among the traditional intuitionistic connectives. This logic has proven relevant in computer science, having a formulae-as-types interpretation in terms of first-class coroutines [7] and where modal extensions have found import in image processing [38]. While in intuitionistic logic the connectives and form a residuated pair, i.e. (φψ)χ is valid iff φ(ψχ) is valid iff ψ(φχ) is valid, in bi-intuitionistic logic the connectives and [Uncaptioned image] also form a residuated pair, i.e. φ(ψχ) is valid iff (φ[Uncaptioned image]ψ)χ is valid iff (φ[Uncaptioned image]χ)ψ is valid.222However, they are not logically equivalent, e.g., [φ(ψχ)][(φ[Uncaptioned image]ψ)χ] is not valid. To put it succinctly, 𝖡𝖨𝖯 is a bi-intuitionistic extension of 𝖨𝖯 that is (1) conservative and (2) has the residuation property, i.e. (,) and ([Uncaptioned image],) form residuated pairs.

When extending first-order intuitionistic logic (𝖨𝖰) to its bi-intuitionistic counterpart, a “natural” axiomatization seems to be one obtained by adding the universal axioms (Ax1) xφφ(t/x), (Ax2) x(ψφ)(ψxφ) (where x is not free in ψ), and the rule (Gen) φ/xφ to the axioms of 𝖡𝖨𝖯. This extension, which we refer to as the logic 𝖡𝖨𝖰(𝒞𝒟), turns out not to be conservative over first-order intuitionistic logic 𝖨𝖰, as it allows one to prove the quantifier shift axiom x(φψ)xφψ (where x is not free in ψ), which is not valid intuitionistically. A proof of the quantifier shift axiom is given below, where MP stands for modus ponens, Res stands for the residuation property described above, and δ:=x((x(φψ)[Uncaptioned image]ψ)φ)((x(φψ)[Uncaptioned image]ψ)xφ).

It is well-known that the quantifier shift axiom characterizes the class of first-order intuitionistic Kripke models with constant domains [10, 16], thus forcing the models for 𝖡𝖨𝖰(𝒞𝒟) to satisfy this constraint. Indeed, various works in the literature (e.g., [32, 34]) have shown that completeness for 𝖡𝖨𝖰(𝒞𝒟) requires the domain to be constant. These works and the above example strongly suggest that it might not be possible to have a proof system for a bi-intuitionistic logic with non-constant domains, at least not as a traditional Hilbert system. As far as we know, there is no prior successful attempt at solving this problem.

In this paper, we provide the first sound and complete proof system for first-order bi-intuitionistic logic with increasing domains, which we refer to here as 𝖡𝖨𝖰(𝒟). With some minor modifications, the proof system for 𝖡𝖨𝖰(𝒟) can be converted into a proof system for 𝖡𝖨𝖰(𝒞𝒟). A key insight in avoiding the collapse of domains in 𝖡𝖨𝖰(𝒟) is to consider the universal quantifier as implicitly carrying an assumption about the existence of the quantified variable. Proof theoretically, this could be done by introducing a notion of an existence predicate, first studied by Scott [35]. An existence predicate such as E(x) postulates that x exists in the domain under consideration. By insisting that all universally quantified variables be guarded by an existence predicate, i.e. universally quantified formulae would have the form x(E(x)φ(x)), the quantifier shift axiom can be rewritten as: x(E(x)(φψ))(x(E(x)φ)ψ). Attempting a bottom-up construction of a derivation similar to our earlier example for this rewritten axiom, we get stuck at the the top-most residuation rule, which is in fact not a valid instance of Res:

For the proof construction to proceed, we would have to somehow discharge the assumption E(x) in the premise of Gen before applying the residuation rule. In the logic of constant domains 𝖡𝖨𝖰(𝒞𝒟), E(x) is equivalent to (i.e. the interpretation of any term in the logic is an object that exists in all worlds in the underlying Kripke model). So the version of the quantifier shift axiom with the existence predicate is provably equivalent to the original one in 𝖡𝖨𝖰(𝒞𝒟). This is not the case, however, in the logic of increasing domains 𝖡𝖨𝖰(𝒟), since the assumption E(x) cannot always be discharged. What this example highlights is that a typical proof-theoretical argument used to show the provability of the quantifier shift axiom (and hence the collapse of domains) implicitly depends on an existence assumption on objects in the domains in the underlying Kripke model. What we show here is that by making this dependency explicit and by carefully managing the use of the existence assumptions in proofs, we are able to obtain a sound and complete proof system for 𝖡𝖨𝖰(𝒟).

One issue with the existence predicate is that it is not clear how it should interact with the exclusion operator. Semantically, a formula like x[E(x)((p(x)[Uncaptioned image]y(E(y)p(y))))] asserts that, if an object x exists in the current domain, then postulating that p(x) holds in a predecessor world should imply that x exists as well in that predecessor world. This is valid in our semantics, but it was not at all obvious how a proof system that admits this tautology, and does not also degenerate into a logic with constant domains, should be designed. We shall come back to this example later in Section 3. Additionally, the existence predicate poses a problem when proving conservativity over first-order intuitionistic logic that does not feature this predicate. We overcome this remaining hurdle by enriching sequents with an explicit variable context, which can be seen as essentially encoding the existence predicate, while avoiding introducing it explicitly in the language of formulae.

The proof systems for 𝖡𝖨𝖰(𝒟) and 𝖡𝖨𝖰(𝒞𝒟) are both formalized using polytree sequents [5], which are connected binary graphs whose vertices are traditional Gentzen sequents and which are free of (un)directed cycles. Polytree sequents are a restriction of traditional labeled sequents [37, 41] and are notational variants of nested sequents [3, 18, 2]. (NB. For details on the relationship between polytree and nested sequents, see [5].) Nested sequents were introduced independently by Bull [3] and Kashima [18] and employ trees of Gentzen sequents in proofs. Both polytree sequents and nested sequents allow for simple formulations of proof systems for various non-classical logics that enjoy important proof theoretical properties such as cut-elimination and subformula properties. Such systems have also found a range of applications, being used in knowledge integration algorithms [24], serving as a basis for constructive interpolation and decidability techniques [21, 25, 40], and even being used to solve open questions about axiomatizability [17]. We make use of polytree sequents in our work as they admit a formula interpretation (at least in the intuitionistic case), which can be leveraged for direct translations of proofs into sequent calculus or Hilbert calculus proofs.

The calculi for 𝖡𝖨𝖰(𝒟) and 𝖡𝖨𝖰(𝒞𝒟) are based on these richly structured sequents, which internalize the existence predicate into syntactic components, called domain atoms, present in each node of the sequent. The rich structure of these sequents is exploited by special rules within our calculi called reachability rules, which traverse paths in a polytree sequent, propagating and/or consuming data. We demonstrate that our calculi enjoy the height-preserving invertibility of every rule, and show that a wide range of novel and useful structural rules are height-preserving admissible, culminating in a non-trivial proof of cut-elimination.

Outline of Paper.

In Section 2, we define a semantics for first-order bi-intuitionistic logic with increasing domains 𝖡𝖨𝖰(𝒟) and constant domains 𝖡𝖨𝖰(𝒞𝒟). In Section 3, we define our polytree sequent calculi showing them sound and complete relative to the provided semantics. In Section 4, we establish admissibility and invertibility results as well as prove a non-trivial cut-elimination theorem. We conclude and discuss future work in Section 5. Due to space constraints, most proofs have been deferred to the online appended version [26].

2 Logical Preliminaries

In this section, we introduce the language, models, and semantics for first-order bi-intuitionistic logic with increasing domains, dubbed 𝖡𝖨𝖰(𝒟), and with constant domains, dubbed 𝖡𝖨𝖰(𝒞𝒟). Let Var:={x,y,z,} be a countably infinite set of variables and Fun={f,g,h,} be a countably infinite set of function symbols containing countably many function symbols of each arity n. We let ar(f)=n denote that the arity of the function symbol f is n and let a,b,c, denote constants, which are function symbols of arity 0. For a set XVar, we define the set Ter(X) of X-terms to be the smallest set satisfying the following two constraints: (1) XTer(X), and (2) if fFun, f is of arity n, and t1,,tnTer(X), then f(t1,,tn)Ter(X). The complete set of terms Ter is defined to be Ter(Var). We use t, s, (potentially annotated) to denote (X-)terms and let VT(t) denote the set of variables occurring in the term t. We will often write a list t1,,tn of terms as t, and define VT(t)=VT(t1)VT(tn).

We let Pred:={p,q,} be a countably infinite set of predicates containing countably many predicates of each arity n. We denote the arity of a predicate p as ar(p) and refer to predicates of arity 0 as propositional atoms. An atomic formula is a formula of the form p(t1,,tn), obtained by prefixing a predicate p of arity ar(p)=n to a tuple of terms of length n. We will often write atomic formulae p(t1,,tn) as p(t).

Definition 1 (The Language ).

The language is defined to be the set of formulae generated via the following grammar in Backus-Naur form:

φ::=p(t)|||φφ|φφ|φ[Uncaptioned image]φ|φφ|xφ|xφ

where p ranges over Pred, the terms t=t1,,tn range over Ter, and x ranges over the set Var. We use φ, ψ, χ, to denote formulae.

The occurrence of a variable x in φ is defined to be free given that x does not occur within the scope of a quantifier binding x. We let FV(φ) denote the set of all free variables occurring in the formula φ and use φ(x1,,xn) to denote that FV(φ)={x1,,xn}. We let φ(t/x) denote the formula obtained by replacing each free occurrence of the variable x in φ by t, potentially renaming bound variables to avoid unwanted variable capture; e.g. (yp(x,y))(y/x)=zp(y,z). The complexity of a formula φ, written |φ|, is recursively defined as follows: (1) |p(t1,,tn)|=||=||:=0, (2) |Qxφ|:=|φ|+1 for Q{,}, and (3) |φψ|:=|φ|+|ψ|+1 for {,,,[Uncaptioned image]}.

Following [32], we give a Kripke-style semantics for 𝖡𝖨𝖰(𝒟), defining the models used first, and explaining how formulae are evaluated over them second.

Definition 2 (ID-Frame).

An ID-frame (or, frame) is a tuple F=(W,,U,D) such that:

  • W is a non-empty set {w,u,v,} of worlds;

  • W×W is a reflexive and transitive binary relation;

  • U is a non-empty set referred to as the universe;

  • D:W𝒫(U) is a domain function mapping each wW to a non-empty set D(w)U with U=wWD(w), which satisfies the increasing domain condition: (ID) If wu, then D(w)D(u).

Definition 3 (ID-Model).

We define an ID-Model (or, model) M to be an ordered triple (F,I1,I2) such that:

  • F=(W,,U,D) is a frame;

  • I1 is a function interpreting each function symbol fFun such that ar(f)=n by a function I1(f):UnU, satisfying two conditions: (C1) For each wW and constant a, I1(a)D(w), and (C2) For each wW, aD(w)n iff I1(f)(a)D(w).

  • I2 is a function interpreting, in each wW, each predicate pPred such that ar(p)=n by a set I2(w,p)D(w)n, satisfying the following monotonicity condition: (M) If wu, then I2(w,p)I2(u,p).

Definition 4 (M-assignment).

Let M=(F,I1,I2) be a model. We define an M-assignment to be a function α:VarU. We note α[a/x] is the function α modified in x such that α[a/x](x)=a and α[a/x](y)=α(y) if yx. Given an M-assignment α, we define the interpretation of t in M given α, denoted α¯(t), inductively as follows: α¯(x):=α(x) and α¯(f(t1,,tn)):=I1(f)(α¯(t1),,α¯(tn)).

Definition 5 (Semantics).

Let M=(W,,U,D,I1,I2) be a model with wW and α an M-assignment. The satisfaction relation is defined as follows:

  • M,w,αp(t1,,tn) iff (α¯(t1),,α¯(tn))I2(w,p);

  • M,w,α⊮;

  • M,w,α;

  • M,w,αφψ iff M,w,αφ or M,w,αψ;

  • M,w,αφψ iff M,w,αφ and M,w,αψ;

  • M,w,αφ[Uncaptioned image]ψ iff there exists a uW such that uw, M,u,αφ, and M,u,α⊮ψ;

  • M,w,αφψ iff for all uW, if wu and M,u,αφ, then M,u,αψ;

  • M,w,αxφ iff there exists an aD(w) such that M,w,α[a/x]φ;

  • M,w,αxφ iff for all uW and all aD(u), if wu, then M,u,α[a/x]φ.

For a set Γ of formulae, we write Γφ iff for all models M, M-assignments α, and worlds w in M, if M,w,αψ for each ψΓ, then M,w,αφ. A formula φ is valid iff φ. Finally, we define the logic 𝖡𝖨𝖰(𝒟) to be the set {φ|φ} of all valid formulae.

Note that here we define logics as sets of formulae, and not consequence relations. While this is fit for our purpose, the reader should be warned that historical confusions emerged around this distinction in the case of propositional bi-intuitionistic logic [15, 36], notably pertaining to the deduction theorem.

Proposition 6.

Let M=(W,,U,D,I1,I2) be a model with α an M-assignment. For any φ, if M,w,αφ and wu, then M,u,αφ.

 Remark 7.

We define a CD-model to be a model satisfying the constant domain condition: (CD) If w,uW, then D(w)=D(u). If we impose the (CD) condition on models, then first-order bi-intuitionistic logic with constant domains, dubbed 𝖡𝖨𝖰(𝒞𝒟), can be defined as the set of all valid formulae over the class of CD-models. In what follows, we let 𝒟 denote the class of ID-models and 𝒞𝒟 denote the class of CD-models.

Example 8.

Consider the formula x((p(x)[Uncaptioned image]yp(y))), discussed in the introduction, but with the existence predicate removed. In the semantics with increasing domains, this formula is valid. To see this, suppose otherwise, i.e. that there exists a world w where the formula is false. Thus, there is a successor wu such that α¯(x)D(u) and p(x)[Uncaptioned image]yp(y) is true, for some assignment α. The latter implies that for some u such that uu, p(x) is true (i.e. α(x)IP(u,p)), but yp(y) is false. The former implies that α(x)D(u), so by the semantic clause for the quantifier, yp(y) must be true – contradiction.

3 Polytree Sequent Systems

Let Lab={w,u,v,} be a countably infinite set of labels. For a formula φ and label wLab, we define w:φ to be a labeled formula. We use Γ, Δ, Σ, to denote finite multisets of labeled formulae, and let w:Γ denote a multiset of labeled formulae all labeled with w. A relational atom is an expression of the form wRu and a domain atom is an expression of the form w:x, where w,uLab and xVar. Intuitively, the domain atom formalizes an existence predicate: w:x can be interpreted as saying that the interpretation of x exists at world w. We use and 𝒯 (and annotated versions thereof) to denote finite multisets of, respectively, relational atoms and domain atoms. Also, we define w:VT(t)=w:x1,,w:xn with VT(t)={x1,,xn}, define w:VT(t)=w:VT(t1),,w:VT(tn) with t=t1,,tn, and let w:x=w:x1,,w:xn for x=x1,,xn. For multisets X and Y of labeled formulae, relational atoms, and/or domain atoms, we let X,Y denote the multiset union of X and Y, and Lab(X) denote the set of labels occurring in X.

Definition 9 (Polytree Sequent).

We define a polytree sequent to be an expression of the form ,𝒯,ΓΔ such that (1) if , then Lab(𝒯,Γ,Δ)Lab() and if =, then |Lab(𝒯,Γ,Δ)|=1, and (2) forms a polytree, i.e. the graph G=(V,E) such that V=Lab() and E={(w,u)|wRu} is connected and free of both directed and undirected cycles. We refer to ,𝒯,Γ as the antecedent and Δ as the consequent of a polytree sequent. We will often refer to polytree sequents more simply as sequents.

We sometimes use S, S0, S1, to denote sequents, and for S=,𝒯,ΓΔ, we define Lab(S)=Lab(,𝒯,Γ,Δ). A flat sequent is an expression of the form 𝒯,ΓΔ such that |Lab(𝒯,Γ,Δ)|=1, i.e. all labeled formulae and domain atoms share the same label. Polytree sequents encode certain binary graphs whose nodes are flat sequents and such that if you ignore the orientation of the edges, the graph is a tree (cf. [5]). For example, the sequent

S=uRw,uRw,wRv,u:x,u:x,u:y,w:z,v:y,𝒯w:φ,w:ψ,v:θΓu:τ,u:χ,v:ξΔ

can be graphically depicted as the polytree pt(S), shown below:

 Remark 10.

To simplify the proofs of our results in Section 4, we assume w.l.o.g. that sequents with isomorphic polytree representations are mutually derivable from one another.

3.1 Semantics and Proof Systems

The following definition specifies how to interpret sequents. In essence, we lift the semantics of to sequents by means of “M-interpretations”, mapping sequents into models.

Definition 11 (Sequent Semantics).

Let M=(W,,U,D,I1,I2) be a model and α an M-assignment. We define an M-interpretation to be a function ι mapping every label wLab to a world ι(w)W. The satisfaction of multisets , 𝒯, and Γ are defined accordingly:

  • M,ι,α iff for all wRu, ι(w)ι(u);

  • M,ι,α𝒯 iff for all w:x𝒯, α¯(x)D(ι(w));

  • M,ι,αΓ iff for all w:φΓ, M,ι(w),αφ.

We define a sequent S=,𝒯,ΓΔ to be satisfied on M with ι and α, written M,ι,αS, iff if M,ι,α, and M,ι,α𝒯, as well as M,ι,αΓ, then there exists a w:ψΔ such that M,ι,αw:ψ. We write M,ι,α⊧̸S when a sequent S is not satisfied on M with ι and α. A sequent S is defined to be valid iff for every model M, every M-interpretation ι, and every M-assignment α, we have M,ι,αS; otherwise, we say that S is invalid and write M,ι,α⊧̸S.

Figure 1: The System 𝖫𝖡𝖨𝖰(𝒟).

Given a sequent S=,𝒯,ΓΔ, we define the term substitution S(t/x) to be the sequent obtained by replacing (1) every labeled formula w:φ in Γ,Δ by w:φ(t/x) and (2) 𝒯 by 𝒯(t/x):=(𝒯{w:xw:x𝒯}){w:yw:x𝒯 and yVT(t)}. For example, if S=wRu,w:x,u:x,u:y,w:p(x)u:yq(x,y), then

S(f(y,z)/x)=wRu,w:y,w:z,u:y,u:z,u:y,w:p(f(y,z))u:xq(f(y,z),x)

where the bound variable y in yq(x,y) was renamed to x to avoid capture. We now define two reachability relations + and as well as the notion of availability [9, 22] – all of which are required to properly formulate certain inference rules in our calculi.

Definition 12 (+, ).

Let be a finite multiset of relational atoms such that w,uLab(). We say that u is strictly reachable from w, written w+u, iff there exist v1,,vnLab() such that wRv1,,vnRu with n. We say that u is reachable from w, written wu, iff w+u or w=u. We write w↠̸u if wu does not hold.

Definition 13 (Available).

Let S=,𝒯,ΓΔ be a sequent with wLab(S). We define a term t to be available for w in ,𝒯, written 𝖠(t,Xw,,𝒯), iff tTer(Xw) such that

Xw={x|u:x𝒯 and uw for some uLab(S)}.

Our polytree calculus 𝖫𝖡𝖨𝖰(𝒟) for 𝖡𝖨𝖰(𝒟) is shown in Figure 1. The (ax), (L), and (R) rules serve as initial rules, the domain shift rule (ds) encodes the fact that I2(p,w)D(w)n in any model. We define the principal formula in an inference rule to be the one explicitly mentioned in the conclusion, the auxiliary formulae to be the non-principal formulae explicitly mentioned in the premises, and an active formula to be either a principal or auxiliary formula. For example, w:xφ is principal, w:φ(t/x) is auxiliary, and both are active in (R). Note that all rules of our calculus preserve the property of being a polytree-structured sequent. We define a proof and its height as usual [39]. Two unique features of our calculi are the inclusion of reachability rules and the domain shift rule (ds), which we elaborate on next.

3.2 Reachability Rules

A unique feature of our calculi is the inclusion of reachability rules (introduced in [20]), a generalization of propagation rules (cf. [4, 8, 14]), which are not only permitted to propagate formulae throughout a polytree sequent when applied bottom-up, but may also check to see if data exists along certain paths. The rules (ax), (L), ([Uncaptioned image]R), (R), and (L) serve as our reachability rules. The side conditions of our reachability rules are listed at the bottom of Figure 1. Moreover, we define a label u or a variable y to be fresh in a rule application (as in the (L) and (R) rules) iff it does not occur in the conclusion of the rule.

 Remark 14.

If we set 4 := “tTer”, 5 := “wu and tTer”, and remove the (ds) rule, then we obtain a polytree calculus, dubbed 𝖫𝖡𝖨𝖰(𝒞𝒟), for the constant domain version of the logic 𝖡𝖨𝖰(𝒞𝒟). We also note that in the constant domain setting, domain atoms are unnecessary and can be omitted from sequents.

Figure 2: An example proof in 𝖫𝖡𝖨𝖰(𝒞𝒟) for bi-intuitionistic logic with constant domains.

To provide intuition, we give an example showing the operation of a reachability rule.

Example 15.

Let S=,𝒯,ΓΔ such that =uRw,wRv, 𝒯=w:x,u:y,v:z, Γ=w:xp(x),w:p(f(y)),w:p(z), and Δ=u:q(x)[Uncaptioned image]q(x),v:r(y). A representation of S as a polytree is shown below. We explain (in)valid applications of the (L) reachability rule.

The term f(y) is available for w in S since uw, namely there is an edge from u to w, and f(y)Ter(Xw) since Xw={x,y}. Therefore, we may (top-down) apply the (L) rule to delete w:p(f(y)) and derive the sequent S=,𝒯,ΓΔ with Γ=w:xp(x),w:p(z). By contrast, w:p(z) cannot be deleted via an application of (L) because the term z is not available for w in S (observe that w is not reachable from v) meaning zTer(Xw).

 Remark 16.

We note that for any set XVar, Ter(X) since all constants are contained in Ter(X) by definition. This means that bottom-up applications of (R) and (L) may instantiate existential and universal formulae with any constant.

The reachability rules (ax), (L) and ([Uncaptioned image]R) are important to ensure completeness for both 𝖫𝖡𝖨𝖰(𝒞𝒟) and 𝖫𝖡𝖨𝖰(𝒟). The reachability rules for (R) and (L) are relevant only for 𝖫𝖡𝖨𝖰(𝒟) to ensure that the domains in the model do not collapse into a constant domain. We illustrate the importance of these reachability rules with a couple of examples.

Example 17 (An Intuitionistic Formula Valid in Constant Domain Models).

Consider the intuitionistic formula x(pr(x))(p(qxr(x))). This formula was adapted from an example in [27], which was used to illustrate the difficulty of obtaining a sound and complete sequent system for intuitionistic logic with constant domains. A proof of this formula in 𝖫𝖡𝖨𝖰(𝒞𝒟) is shown in Figure 2 and crucially relies on reachability rules. In the figure, the relational atoms =wRu,uRv,vRw in the instances of (ax) allow us to conclude that uu and uw, justifying the left and right instances of (ax), respectively.

Example 18 (Non-Provability of the Quantifier Shift Axiom in the Increasing Domain Setting).

Let us consider again the quantifier shift axiom x(φψ)(xφψ) and an attempt to construct a proof (bottom-up) of one of its instances in 𝖫𝖡𝖨𝖰(𝒟).

It is obvious that to finish this proof, we would need to instantiate the x quantifier in the labeled formula u:x(p(x)q) with x by applying the (L) rule. However, to do so, we would need to demonstrate that the world u is reachable from v where the domain atom v:x resides. Yet, u is not reachable from v, so x is not available at u to be used by (L).

3.3 The Domain Shift Rule (𝒅𝒔)

Although the reachability rules for the quantifiers prevent the quantifier shift axiom from being proved, it turns out that they are not sufficient to ensure the completeness of 𝖫𝖡𝖨𝖰(𝒟) with respect to the sequent semantics for the logic 𝖡𝖨𝖰(𝒟). Interestingly, this incompleteness only arises when the exclusion connective is involved – if one considers the intuitionistic fragment of 𝖫𝖡𝖨𝖰(𝒟), these reachability rules are sufficient to prove completeness (see Lemma 26 in Section 3.5). To see this incompleteness issue, consider the formula in Example 8, which is semantically valid, and the following attempt at a (bottom-up) construction of a proof:

We have so far applied only invertible rules, so the original sequent is provable iff the top sequent in the above derivation also is. To proceed with the proof construction, one needs to instantiate the existential quantifier y with x. However, the only domain atom containing x is located at the world u, which is not available to u where the existential formula is located.

It is not so obvious how the reachability rules for quantifiers could be amended to allow this example to be proved. Looking at the above derivation, it might be tempting to augment the calculus with a rule that allows a backward reachability condition for domain atoms, e.g., making u:x available to u for when uu under certain admissibility conditions, but this could easily lead to a collapse of the domains if one is not careful. Instead, our approach here is motivated by the semantic clause for predicates: when p(x) holds in a world, its interpretation requires that x is also defined in that world. Proof theoretically, we could think of this as postulating an axiom such as x(p(x)E(x)) where E(x) is an existence predicate (which, as we recall, was behind the semantics of the domain atoms). Translated into our calculus, this gives us the (ds) rule as shown in Figure 1. Using the (ds) rule, the above derivation can now be completed to a proof:

Note that the (ds) rule can only be applied to atomic predicates, but not arbitrary formulae, which rules out unsound instances. It may be possible to relax the restriction to atomic predicates by imposing some positivity conditions on the occurrences of x, but we did not find this necessary – neither for completeness, nor for cut-elimination.

 Remark 19.

The (ds) rule can be removed without affecting the cut-elimination result for 𝖫𝖡𝖨𝖰(𝒟). This raises the possibility of defining a first-order bi-intuitionistic logic strictly weaker than 𝖡𝖨𝖰(𝒟). It is unclear what the semantics for such a logic would look like.

3.4 Soundness and Completeness

Theorem 20 (Soundness).

Let S be a sequent. If S is provable in 𝖫𝖡𝖨𝖰(𝒟) (𝖫𝖡𝖨𝖰(𝒞𝒟)), then S is (CD-)valid.

Proof.

By induction on the height of the given proof; see Appendix A for details.

The completeness of our polytree calculi (see Theorem 22 below) is shown by taking a sequent of the form w:xw:φ(x) as input and showing that if the sequent is not provable, then the calculus can be used to construct an infinite derivation from which a counter-model of the end sequent can be extracted. We note that completeness only holds relative to sequents of the form w:xw:φ(x), which includes a domain atom for each free variable in φ(x). This restriction is needed because quantifier rules can only (bottom-up) instantiate quantified formulae with the free variables x of φ(x) if such free variables occur as domain atoms, and such free variables must be accessible to quantifier rules to properly extract a counter-model of the end sequent (see [23] for a relevant discussion).

Below, we outline the cut-free completeness proof for 𝖫𝖡𝖨𝖰(𝒟) as the proof for 𝖫𝖡𝖨𝖰(𝒞𝒟) is similar; the complete proof can be found in the online, appended version [26]. Our proof outline makes use of various new notions, which we now define. A pseudo-derivation is defined to be a (potentially infinite) tree whose nodes are sequents and where every parent node corresponds to the conclusion of a rule in 𝖫𝖡𝖨𝖰(𝒟) with the children nodes corresponding to the premises. We remark that a proof in 𝖫𝖡𝖨𝖰(𝒟) is a finite pseudo-derivation where all top sequents are instances of (ax), (L), or (R). A branch is defined to be a maximal path of sequents through a pseudo-derivation, starting from the conclusion. The following lemma is useful in proving completeness.

Lemma 21.

Let 𝒞{𝒟,𝒞𝒟}. For each i{0,1,2}, let Si=i,𝒯i,ΓiΔi be a sequent.

  1. 1.

    If wu holds for the conclusion of a rule (r) in 𝖫𝖡𝖨𝖰(𝒞), then wu holds for the premises of (r);

  2. 2.

    If w:p(t)Γ0,Δ0 and S0 is the conclusion of a rule (r) in 𝖫𝖡𝖨𝖰(𝒞) with S1 (and S2) the premise(s) of (r), then w:p(t)Γ1,Δ1 (and w:p(t)Γ2,Δ2, resp.);

  3. 3.

    If w:x𝒯0 and S0 is the conclusion of a rule (r) in 𝖫𝖡𝖨𝖰(𝒞) with S1 (and S2) the premise(s) of (r), then w:x𝒯1 (and w:x𝒯2, resp.).

The lemma tells us that propagation paths, the position of atomic formulae, and the position of terms are bottom-up preserved in rule applications.

Theorem 22 (Completeness).

If w:xw:φ(x) is (CD-)valid, then w:xw:φ(x) is provable in 𝖫𝖡𝖨𝖰(𝒟) (𝖫𝖡𝖨𝖰(𝒞𝒟)).

Proof (Outline).

We assume that S=w:xw:φ(x) is not provable in 𝖫𝖡𝖨𝖰(𝒟) and show that a model M can be defined which witnesses that S is invalid. To prove this, we first define a proof-search procedure 𝙿𝚛𝚘𝚟𝚎 that bottom-up applies rules from 𝖫𝖡𝖨𝖰(𝒟) to w:xw:φ(x). Second, we show how a model M can be extracted from failed proof-search. We now describe the proof-search procedure 𝙿𝚛𝚘𝚟𝚎 and let be a well-founded, strict linear order over the set Ter of terms.

𝙿𝚛𝚘𝚟𝚎.

Let us take w:xw:φ(x) as input and continue to the next step. We show some key steps; the complete 𝙿𝚛𝚘𝚟𝚎 procedure can be found in the online appended version [26].

(𝒂𝒙), (𝐋), and (𝐑).

Suppose 1,,n are all branches occurring in the current pseudo-derivation and let S1,,Sn be the top sequents of each respective branch. For each 1in, we halt the computation of 𝙿𝚛𝚘𝚟𝚎 on each branch i where Si is of the form (ax), (L), or (R). If 𝙿𝚛𝚘𝚟𝚎 is halted on each branch i, then 𝙿𝚛𝚘𝚟𝚎 returns 𝚃𝚛𝚞𝚎 because a proof of the input has been constructed. However, if 𝙿𝚛𝚘𝚟𝚎 did not halt on each branch i with 1in, then let j1,,jk be the remaining branches for which 𝙿𝚛𝚘𝚟𝚎 did not halt. For each such branch, copy the top sequent above itself, and continue to the next step.

(𝒅𝒔).

Suppose 1,,n are all branches occurring in the current pseudo-derivation and let S1,,Sn be the top sequents of each respective branch. For each 1in, we consider i and extend the branch with bottom-up applications of (ds) rules. Let k+1 be the current branch under consideration, and assume that 1,,k have already been considered. We assume that the top sequent in k+1 is of the form

Sk+1=,𝒯,Γ,w:p1(t1),,w:p(t)Δ

where all atomic input formulae are displayed in Sk+1 above. We successively consider each atomic input formula and bottom-up apply (ds), yielding a branch extending k+1 with a top sequent saturated under (ds) applications. After these operations have been performed for each branch i with 1in, we continue to the next step.

(𝐋).

Suppose 1,,n are all branches occurring in the current pseudo-derivation and let S1,,Sn be the top sequents of each respective branch. For each 1in, we consider i and extend the branch with bottom-up applications of (L) rules. Let k+1 be the current branch under consideration, and assume that 1,,k have already been considered. We assume that the top sequent in k+1 is of the form

Sk+1=,𝒯,Γ,w1:x1φ1,,wm:xmφmΔ

where all existential input formulae wi:xiφi are displayed in Sk+1 above. We consider each formula wi:xiφi in turn, and bottom-up apply the (L) rule. These rule applications extend k+1 such that

,𝒯,Γ,w1:φ1(y1/x1),,wn:φm(ym/xm)Δ

is now the top sequent of the branch with y1,,ym fresh variables and 𝒯=𝒯,w1:y1,,wm:ym. After these operations have been performed for each branch i with 1in, we continue to the next step.

(𝐑).

Suppose 1,,n are all branches occurring in the current pseudo-derivation and let S1,,Sn be the top sequents of each respective branch. For each 1in, we consider i and extend the branch with bottom-up applications of (R) rules. Let k+1 be the current branch under consideration, and assume that 1,,k have already been considered. We assume that the top sequent in k+1 is of the form

Sk+1=,𝒯,Γw1:x1φ1,,wm:xmφm,Δ

where all existential formulae wi:xiφi are displayed in Sk+1 above. We consider each labeled formula wm:xmφi in turn, and bottom-up apply the (R) rule. Let w+1:x+1φ+1 be the current formula under consideration, and assume that w1:x1φ1,,w:xφ have already been considered. Recall that is a well-founded, strict linear order over the set Ter of terms. Choose the -minimal term tTer(Xw+1) that has yet to be picked to instantiate w+1:x+1φ+1 and bottom-up apply the (R) rule, thus adding w+1:φ+1(t/x+1). We perform these operations for each branch i with 1in.

The remaining rules of 𝖫𝖡𝖨𝖰(𝒟) are processed in a similar fashion. The 𝙿𝚛𝚘𝚟𝚎 procedure will saturate open branches of the pseudo-derivation that is under construction by repeatedly (bottom-up) applying rules from 𝖫𝖡𝖨𝖰(𝒟) in a roundabout fashion.

Next, we aim to show that if 𝙿𝚛𝚘𝚟𝚎 does not return 𝚃𝚛𝚞𝚎, then a model M, M-interpretation ι, and M-assignment α can be defined such that M,ι,α⊧̸S. If 𝙿𝚛𝚘𝚟𝚎 halts, i.e. 𝙿𝚛𝚘𝚟𝚎 returns 𝚃𝚛𝚞𝚎, then a proof of S may be obtained by “contracting” all redundant inferences from the “(ax), (L), and (R)” step of 𝙿𝚛𝚘𝚟𝚎. Therefore, in this case, since a proof exists, we have obtained a contradiction to our assumption. As a consequence, we have that 𝙿𝚛𝚘𝚟𝚎 does not halt, that is, 𝙿𝚛𝚘𝚟𝚎 generates an infinite tree with finite branching. By König’s lemma, an infinite branch must exist in this infinite tree, which we denote by . We define a model M=(W,,U,D,I1,I2) by means of this branch as follows: Let us define the following sets, all of which are obtained by taking the union of each multiset of relational atoms, domain atoms, antecedent labeled formulae, and consequent labeled formulae (resp.) occurring within a sequent in :

=(,𝒯,ΓΔ)𝒯=(,𝒯,ΓΔ)𝒯Γ=(,𝒯,ΓΔ)ΓΔ=(,𝒯,ΓΔ)Δ

We now define: (1) uW iff uLab(,𝒯,Γ,Δ), (2) ={(u,v)|uRv} where denotes the reflexive-transitive closure, (3) tU iff there exists a label uLab(,𝒯,Γ,Δ) such that tTer(Xu), (4) tD(u) iff tTer(Xu), and (5) (t1,,tn)I2(u,p) iff v,uLab(,𝒯,Γ,Δ), vu, and v:p(t1,,tn)Γ.

It can be shown that M is indeed a model. Let us define α to be the M-assignment mapping every variable in U to itself and every variable in VarU arbitrarily. To finish the proof of completeness, we now argue the following by mutual induction on the complexity of the formula ψ: (1) if u:ψΓ, then M,u,αψ, and (2) if u:ψΔ, then M,u,α⊮ψ. Let ι to be the M-interpretation such that ι(u)=u for uW and ι(v)W for vW. By the proof above, M,ι,α⊧̸w:xw:φ(x), showing that if a sequent of the form w:xw:φ(x) is not provable in 𝖫𝖡𝖨𝖰(𝒟), then it is invalid, that is, every valid sequent of the form w:xw:φ(x) is provable in 𝖫𝖡𝖨𝖰(𝒟).

 Remark 23.

We remark that cut admissibility follows from the soundness of the (cut) rule (see Figure 3) and the completeness theorem above. However, this method of proof has two downsides: first, the restriction in the completeness theorem above implies that cut admissibility only holds for proofs with an end sequent of the form w:xw:φ(x). Second, this (semantic) method of proof does not define an algorithm showing how instances of (cut) can be permuted upward and eliminated from a given proof. In Section 4, we will prove that cut admissibility holds for all proofs and will provide such an algorithm (see Theorem 30).

3.5 Intuitionistic Subsystems

We end this section by discussing two subsystems of 𝖫𝖡𝖨𝖰(𝒟) and 𝖫𝖡𝖨𝖰(𝒞𝒟) arising from restricting the connectives to the intuitionistic fragment. In the former case, we obtain a proof system for the usual first-order intuitionistic logic (with non-constant domains) 𝖨𝖰, and in the latter, we obtain a proof system for intuitionistic logic with constant domains 𝖨𝖰𝖢.

Corollary 24 (Conservativity).

Let φ be an intuitionistic formula (i.e. a formula with no occurrences of [Uncaptioned image]). Then, φ is valid in 𝖨𝖰 (𝖨𝖰𝖢) iff w:φ is provable in 𝖫𝖡𝖨𝖰(𝒟) (respectively, 𝖫𝖡𝖨𝖰(𝒞𝒟)).

The proof of Corollary 24 is straightforward from Definition 11. We show here a stronger proof-theoretic conservativity result: we can in fact extract a purely intuitionistic fragment out of 𝖫𝖡𝖨𝖰(𝒟), where every sequent in the fragment is interpretable in the semantics without the existence predicate. We prove this via syntactic means, by showing how we can translate intuitionistic proofs in 𝖫𝖡𝖨𝖰(𝒟) to proofs in Gentzen’s 𝖫𝖩 [11, 12]. A key idea is to first define a formula interpretation of a polytree sequent, and then show that every inference rule corresponds to a valid implication in 𝖫𝖩. We start by defining a notion of intuitionistic (polytree) sequent.

Definition 25.

A sequent S=,𝒯,ΓΔ is an intuitionistic sequent iff is a tree rooted at node u such that

  • every formula in S is an intuitionistic formula (i.e. it contains no occurrences of [Uncaptioned image]),

  • for every labeled formula w:φ in S and variable xVT(φ), x is available for w, and

  • if w:x and z:x are in 𝒯, then w=z.

By 𝖭𝖨𝖰(𝒟) we denote the restriction to intuitionistic sequents of the proof system 𝖫𝖡𝖨𝖰(𝒟) without the (ds) rule. The next lemma states an important property of 𝖫𝖡𝖨𝖰(𝒟), called the separation property, which was first discussed in the context of tense logics [14].

Lemma 26 (Separation).

An intuitionistic sequent S is provable in 𝖭𝖨𝖰(𝒟) iff it is provable in 𝖫𝖡𝖨𝖰(𝒟).

Proof (Outline).

One direction, from 𝖭𝖨𝖰(𝒟) to 𝖫𝖡𝖨𝖰(𝒟) is trivial. For the other direction, suppose π is a proof of S in 𝖫𝖡𝖨𝖰(𝒟). By induction on the structure of π, it can be shown that there is a proof π in 𝖫𝖡𝖨𝖰(𝒟) in which every sequent in π is almost intuitionistic – it satisfies all the requirements in Definition 25 except possibly the last condition (due to the possible use of the (ds) rule). Then, from π we can construct another proof π′′ of S that does not use (ds), by showing that one can always permute the rule (ds) up until it disappears. Since all the rules of 𝖫𝖡𝖨𝖰(𝒟), other than (ds), preserve the property of being an intuitionistic sequent, it then follows that π′′ is a proof in 𝖭𝖨𝖰(𝒟).

To translate a proof in 𝖭𝖨𝖰(𝒟) to 𝖫𝖩, we need to interpret a polytree sequent as a formula. This turns out to be somewhat problematic, due to the difficulty in interpreting the scopes of domain atoms, when interpreting them as universally quantified variables. Fortunately, in the case of intuitionistic sequents, the scopes of such variables follow a straightforward lexical scoping (i.e. their scopes are over formulae in the subtrees). To define the translation, we first relax the requirement on the domain atoms in intuitionistic sequents: a quasi-intuitionistic sequent is defined as in Definition 25, except that in the second clause, x is either available for w, or it does not occur in 𝒯. Obviously an intutionistic sequent is also a quasi-intuitionistic sequent. Given a quasi-intuitionistic sequent S and a label w, we write Sw to denote the quasi-intuitionistic sub-sequent of S that is rooted in w, i.e. the sequent obtained from S by removing any relational atoms, domain atoms, and labeled formulae that mention a world v not reachable from w. Given a multiset of labeled formulae Γ, we denote with Γu the labeled formulae in Γ that are labeled with u.

Definition 27.

Let S=,𝒯,ΓΔ be a quasi-intuitionistic sequent. We define its formula interpretation F(S) recursively on the height of the sequent tree and suppose S is rooted at u.

  • If S is a flat sequent, then F(X)=x(ΓΔ) where x are all the variables in 𝒯;

  • otherwise, if u has n successors w1,,wn, then

    F(S)=x(Γu(ΔuF(Sw1)F(Swn))).

The following proof-theoretic conservativity result can then be proved using a standard translation technique for relating nested sequents and traditional Gentzen sequent calculi [6].

Proposition 28.

Let S be an intuitionistic sequent. S is provable in 𝖭𝖨𝖰(𝒟) iff F(S) is provable in 𝖫𝖩.

Proof (Outline).

The proof is tedious, but not difficult and follows a general strategy to translate nested sequent proofs (which, recall, are notational variants of polytree sequent proofs) to traditional sequent proofs (with cuts) from the literature, see e.g., the translation from nested sequent to traditional sequent proofs for full intuitionistic linear logic [6]. For every inference rule in 𝖭𝖨𝖰(𝒟) of the form:

we show that the formula F(S1)F(Sn)F(S) is provable in 𝖫𝖩. Then, given any proof in 𝖭𝖨𝖰(𝒟), we simulate every inference step with its corresponding implication, followed by a cut.

As far as we know, for intuitionistic logic with constant domains 𝖨𝖰𝖢, there is no formalization in the traditional Gentzen sequent calculus that admits cut-elimination. There is, however, a formalization in prefixed tableaux by Fitting [9], which happens to be a syntactic variant of the intuitionistic fragment of 𝖫𝖡𝖨𝖰(𝒞𝒟) (shown in [19]).

4 Cut-Elimination

Figure 3: Admissible rules.

In this section, we show that 𝖫𝖡𝖨𝖰(𝒟) and 𝖫𝖡𝖨𝖰(𝒞𝒟) satisfy a sizable number of favorable properties culminating in syntactic cut-elimination. We explain here some key steps; the full details are available in the online appended version [26].

𝖫𝖡𝖨𝖰(𝒟) and 𝖫𝖡𝖨𝖰(𝒞𝒟) can be seen as first-order extensions of Postniece’s deep-nested sequent calculus for bi-intuitionistic logic DBiInt [31, 13]. Cut-elimination for DBiInt [13] was proven in two stages. First, cut-elimination was proven for a “shallow” version of the nested sequent calculus LBiInt, which can be seen as a variant of a display calculus [1]. The cut-elimination proof for this shallow calculus follows from Belnap’s generic cut-elimination for display calculi [1]. Second, cut-free proofs in the shallow calculus are shown to be translatable to proofs in the deep-nested calculus. We do not have the corresponding shallow versions of 𝖫𝖡𝖨𝖰(𝒟) and 𝖫𝖡𝖨𝖰(𝒞𝒟), so we cannot rely on Belnap’s generic cut-elimination. It may be possible to define shallow versions of our calculi, and then follow the same methodology outlined in [13] to prove cut-elimination, but we find that a direct cut-elimination proof is simpler, e.g., it avoids the need for proving the admissibility of certain structural rules called the display postulates [1], which lets one transition from shallow to deep inference systems.

Since our polytree sequents are a restriction of ordinary labeled sequents, another possible approach to cut-elimination is to apply the methodology for labeled sequent calculi [28]. A main issue in adapting this methodology is ensuring that the proof transformations needed in cut-elimination preserve the polytree structure of sequents. A key proof transformation in a typical cut-elimination proof for labeled calculi is label substitution: given a proof π1 and labels u and w, one constructs another proof π2 by replacing u with w everywhere in π1 and adjusts the inference rules accordingly. This is typically needed in the reduction of a cut where the last rules in both branches of the cut apply to the cut formula, and where one of the rules introduces (reading the rule bottom up) a new label and a new relational atom (e.g., (R)). Such a substitution operation may not preserve polytree structures. Another notable difference between our calculi and traditional labeled calculi is the absence of structural rules manipulating relational atoms. These differences mean that cut-elimination techniques for labeled sequent calculi cannot be immediately applied in our setting.

Instead, our cut-elimination proof builds on an approach by Pinto and Uustalu [29, 30], which deals with a polytree sequent calculus for propositional bi-intuitionistic logic. We thus provide a series of proof transformations, culminating in the elimination of cuts, which shares similarities with their work in the propositional case and expands in the first-order direction. These transformations are captured in proofs of the admissibility of rules shown in Figure 3. We illustrate some key transformations and why they are needed, through an example of a cut where (L) and (R) are applied to the cut formula. The formal details are available in the proof of Theorem 30.

Suppose we have the instance of cut shown below left, where π1 is shown below right and π2 is shown below bottom with wu.

A typical cut reduction strategy would be to cut π1 with π3 and π4 (both with cut formula φψ), producing the proofs π5 and π6 of ,ΓΔ,u:φ and ,Γ,u:ψΔ, respectively. Next, one would cut π5 with π1 (with cut formula φ), producing a proof π7, and then cut π7 with π6 (with cut formula ψ). There are a couple of issues with this strategy: (1) the cut formulae in the last two instances of cut have mismatched labels, i.e., w on one side and u on the other ; (2) the label w and the relational atom wRw are not present in the conclusions of π5 and π6, so the contexts of the premises of the cuts do not match.

To fix these issues, we first need to transform the proof π1 into two proofs π5 and π6, shown below left and right, respectively. As shown in the cut-elimination proof, a transformation that we use in this case is one that is represented by the rule (iw). This ensures that the contexts match the contexts of the concluding sequents in π3 and π4.

We then cut π3 and π4 with π5 and π6, respectively, which yields proofs π7 and π8 of ,Γu:φ,Δ and ,Γ,u:ψΔ, respectively. At this stage, we want to cut π7 with π1, and then cut the resulting proof with π8. However, this cut cannot be performed until the label w and its associated relational atom are removed from the conclusion of π1. Simply substituting u for w may break the polytree shape of the sequent, e.g., if there is a v such that wRv and vRu are in , then replacing u for w in wRw would break the polytree shape of the sequent. So the relational atoms in the sequent also need to be modified. A transformation that we use in this case is represented by the rule (brf), which shifts the relational atom wRw “forward” from w to u given that wu. We also need another transformation to “merge” the label u with the label w, deleting the relational atom in the process, represented as the (mrg) rule.

If we cut the above proof with π8, we obtain a proof of ,𝒯,ΓΔ. Here we gloss over the termination arguments, but the details are available in the proof of Theorem 30.

The above example illustrates one among several proof transformations needed in cut-elimination. These transformations make use of the auxiliary rules in Figure 3. The bulk of the cut-elimination proof consists of showing these rules (height-preserving/hp-) admissible and the rules of 𝖫𝖡𝖨𝖰(𝒟) and 𝖫𝖡𝖨𝖰(𝒞𝒟) height-preserving invertible (i.e., hp-invertible). (NB. We take the notions of (hp-)admissibility and (hp-)invertibility to be defined as usual.) All (hp-)admissible rules preserve the polytree structure of sequents, and with the exception of (gax), (cut), and (cd), all rules in Figure 3 are hp-admissible in both calculi. The (gax) and (cut) rules are strictly admissible in both calculi, while (cd) is hp-admissible in only 𝖫𝖡𝖨𝖰(𝒞𝒟) as the availability conditions are not imposed on rules, rendering domain atoms unnecessary (see Remark 14). We now discuss some of the most interesting rules of Figure 3.

Let us first explain the rules (brf) and (brb). For some labels w, u, and v, assume wu and u↠̸v for :=,wRv. Then, we know that (1) u and v are on two different paths passing through w of the polytree generated from , and (2) there is no vertex between v and w since otherwise a cycle would be present in . In this scenario, the rule (brf) (for branch forward) allows one to move the polytree “rooted” at v forward by connecting it to u instead of w as shown left in the below figure. The rule (brb) has a similar functionality; for some labels w, u, and v, assume wu and w↠̸v for :=,vRu. Then, the rule (brb) (for branch backward) lets one move the polytree “rooted” at v backward by connecting it to w instead of u as shown right in the below figure.

Figure 4: The left and right diagrams demonstrate the functionality of (brf) and (brb), respectively.

The (mrg) rule merges a label and its direct successor and corresponds to the rules nodemergeD and nodemergeU of Pinto and Uustalu [29]. The rule (id) reflects the redundancy of a variable labeled by two labels such that one is reachable by the other. Model-theoretically, this redundancy follows from the fact that if x is interpreted at w, and u is reachable from w (in a model), then x is interpreted at u as well, showing the domain atom u:x superfluous in the premise. Note that when the labels w and u are identical, then the rule represents a contraction on domain atoms; as ww always holds, we have that identical domain atoms can always be contracted in sequents. The rules (lwr) and (lft) allow us to modify the labels of formulae in a sequent by looking at its underlying polytree. More precisely, reading (lwr) and (lft) bottom-up, if wu we can both lower the label of u:Π on the right of the sequent to w, and lift the label of w:Σ on the left of the sequent to u.

Lemma 29.

All non-initial rules in 𝖫𝖡𝖨𝖰(𝒞) are hp-invertible.

Finally, we can prove the admissibility of the (cut) rule. As our proof proceeds via local transformations of proofs, the proof is constructive and yields a cut-elimination algorithm.

Theorem 30 (Cut-elimination).

The (cut) rule is admissible in 𝖫𝖡𝖨𝖰(𝒞).

Proof.

We proceed by a primary induction (PIH) on the complexity of the cut formula, and a secondary induction (SIH) on the sum of the heights of the proofs of the premises of (cut). Assume that we have proofs of the following form, with wu.

We argue that there is a proof of ,𝒯,ΓΔ by a case distinction on (r1) and (r2), the last rules applied in the above proofs. We focus on some interesting cases; the remaining cases can be found in the online appended version [26].

(𝒓𝟏)=(𝒂𝒙).

Then ,𝒯,ΓΔ,w:φ is of the form ,𝒯,Γ0,v0:p(t)Δ0,v1:p(t) where v0v1. If v1:p(t) is w:φ, then we have that ,𝒯,Γ,u:φΔ is of the form ,𝒯,Γ0,v0:p(t),u:p(t)Δ where Γ=Γ0,v0:p(t). Given that v0v1 and v1u, we can apply the hp-admissibility of (lft) on the latter to obtain a proof of ,𝒯,Γ0,v0:p(t),v0:p(t)Δ. Consequently, we obtain a proof of ,𝒯,Γ0,v0:p(t)Δ, i.e. of ,𝒯,ΓΔ, using the hp-admissibility of (ctrl). If v1:p(t) is not w:φ, then we have that ,𝒯,ΓΔ is of the form ,𝒯,Γ0,v0:p(t)Δ0,v1:p(t) where v0v1. The latter is provable by the admissibility of (gax).

(𝒓𝟏)=(𝒅𝒔).

Then ,𝒯,ΓΔ,w:φ is of the form ,𝒯,Γ0,v:p(t)Δ,w:φ and we have a proof of ,𝒯,v:VT(t),Γ0,v:p(t)Δ,w:φ. Consequently, we know that ,𝒯,ΓΔ is of the form ,𝒯,Γ0,v:p(t)Δ. We also have that ,𝒯,Γ,u:φΔ is of the form ,𝒯,Γ0,v:p(t),u:φΔ. We can repeatedly apply the hp-admissibility of (wv) on the proof of the latter to obtain a proof of S:=,𝒯,v:VT(t),Γ0,v:p(t),u:φΔ. Then, we proceed as follows:

Note that the instance of SIH is justified because the sum of the heights of the proofs of the premises has decreased.

(𝒓𝟏)=([Uncaptioned image]𝐋).

Then ,𝒯,ΓΔ,w:φ is of the form ,𝒯,Γ0,v:ψ[Uncaptioned image]χΔ,w:φ and we a have proof of ,v0Rv,𝒯,Γ0,v0:ψΔ,w:φ,v0:χ. Consequently, we know that ,𝒯,ΓΔ is of the form ,𝒯,Γ0,v:ψ[Uncaptioned image]χΔ. We also have that ,𝒯,Γ,u:φΔ is of the form ,𝒯,Γ0,v:ψ[Uncaptioned image]χ,u:φΔ. We apply Lemma 29 on the proof of the latter sequent to obtain a proof of ,v0Rv,𝒯,Γ0,v0:ψ,u:φΔ,v0:χ, which we call S. Thus, we proceed as shown below. Note that the instance of SIH is justified as the sum of the heights of the proofs of the premises is smaller than that of the original cut.

(𝒓𝟏)=(𝐑).

Then there are two cases to consider: in the left premise ,𝒯,ΓΔ,w:φ of (cut), either (1) w:φ is not the principal formula v:xψ, or (2) w:φ is the principal formula. In case (1), we have a proof of ,𝒯,ΓΔ1,v:xψ,v:ψ(t/x),w:φ, which we call S, and ,𝒯,Γ,u:φΔ is of the form ,𝒯,Γ,u:φΔ1,v:xψ. We proceed as follows.

In case (2), we have proof a of ,𝒯,ΓΔ,v:xψ,v:ψ(t/x), and ,𝒯,Γ,u:φΔ is of the form ,𝒯,Γ,u:xψΔ. In this case, we need to consider the shape of (r2). If u:xψ is not principal in (r2), then we apply the hp-invertibility of (r2) (Lemma 29) to the left premise of (cut) and use SIH to cut the result with the premise of (r2), applying (r2) afterward to reach our goal. If u:xψ is principal in (r2), then the premise of (r2) is of the shape ,𝒯,v:y,Γ,v:ψ(y/x)Δ where y is fresh. Then, we proceed as follows where π is the first proof given and x0,,xn are all the variables appearing in t.

Note that the step involving (id) is justified as t is available for v, meaning for each xiVT(t), there exists a domain atom ui:xi such that uiv, showing (id) applicable.

5 Concluding Remarks

Our analysis indicates that there may be two interesting and possibly distinct first-order extensions of bi-intuitionistic logic that may be worth exploring. The first is to consider a logic with decreasing domains, i.e., if wu then D(u)D(w) in the Kripke model. Semantically, this logic is easy to define, but its proof theory is not at all obvious. We are looking into the possibility of formalizing a notion of “non-existence predicate,” which is dual to the existence predicate, suggested by Restall [34]. This non-existence predicate may play a similar (but dual) role to the existence predicate in 𝖫𝖡𝖨𝖰(𝒟). The other extension is motivated from a proof-theoretic perspective. As mentioned in Remark 19, it seems that one can obtain a subsystem of 𝖫𝖡𝖨𝖰(𝒟) without the domain-shift rule (ds) that satisfies cut-elimination. As discussed in Section 3, the (ds) rule is crucial to ensure the completeness of 𝖡𝖨𝖰(𝒟) in the presence of the exclusion operator, and so, a natural question to ask is what the semantics of such a logic would look like.

References

  • [1] Nuel D. Belnap. Display logic. Journal of philosophical logic, 11(4):375–417, 1982. doi:10.1007/BF00284976.
  • [2] Kai Brünnler. Deep sequent systems for modal logic. Archive for Mathematical Logic, 48(6):551–577, 2009. doi:10.1007/s00153-009-0137-3.
  • [3] Robert A. Bull. Cut elimination for propositional dynamic logic without *. Zeitschrift für Mathematische Logik und Grundlagen der Mathematik, 38(2):85–100, 1992. doi:10.1002/MALQ.19920380107.
  • [4] Marcos A. Castilho, Luis Farinas del Cerro, Olivier Gasquet, and Andreas Herzig. Modal tableaux with propagation rules and structural rules. Fundamenta Informaticae, 32(3, 4):281–297, 1997. doi:10.3233/FI-1997-323404.
  • [5] Agata Ciabattoni, Tim Lyon, Revantha Ramanayake, and Alwen Tiu. Display to labelled proofs and back again for tense logics. ACM Transactions on Computational Logic, 22(3):1–31, 2021. doi:10.1145/3460492.
  • [6] Ranald Clouston, Jeremy E. Dawson, Rajeev Goré, and Alwen Tiu. Annotation-free sequent calculi for full intuitionistic linear logic - extended version. CoRR, abs/1307.0289, 2013. arXiv:1307.0289.
  • [7] Tristan Crolard. A formulae-as-types interpretation of subtractive logic. Journal of Logic and Computation, 14(4):529–570, 2004. doi:10.1093/logcom/14.4.529.
  • [8] Melvin Fitting. Tableau methods of proof for modal logics. Notre Dame Journal of Formal Logic, 13(2):237–247, 1972. doi:10.1305/NDJFL/1093894722.
  • [9] Melvin Fitting. Nested sequents for intuitionistic logics. Notre Dame Journal of Formal Logic, 55(1):41–61, 2014. doi:10.1215/00294527-2377869.
  • [10] D. Gabbay, V. Shehtman, and D. Skvortsov. Quantification in Non-classical Logics. Studies in Logic and Foundations of Mathematics. Elsevier, Amsterdam, London, 2009.
  • [11] Gerhard Gentzen. Untersuchungen über das logische Schließen. I. Mathematische Zeitschrift, 39(1):176–210, 1935.
  • [12] Gerhard Gentzen. Untersuchungen über das logische Schließen. II. Mathematische Zeitschrift, 39(1):405–431, 1935.
  • [13] Rajeev Goré, Linda Postniece, and Alwen Tiu. Cut-elimination and proof-search for bi-intuitionistic logic using nested sequents. In Carlos Areces and Robert Goldblatt, editors, Advances in Modal Logic 7, pages 43–66, Nancy, France, 2008. College Publications. URL: http://www.aiml.net/volumes/volume7/Gore-Postniece-Tiu.pdf.
  • [14] Rajeev Goré, Linda Postniece, and Alwen Tiu. On the correspondence between display postulates and deep inference in nested sequent calculi for tense logics. Logical Methods in Computer Science, 7(2):1–38, 2011. doi:10.2168/LMCS-7(2:8)2011.
  • [15] Rajeev Goré and Ian Shillito. Bi-intuitionistic logics: A new instance of an old problem. In Advances in Modal Logic 13, pages 269–288, Helsinki, Finland, 2020. College Publications. URL: http://www.aiml.net/volumes/volume13/Gore-Shillito.pdf.
  • [16] Andrzej Grzegorczyk. A philosophically plausible formal interpretation of intuitionistic logic. Indagationes Mathematicae, 26(5):596–601, 1964.
  • [17] Ryo Ishigaki and Kentaro Kikuchi. Tree-sequent methods for subintuitionistic predicate logics. In Nicola Olivetti, editor, Automated Reasoning with Analytic Tableaux and Related Methods, volume 4548 of Lecture Notes in Computer Science, pages 149–164, Berlin, Heidelberg, 2007. Springer Berlin Heidelberg. doi:10.1007/978-3-540-73099-6_13.
  • [18] Ryo Kashima. Cut-free sequent calculi for some tense logics. Studia Logica, 53(1):119–135, 1994. doi:10.1007/BF01053026.
  • [19] Tim Lyon. On the correspondence between nested calculi and semantic systems for intuitionistic logics. Journal of Logic and Computation, 31(1):213–265, December 2020. doi:10.1093/logcom/exaa078.
  • [20] Tim Lyon. Refining Labelled Systems for Modal and Constructive Logics with Applications. PhD thesis, Technische Universität Wien, 2021.
  • [21] Tim Lyon, Alwen Tiu, Rajeev Goré, and Ranald Clouston. Syntactic interpolation for tense logics and bi-intuitionistic logic via nested sequents. In Maribel Fernández and Anca Muscholl, editors, 28th EACSL Annual Conference on Computer Science Logic, volume 152 of LIPIcs, pages 28:1–28:16. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2020. doi:10.4230/LIPIcs.CSL.2020.28.
  • [22] Tim S. Lyon. Nested sequents for intermediate logics: the case of Gödel-Dummett logics. Journal of Applied Non-Classical Logics, 33(2):121–164, 2023. doi:10.1080/11663081.2023.2233346.
  • [23] Tim S. Lyon. Nested sequents for intermediate logics: The case of Gödel-Dummett logics, 2024. Updated version, on arXiv. URL: https://arxiv.org/abs/2306.07550, doi:10.48550/arXiv.2306.07550.
  • [24] Tim S. Lyon and Lucía Gómez Álvarez. Automating reasoning with standpoint logic via nested sequents. In Proceedings of the 19th International Conference on Principles of Knowledge Representation and Reasoning, pages 257–266, August 2022. doi:10.24963/kr.2022/26.
  • [25] Tim S. Lyon and Jonas Karge. Constructive interpolation and concept-based beth definability for description logics via sequents. In Kate Larson, editor, Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, IJCAI-24, pages 3484–3492. International Joint Conferences on Artificial Intelligence Organization, August 2024. Main Track. URL: https://www.ijcai.org/proceedings/2024/386, doi:10.24963/ijcai.2024/386.
  • [26] Tim S. Lyon, Ian Shillito, and Alwen Tiu. Taking bi-intuitionistic logic first-order: A proof-theoretic investigation via polytree sequents, 2024. Appended version on arXiv. URL: https://arxiv.org/abs/2404.15855, doi:10.48550/arXiv.2404.15855.
  • [27] E. G. K. López-Escobar. On the interpolation theorem for the logic of constant domains. Journal of Symbolic Logic, 46(1):87–88, 1981. doi:10.2307/2273260.
  • [28] Sara Negri. Proof analysis in modal logic. Journal of Philosophical Logic, 34(5-6):507, 2005.
  • [29] Luís Pinto and Tarmo Uustalu. Relating sequent calculi for bi-intuitionistic propositional logic. In Steffen van Bakel, Stefano Berardi, and Ulrich Berger, editors, Proceedings Third International Workshop on Classical Logic and Computation, volume 47 of EPTCS, pages 57–72, 2010. doi:10.4204/EPTCS.47.7.
  • [30] Luís Pinto and Tarmo Uustalu. A proof-theoretic study of bi-intuitionistic propositional sequent calculus. Journal of Logic and Computation, 28(1):165–202, 2018. doi:10.1093/LOGCOM/EXX044.
  • [31] Linda Postniece. Deep inference in bi-intuitionistic logic. In Hiroakira Ono, Makoto Kanazawa, and Ruy de Queiroz, editors, Logic, Language, Information and Computation, pages 320–334, Berlin, Heidelberg, 2009. Springer Berlin Heidelberg. doi:10.1007/978-3-642-02261-6_26.
  • [32] Cecylia Rauszer. Applications of Kripke models to Heyting-Brouwer logic. Studia Logica, 36(1):61–71, 1977. doi:10.1007/BF02121115.
  • [33] Cecylia Rauszer. An algebraic and Kripke-style approach to a certain extension of intuitionistic logic, dissertations mathematicae, 1980.
  • [34] Greg Restall. Constant domain quantified modal logics without boolean negation. Australasian Journal of Logic, 3:45–62, 2005. doi:10.26686/ajl.v3i0.1772.
  • [35] Dana Scott. Identity and existence in intuitionistic logic. In Applications of Sheaves: Proceedings of the Research Symposium on Applications of Sheaf Theory to Logic, Algebra, and Analysis, Durham, July 9–21, 1977, pages 660–696. Springer, 2006.
  • [36] Ian Shillito. New Foundations for the Proof Theory of Bi-Intuitionistic and Provability Logics Mechanized in Coq. PhD thesis, Australian National University, Canberra, 2023.
  • [37] Alex K. Simpson. The Proof Theory and Semantics of Intuitionistic Modal Logic. PhD thesis, University of Edinburgh. College of Science and Engineering. School of Informatics, 1994.
  • [38] John G. Stell, Renate A. Schmidt, and David Rydeheard. A bi-intuitionistic modal logic: Foundations and automation. Journal of Logical and Algebraic Methods in Programming, 85(4):500–519, 2016. Relational and algebraic methods in computer science. doi:10.1016/j.jlamp.2015.11.003.
  • [39] Gaisi Takeuti. Proof theory, volume 81. Courier Corporation, 2013.
  • [40] Alwen Tiu, Egor Ianovski, and Rajeev Goré. Grammar logics in nested sequent calculus: Proof theory and decision procedures. In Thomas Bolander, Torben Braüner, Silvio Ghilardi, and Lawrence S. Moss, editors, Advances in Modal Logic 9, pages 516–537. College Publications, 2012. URL: http://www.aiml.net/volumes/volume9/Tiu-Ianovski-Gore.pdf.
  • [41] Luca Viganò. Labelled Non-Classical Logics. Springer Science & Business Media, 2000.

Appendix A Soundness

Theorem 20 (Soundness).

Let S be a sequent. If S is provable in 𝖫𝖡𝖨𝖰(𝒟) (𝖫𝖡𝖨𝖰(𝒞𝒟)), then S is (CD-)valid.

Proof.

We argue the claim by induction on the height of the given derivation and consider the 𝖫𝖡𝖨𝖰(𝒟) case as the 𝖫𝖡𝖨𝖰(𝒞𝒟) case is similar.

Base case.

It is straightforward to show that any instance of (L) or (R) is valid; hence, we focus on (ax) and show that any instance thereof is valid. Let us consider the following instance of (ax), where wu due to the side condition imposed on (ax), that is, there exist v1,,vnLab() such that wRv1,,vnRu.

Let us suppose ,𝒯,Γ,w:p(t)Δ,u:p(t) is invalid, i.e., a model M=(W,,U,D,I1,I2), M-interpretation ι, and M-assignment α exist such that the following hold: ι(w)ι(v1),,ι(vn)ι(u), M,ι(w),αp(t), and M,ι(u),α⊮p(t). By the monotonicity condition (M) (see Definition 3), it must be that M,ι(u),αp(t), giving a contradiction. Thus, every instance of (ax) must be valid.

Inductive step.

We prove the inductive step by contraposition, showing that if the conclusion of the last inference in the given proof is invalid, then at least one premise of the final inference must be invalid. We make a case distinction based on the last rule applied in the given derivation.

(ds).

Suppose ,𝒯,Γ,w:p(t)Δ is invalid with t=t1,,tn. Then, there exists a model M, M-interpretation ι, and M-assignment α such that M,ι(w),αp(t). Therefore, (α¯(t1),,α¯(tn))I2(w,p), and since I2(w,p)D(w)n, we have that α¯(ti)D(ι(w)) for 1in. By the (C1) and (C2) conditions, we know that M,ι,αw:VT(t). Therefore, ,𝒯,w:VT(t),Γ,w:p(t)Δ is invalid as well.

(L).

If we assume that ,𝒯,Γ,w:φψΔ is invalid, then there exists a model M, M-interpretation ι, and M-assignment α such that M,ι(w),αφψ, implying that M,ι(w),αφ and M,ι(w),αψ, showing that the premise ,𝒯,Γ,w:φ,w:ψΔ is invalid as well.

(R).

Let us suppose that ,𝒯,ΓΔ,w:φψ is invalid. Then, there exists an model M, M-interpretation ι, and M-assignment α such that M,ι(w),α⊮φψ. Hence, either M,ι(w),α⊮φ or M,ι(w),α⊮ψ. In the first case, the left premise of (R) is invalid, and in the second case, the right premise of (R) is invalid.

(L).

Similar to the (R) case.

(R).

Similar to the (L) case.

(L).

Assume ,𝒯,Γ,w:φψΔ is invalid and wu, i.e. a sequence wRv1,,vnRu of relational atoms exist in . By our assumption, there exists a model M, M-interpretation ι, and M-assignment such that ι(w)ι(v1),,ι(vn)ι(u) and M,ι(w),αφψ. Because M,ι(w),αφψ and is transitive, we know that either M,ι(u),α⊮φ or M,ι(u),αψ. In the first case, the left premise of (L) is invalid, and in the second case, the right premise of (L) is invalid.

(R).

Assume that ,𝒯,ΓΔ,w:φψ is invalid. Then, there exists a model M, an M-interpretation ι, and an M-assignment α such that M,ι(w),α⊮φψ. Hence, there exists a world u such that ι(w)u, M,u,αφ, and M,u,α⊮ψ. Let ι(v)=ι(v) for all labels vu and ι(u)=u otherwise. Then, M, ι, and α falsify the premise of (R), showing it invalid.

([Uncaptioned image]L).

Similar to the (R) case above.

([Uncaptioned image]R).

Similar to the (L) case above.

(L).

Suppose that S=,𝒯,Γ,w:xφΔ is invalid. Then, there exists a model M, an M-interpretation ι, and an M-assignment α such that M,ι(w),αxφ. Therefore, there exists an aD(ι(w)) such that M,ι(w),α[a/y]φ(y/x) with y not occurring in S. Then, as y is fresh, M, ι, and α[a/y] falsify the premise of (L), showing it invalid.

(R).

Suppose that S=,𝒯,ΓΔ,w:xφ is invalid. Then, there exists a model M, and M-interpretation ι, and M-assignment such that M,ι(w),α⊮xφ. By the side condition on (R), we know that 𝖠(t,Xw,,𝒯), meaning there exist labels u1,,unLab(S) such that u1:x1,,un:xn𝒯, VT(t)={x1,,xn}, and u1w,,unw. It follows that ι(u1)ι(w),,ι(un)ι(w) and α¯(x1)D(ι(u1)),,α¯(xn)D(ι(un)). By the increasing domain condition (ID), we have that α¯(x1)D(ι(w)),,α¯(xn)D(ι(w)). Therefore, by the (C1) and (C2) conditions, we know that α¯(t)D(ι(w)), showing that M,ι(w),α⊮φ(t/x), and thus, the premise is invalid.

(L).

Suppose that S=,𝒯,Γ,w:xφΔ is invalid. Then, there exists a model M, and M-interpretation ι, and M-assignment α such that M,ι(w),αxφ. By the side condition on (L), we know that wu and 𝖠(t,Xw,,𝒯). By the latter fact, there exist labels v1,,vnLab(S) such that v1:x1,,vn:xn𝒯, VT(t)={x1,,xn}, and v1w,,vnw. It follows that ι(v1)ι(w),,ι(vn)ι(w) and α¯(x1)D(ι(v1)),,α¯(xn)D(ι(vn)). By the increasing domain condition (ID), we have that α¯(x1)D(ι(w)),,α¯(xn)D(ι(w)). Therefore, by the (C1) and (C2) conditions and our assumption, we know that α¯(t)D(ι(w)), showing that M,ι(w),αφ(t/x). By the fact that wu, we know ι(w)ι(u) and α¯(t)D(ι(u)), showing that M,ι(u),αφ(t/x) by Proposition 6. Thus, the premise is invalid.

(R).

Let us assume that ,𝒯,ΓΔ,w:xφ is invalid. Then, there exists a model M, an M-interpretation ι, and an M-assignment α such that M,ι(w),α⊮xφ. Thus, there exists a world uW such that ι(w)u, aD(u), and M,u,α[a/y]⊮φ(y/x). We define ι(v)=ι(v) if vu and ι(u)=u. Then, M, ι, and α[a/y] falsify the premise ,wu,𝒯,u:y,ΓΔ,u:φ(y/x), showing it invalid.