Functorial Models of Differential Linear Logic
Abstract
Differentiation in logic has several sources of inspiration. The most recent is differentiable programming, models of which demand functoriality and good typing properties. More historical is reverse denotational semantics, taking inspiration from models of Linear Logic to differentiate proofs and -terms. In this paper, we take advantage of the rich structure of categorical models of Linear Logic to give a new functorial presentation of differentiation. We define differentiation as a functor from a coslice of the category of smooth maps to the category of linear maps. Extending linear–non-linear adjunction models of Linear Logic, this produces models of Differential Linear Logic. We use these functorial presentations to shed new light on integration in differential categories.
Keywords and phrases:
Categorical semantics, Differential Programming, Linear LogicFunding:
Marie Kerjean: Work partially funded by the project ANR-24-CE48-1914.Copyright and License:
2012 ACM Subject Classification:
Theory of computation Categorical semantics ; Theory of computation Denotational semantics ; Theory of computation Linear logicAcknowledgements:
We are grateful to Jean-Simon Lemay, Zeinab Galal, and Jad Koleilat for enriching discussions on this paper. We also thank the reviewers for the many constructive comments, helping us making this paper a better version of itselfEditors:
Maribel FernándezSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
Differentiation is the bread and butter of mathematical analysis and approximation theory, providing the closest linear approximation of a function. The differential of a function at a point is the linear map . For scalar functions , this becomes , where denotes multiplication. Models of Linear Logic ( henceforth) can either be axiomatized as strong monoidal adjunctions between a category whose morphisms are “linear maps” and a category whose morphisms are “non-linear maps”, or more accurately “general maps” (smooth maps in our case), or as Seely Categories, a setting in which one considers directly the comonad that can be deduced from the previous adjunction.
Linear Logic has been extended to Differential Linear Logic ( henceforth) [12], in which non-linear proofs can be turned into linear ones. Categorical presentations of differentiation originate from investigation of denotational models of this system, resulting in a variety of interrelated categorical definitions [1]. Differentiation in such settings might be axiomatized as an external operator acting on morphisms of a category [3], or as a hard-coded natural transformation on a Seely category [14].
The differential of composed functions is famously not equal to the composite of their differentials: the latter may not even typecheck, as the points at which differentials are computed matter. This is expressed by the chain rule111We write composite morphisms within a category in diagrammatic order throughout, so means followed by , but we compose functors contravariantly to avoid confusion when writing, say, .:
| (1) |
Functorial differentiation has been brought up to date by differentiable programming: this is a much more recent domain emerging from an effort to express machine learning algorithms in a functional and principled way [5, 13]. In particular, differentiation in differentiable programming is traditionally axiomatized as a transformation of programs acting on pairs (Equation 2). This makes a functor on a suitable category of programs, with axioms enforcing linearity of differentiation.
| (2) |
Our work takes a different approach, and exploits the linear–non-linear adjunction at the heart of Linear Logic.
Here represents the forgetful functor from linear to general maps; see Definition 8 for the full hypotheses. We consider differentiation as a functor from the coslice of the category of differentiable maps under the terminal object to the category of linear maps. The coslice category can be thought of as the category of pointed objects, meaning each object comes equipped with a point and morphisms respect these. We re-express models of in terms of models of equipped with a functor
with the intuition that maps a function to , its differential at the point . The only constraint we impose is that leaves linear maps unchanged, (11). This simplifies both the pairwise presentation of differentiable programming (equation (2)) and the non-functorial transformation of differential categories. We will show that this definition is equivalent to that coming from differential categories up to a pointed version of the chain rule; the converse is straightforward (Proposition 27).
This approach also yields insights into how related constructions from analysis, most notably integration, can be realised in models of LL. We demonstrate this by using the slice category to present the fundamental theorem of analysis as an adjunction.
Related work.
Work on differential categories [2], cartesian differential categories [3] and tangent categories [8] have been successful in categorifying differentiation from a mathematical perspective, but place models of as the strictest in the hierarchy of categorical definitions. Our use of coslice categories in the context of is novel, to our knowledge, although the dual construction of slice categories appears in various places: in reverse cartesian differential categories, partial differentials are defined on slice objects [9], and slice categories are extensively used in modeling dependent type theory. We hope our work encourages research into the relationship between differentiation and dependent types.
Outline and contributions.
We provide background material on differential categories in Section 2. The core results of our study are presented in Section 3, where we define as a functor on the coslice , and demonstrate that with a few assumptions, it induces the structure of a differential category on . We formulate categorical models of with a functorial definition of differentiation (Definition 15), which we prove to be equivalent to differential storage categories under some hypotheses (Proposition 26 and Proposition 27).
In Section 4, we first review the existing presentations of integration and fundamental theorems of analysis in differential categories, then show that the coslice construction allows for a new presentation as an adjunction between and (Proposition 34). Finally, in Section 5, we discuss potential future work.
2 Preliminaries
In this section, we give a quick introduction to the syntax and semantics of . For readers familiar with , we note that Definition 4 is the only one technically necessary to the rest of the paper; the rest of this section serves as motivation and historical introduction.
Linear Logic [15] () is the result of a decomposition of Intuitionistic Logic via an involutive linear negation. Along with multiplicative and additive versions of conjunction and disjunction, it introduces the exponential connective and its dual . Some important intuition for working in coming from the Curry-Howard correspondence is that a proof of a sequent of the form corresponds to the construction of a linear function from to , while a proof of constructs a non-linear function from to . When modelling in vector spaces, these intuitions are literally valid, in that a proof of is interpreted as a linear map and a proof of is interpreted as (the transpose across the linear–non-linear adjunction of) a not-necessarily-linear map.
[12] adds to new rules on the connective , expressing the fact that one is able to transform a non-linear proof into a linear one. While features a dereliction rule (), introduces a codereliction rule () which acts like differentiation at into the logic:
This creates a duality between linear and non-linear maps in . We recall also the exponential rules of called respectively weakening (), contraction () and promotion (). adds to these dual rules of coweakening () and cocontraction ().
Since the focus of this paper is on categorical models rather than the proof theory of these logics, we will not give a full presentation of the syntax of ; we omit the cut elimination rules and the corresponding commutative diagrams between their interpretation as these are direct translations of differential calculus rules. We refer the reader to the survey of Ehrhard [11] for a complete introduction. Let us now turn to the categorical models.
Definition 1.
A Seely category [19] is a monoidal closed category with finite products and a comonad such that is a (strong) monoidal functor
where denotes a terminal object. Monoidality of means we have “Seely isomorphisms”:
Following the convention, we will denote linear maps by and non-linear or smooth maps in the coKleisli category by (note the difference in arrow head). To model classical , one requires that be a -autonomous category, with an involutive duality .
The image of the diagonal map and terminal map for the product in under give each object of the form a canonical coalgebra structure:
That is, Seely categories are categorical models of intuitionistic Linear Logic [17], with the above maps interpreting contraction and weakening, while dereliction is interpreted by the counit of the comonad (as suggested by the notation) and promotion is interpreted by coKleisli extension. Moreover, we recover an instance of the linear–non-linear adjunction discussed in the Introduction by constructing the coKleisli category for the comonad .
The usual definition of categorical models of refines the axiomatization of Seely categories. For instance, to interpret and in this case one only needs to replace products with biproducts. This is a natural choice: to reflect the addition of rules to a logic, one adds more constructors to the categories providing the logic’s semantics.
Definition 2.
A semi-additive category is a category with finite products and coproducts such that the initial object is terminal and the binary product coincides with the binary coproduct. In the present paper, we denote the resulting biproduct monoidal structure by , with unit the zero object, denoted . We also give the following names to the canonical morphisms associated to this structure:
Note that for any object, is a commutative monoid and is a commutative comonoid. The hom-sets of a semi-additive category canonically carry the structure of commutative monoids: given , we may define to be the composite,
| (3) |
and this operation has the commutativity and associativity properties one would expect.
Example 3.
The category of real vector spaces and linear maps is semi-additive, with the direct sum and unit the zero vector space. The commutative monoid structure on morphisms is just the pointwise sum.
There is a whole hierarchy of axiomatizations of “differential categories” [1], but we will make use of the strongest notion.
Definition 4.
A differential category, called a “differential storage category” in [2] is a semi-additive Seely category (with biproducts denoted as in Definition 2) in which the comonad is equipped with a natural transformation satisfying:
-
invariance of linear maps under differentiation,
(4) -
and the generalized chain rule, requiring the following diagram to commute:
(5)
where and are constructed from the algebra structure on each object as follows:
These endow with coalgebra structure dualizing the algebra structure constructed earlier.222The definition given in [2] includes more equations; this presentation is equivalent by [1, Corollary 6.3].
The generalized chain rule (5) captures the usual formula for composed functions,
For the reader unfamiliar with differential categories, we now get a bit more into the categorical transposition of chain rule. We start with the simpler case of differentiation at , the chain rule without a context, called the “alternative chain rule” in [1]:
| (6) |
where is the inverse of the right unit map. The relationship becomes more apparent if we post-compose this diagram with , for and . The comultiplication expresses composition of transposed non-linear maps: if and are respectively the transposes of non-linear maps , then transpose to . As such, represents the differential . Meanwhile, by naturality of and , the alternative composite can be rearranged to ; the first two terms yield , while the rest translates to , which by a change of variables is equal to . The generalized chain rule (5) adds to this set-up a parameter along which functions are not differentiated, so captures the chain rule for partial differentiation.
Proposition 5 ([1, Lemma 6.5]).
The generalized chain rule 5 can be recovered from the alternative chain rule 6 if one asks for the following strength condition on , where represents the monoidality of within and can be reconstructed from the Seely isomorphisms .
| (7) |
In terms of the Seely isomorphisms, this becomes a more complex condition:
The transformations , and complement the components of interpreted in a general Seely category to produce an interpretation of . In fact, the differential categories axiomatized above are denotational models of intuitionistic , and of classical when we impose the extra condition of -autonomy on . The exploration of less strict axiomatizations has been fruitful in incorporating various mathematical examples from functional analysis and differential geometry into this logical framework. However, the definition above crucially does not reflect the symmetry of exponential rules in , and does not express differentiation using the traditional objects of computer science. Notably, the chain rule is at first sight quite intricate. In this paper, we attempt to simplify this by showing that it is a consequence of functoriality in a new presentation.
Rather than merely introducing new operations such as , our approach requires us to introduce new categories on top of the setup, expanding Definition 1 to an adjunction involving the coKleisli category of mentioned above. Nonetheless, the extension of to described in Definition 4 can be recovered from our construction.
Notation.
Given a differentiable function between two vector spaces, we denote by the linear map corresponding to the differential of at a point .
Example 6 (The distribution model).
The category of convenient spaces [4] provides an illustrative example of a differential category. Formulas are interpreted by real topological vector spaces endowed with a bornology, which is a well-behaving collection of bounded sets; we shall not attempt to give detailed definitions of these structures here. The exponential is interpreted as the completion of the vector space generated by the Dirac distributions,
The Dirac distribution , defined by , is formally an object of the dual of the space of smooth functions, otherwise known as the space of distributions of compact support over [18]. As the Dirac distributions form a linearly independent family of distributions, to define linear morphisms on it is enough to specify their value on Diracs and then extend linearly and continuously to the completion. As such, the components of are interpreted as follows:
Convenient spaces do not form a -autonomous category, and constructing -autonomous smooth models of with a non-trivial duality is tricky, often requiring compromise on the interpretation of the other components of , such as [10] which features an ad-hoc construction to allow differentiation. An alternative work-around for this issue is to construct polarized models of [16].
Example 7.
We denote by the category of sets and relations. The interpretation of in produces the well-known, historical “relational model”. The tensor product is given by the cartesian product of sets, and the product (which coincides with the coproduct) is given by the disjoint union of sets . The exponential is interpreted by finite multisets: , the set of all finite multisets of . The dereliction and codereliction respectively put in relation elements with singleton multisets containing them, and reciprocally. The contraction and cocontraction relate pairs of finite multisets to their disjoint union, while the weakening and coweakening relate to the empty multiset. The co-multiplication of the co-monad is , relates a finite multiset to each finite multiset of finite multisets of which it is the disjoint union.
3 Differentiation on the coslice
This section represents the heart of this paper: here we will start from a linear–non-linear adjunction and show how defining a functor from a coslice of the non-linear category to the linear category makes into a differential category, up to a well-pointed assumption.
Definition 8.
For the purposes of the present paper, a linear–non-linear adjunction is an adjunction:
| (8) |
in which:
-
is both monoidal closed and semi-additive;
-
has cartesian monoidal structure;
-
is a strong monoidal functor, so and .
This adds hypothesis to the usual notion of linear-non-linear adjunction, in which the semi-additivity is not required and the adjunction is only lax monoidal [17, Definition 21].
Notation.
We denote the endofunctor on from the adjunction 8 by , the unit by and the counit by . We moreover write for the comultiplication of the induced comonad on . We denote by and by , which is consistent with our notation for Seely categories in the special case where is the coKleisli category of . Note that preserves products, being a right adjoint. For notational convenience, we work as if and are strictly monoidal, in the sense that we suppress the isomorphisms and . With this convention, the projection map satisfies and moreover (considering the case ) preserves the left and right unit maps for on the nose.
Remark 9.
We will not actually use the fact that the monoidal structure on is closed, but we wished to ensure the hypotheses in the various definitions lined up.
Example 10.
The distribution model of Example 3 can also be understood from the adjunction point of view instead of the comonadic point of view. Let be the category whose objects are convenient vector spaces and whose morphisms are smooth functions and the category of convenient vector spaces and linear maps. In adjunction (8), is interpreted via , that is the space generated by Dirac distributions on , and is the forgetful functor. is a cartesian closed category, is monoidal closed and is a strong monoidal adjunction.
Lemma 11 ([17]).
Given a linear–non-linear adjunction as in Definition 8, acquires the structure of a Seely category.
As such, we can interpret the ingredients of Differential Linear Logic, and , as we did in Section 2. In order to interpret differentiation, we need to expose further structure. To begin with, we observe that the semi-additive structure on is partially preserved by .
Proposition 12.
Suppose that we are given a linear–non-linear adjunction as in Definition 8. For objects and , has a distinguished element,
where is the unique morphism. This is the identity element of a commutative monoid structure on .
Proof.
For , we define their sum using the codiagonal map:
| (9) |
To show that is the identity element, consider the transpose of :
by monoidality of . The bialgebra laws over , as described for example in [1, Section 7] give us and as such , the transpose of , so as required.
Next we formalize the category of pointed objects of .
Definition 13.
Suppose that is a category with a zero object , that is a category with terminal object and that we are given a functor sending to .
Recall that the coslice category is defined as follows. The objects of are pairs with and , and morphisms are morphisms such that .
We may similarly construct the coslice category where objects are instead pairs with and , and morphisms consist of with .
Notation.
In order to more explicitly keep track of the points involved in a morphism of , we denote a morphism by .
The coslice categories and come equipped with projection functors to and which respectively forget the points; we denote both by . Moreover, induces a functor which we denote by , mapping to . Putting these together, we get:
| (10) |
Example 14.
Let’s see how differentiation can be interpreted in Example 10 using the coslice construction. In this case is the zero vector space and is the category of pairs , where is a vector in . Differentiation defines a functor as follows: an object is mapped to and an arrow is sent to the linear map . The chain rule exactly translates to functoriality of . Indeed, morphisms and compose if and only if and then we have
Example 14 contains the essential idea of the present section. As we shall shortly see, there will be no need to independently construct a codereliction operator once we equip our adjunction with a differentiation functor, since codereliction can be recovered from as differentiation at .
Definition 15.
A functorial model of differentiation consists a linear–non-linear adjunction between and equipped with a functor such that
| (11) |
which is to say that the following triangle (ignoring the dashed functors) commutes:
Remark 16.
The choice of name in Definition 15 is not intended to seem grandiose: as we shall see, these categories do not quite constitute models of in general. As discussed in the introduction, the idea that the chain rule is compositional on pairs of objects is well-established [13]. Here we explore another presentation of differentiation as a functor compatible with linear–non-linear adjunctions as models of Linear Logic.
More explicitly, (11) says that for any object of and point we have , and that for any arrow in we have . This identity also implies that preserves products of objects in , in the following sense:
Lemma 17.
The functor in Definition 15 preserves limits of objects in the image of . In particular, for and in and , we have:
| (12) |
Proof.
A short proof is that creates limits and preserves them (because does), whence the identity (11) forces to preserve them.
Proposition 18.
Given morphisms in , we have:
| (13) |
Similarly, given:
we have,
| (14) |
Proof.
The first identity is an immediate consequence of functoriality of . For the second, the definition of produces the following diagram:
Applying , the upper square is mapped to by Lemma 17 and the lower square is mapped to by (11), yielding the desired sum, as required. As such, satisfies the chain rule and is linear with respect to the additive structure on morphisms. It follows that interacts well with products of morphisms too:
Corollary 19.
Let and and . Then we have:
| (15) |
Proof.
By definition, . Thus by Lemma 17,
We have: by (11) and similarly for , whence we recover the claimed product of morphisms.
Comparing with differential categories, the only thing missing from to make it a differential category is a codereliction satisfying the required identities.
Definition 20.
For an object of , we define:
Lemma 21.
The transformation is natural.
Proof.
We have a functor sending to and to ; this is a section of . By inspection, lifts to a natural transformation , and is the result of applying to this lifting, so is natural as required.
We denote by the transpose of a morphism in across the adjunction . Since , using Proposition 18, the definition of and (11) we have:
| (16) |
In particular, the linear rule holds for any object of , since the counit morphism transposes to the identity in (16). Following [1, Proposition 6.2], by making use of the semi-additive structure we can deduce that the product rule is satisfied:
| (17) |
The following uses functoriality to capture differentiation of translated functions: intuitively, .
Lemma 22.
The functor internalizes translations, meaning for any point and morphism in , we have:
where is the inverse of the right unit morphism (or equivalently, the coproduct injection morphism).
Proof.
By compositionality, we may without loss of generality consider the case . By Proposition 12, we have , so we can construct the following sequence of arrows in :
Observing (since its domain is ), the image under of the composite is:
where (11) gives left- and right-hand morphisms, while Corollary 19 gives the middle morphism. By the left unit law for and , the composite is the identity, as required.
We can use Lemma 22 to strengthen (16): we now consider the differential of a function at any point and express it with respect to .
Lemma 23.
For any and , we have:
| (18) |
where again is the inverse of the right unit map.
Proof.
Consider first the case , so . By Lemma 22,
By naturality of and monoidality of , the morphism part of the right-hand side is equal to
whence we can apply (16) to conclude,
| (19) |
by naturality of in , observing that . The general case follows by the argument used to derive (16).
We now begin working toward the chain rule, beginning with the context-free case, (6). This turns out to be a consequence of naturality of .
Lemma 24.
For any morphism , we have:
| (20) |
Proof.
Recalling that , we have a naturality square for that makes the arrows involved equal in , illustrated in the following diagram, to which we apply :
Corollary 25.
The codereliction satisfies the chain rule without a context, (6):
| (21) |
Proof.
Let in (20). We can expand the left-hand side with (16); the first term on the right-hand side is the definition of and the second term can be expanded using (19). Observing that , the resulting identity is:
The right-hand side can be manipulated into the expression in (21) by exploiting naturality of and rearranging parallel tensored terms.
However, to fully interpret , one also needs to interpret the chain rule within a context (5). This is the translation of a cut rule with a co-dereliction on the context of a promotion in with other elements not being cut.
Interpreting this rule in the context of Example 14, the proof takes a two-variable function (with transpose ) and promotes it:
Cutting with the codereliction rule means computing, for each vector , the function , which expands to, via the chain rule. Abstracting this reasoning leads us to a “pointed” version of the generalized chain rule.
Proposition 26.
The codereliction satisfies a pointed version of the generalized chain rule (13). Namely, for each , we have:
Proof.
We deduce the equation from Lemma 24 as follows. By Lemma 23, the left-hand side of (20) expands as:
Meanwhile, the right-hand side expands via two applications of (19) to give:
Here we pass from the first to the second line by naturality of and the fact that , and from the second to the third line by naturality of . Since is an isomorphism, the desired identity follows. These developments would be of questionable value if they diverged too far from the established formalism of differential categories, so the following result is important.
Proposition 27.
A differential category is a functorial model of differentiation (Def.15).
Proof.
Let be a differential category, with notation as in Definition 4. Let us denote by the coKleisli category and by the product on inherited from . Then define a functor by:
The effect of on morphisms is the exact categorical translation of the effect of in Example 14. To show that is functorial, consider:
Expanding the definition, we first apply naturality of and , then the argument in the proof of Proposition 26 and naturality of , then naturality of , and finally the chain rule of Definition 4:
as required. The fact that follows directly from , as
Remark 28.
The separation in axiomatic strength between differential categories and our model thus comes down to the separation between the pointed chain rule of Proposition 26 and the generalized version (5). If morphisms of the form are jointly epimorphic (a sort of well-pointedness condition), then the latter follows directly from the former. However, this condition is not satisfied by the category of sets and relations of Example 7 (see [6, section 4]) so this condition seems undesirably strong.
4 Integration from a functorial point of view
We now explain how presenting differentiation as a functor on the coslice allows for a new understanding of integration. We begin with a review of integration in Seely categories, based on work by Lemay, Cockett and Ehrhard [7, 11].
4.1 Integration and antiderivatives
The fundamental theorems of calculus relate differentiation and integration for functions of a single variable. The first (FTC1) states that differentiating the integral of a function returns the original function. For a function and , one has:
The second (FTC2) states that integrating a function’s derivative returns the function itself:
These are valid for real-valued functions only. For higher-order functions, one replaces expressions by . This means that integration is an operation on functions with a linear and a non-linear argument:
In Section 3, we considered differentiation at , as encoded by . More generally, we can present differentiation at any point in terms of a so-called deriving transformation.
Definition 29.
Definition 30 ([11]).
A differential category is said to have antiderivatives if the following morphism is invertible on every object :
In the distribution model would be interpreted as . We refer to [11, section 3.2] for detailed computations using this intuition, which involves integration by parts of the function .
Lemay and Cockett [7] gave a more general axiomatization of integration independent from differentiation. Integration is represented by a natural transformation . The basic intuition is that post-composing a function by results in the integral from zero, where stands for any linear argument and not specifically for a measure. Cockett and Lemay then formulate (FTC2) categorically as follows:
| (22) |
where . Theorem involves an extra commutativity of variables condition (Poincaré’s lemma), see [11, Prop. 13] and [7, Sec. 5.2]. We leave its study in our context for future work.
4.2 Derivatives and Anti-Derivatives in an adjunction
Consider a model of Differential Linear Logic, notated as in Definition 4. For the sake of simplicity, we consider as the co-Kleisli category over for the co-monad . We denote the transpose in of the map . We still write for the embedding of a linear map in , meaning .
Note that all statements and proofs, suitably modified, work in the more general context of a linear–non-linear adjunction , with a functor , provided that the generalized chain rule (5) holds: this is used in Proposition 32.
We would like to study a bijection of the following form:
| (23) |
The core intuition of our development is that maps a function to its derivative . Meanwhile, should map a function to its integral . Then (FTC1) will translate to . We now develop that statement categorically.
Definition 31.
We define the functor and the mapping such that:
Proposition 32.
is natural in and in . If is moreover an isomorphism, then it defines an adjunction between and the functor described as a section of in the proof of Lemma 21.
Proof.
Consider morphisms in and . To show naturality, one has to prove that . Consider then . By definition of , and functoriality of one has:
Note that Equation 23 describe (FTC1) only for functions which are null at some point . Hence, we will need an abelian structure on homsets. Following Proposition 12, has an additive structure for any and .
Lemma 33.
For morphisms in , one has . In particular, when is an abelian group, transposition across the adjunction preserves subtraction.
Assuming enrichment in abelian groups, we may now apply to the translation of which equals at , namely . Since , we have:
| (24) |
Define . Intuitively, maps a function to . Projecting in one gets and as such a morphism of :
Proof.
The proof amounts to a straightforward computation:
5 Conclusion
In this paper, we showed that differentiation as defined in Differential Linear Logic can be described as an extension of the formalism of linear–non-linear adjunction by a functor from a co-slice of the non-linear category to the linear category, , and that this recovers differential categories up to a well-pointedness condition. We then extended to an adjoint functor in order to describe antiderivatives. We believe that our presentation provides a more accessible axiomatization of differentiation that offers simpler intuitions.
The following avenues present natural next steps in this research.
Polarity and Differentiation.
Once this work stabilizes, we would like to investigate what polarized models of should be, and refine diagram (8) in the case where is -autonomous, or more generally decomposed into a polarized chirality. One could also use chiralities to relax the coincidence of products and coproducts we relied on in the present paper (constraining us to semi-additive categories) to the mere existence of these structures.
Integration.
Section 4 studies integration and (FTC2) only as an inverse to differentiation. A further step in this line of work would be to integrate the formulation of (FTC1) into this presentation. To complement our work on differentiation, we would like to understand how the axioms of integration can be lifted to a functorial presentation on their own to reach a more complete picture of the interactions between integration and differentiation. An interesting question would be whether any adjunction of the form results in antiderivatives, in the sense that the natural family of bijections is of the form proposed for and its inverse.
Well-pointedness.
We would like to find an extension of our set-up witnessing the generalized chain rule without recourse to the compromises of Remark 28. An avenue suggested by Zeinab Galal is to extend the domain of definition of to a comma category such as to provide “generalized points” at which to differentiate, possibly abstracting the pairwise functorial differentiation of differentiable programming, . This was the origin of the functor used to describe antiderivatives in Section 4.
Dependent types.
Finally, this setting gives a dependent flavor to differentiation, and it feels natural to investigate a possible link with dependent types.
References
- [1] R. F. Blute, J. R. B. Cockett, J.-S. P. Lemay, and R. A. G. Seely. Differential categories revisited. Applied Categorical Structures, 2020. doi:10.1007/s10485-019-09572-y.
- [2] R. F. Blute, J. R. B. Cockett, and R. A. G. Seely. Differential categories. Mathematical Structures in Computer Science, 2006. doi:10.1017/S0960129506005676.
- [3] R. F. Blute, J. R. B. Cockett, and R. A. G. Seely. Cartesian differential categories. Theory and Applications of Categories, 2009.
- [4] Richard Blute, Thomas Ehrhard, and Christine Tasson. A convenient differential category. Cahiers de topologie et géométrie différentielle catégoriques, 53(3):211–232, 2012. arXiv:1006.3140.
- [5] A. Brunel, D. Mazza, and M. Pagani. Backpropagation in the Simply Typed Lambda-calculus with Linear Negation. POPL, 2020. doi:10.1145/3371132.
- [6] Antonio Bucciarelli, Thomas Ehrhard, and Giulio Manzonetto. Not enough points is enough. In Computer Science Logic, 2007. doi:10.1007/978-3-540-74915-8_24.
- [7] J. R. B. Cockett and J.-S. Lemay. Integral categories and calculus categories. Mathematical Structures in Computer Science, 29:243–308, 2019. doi:10.1017/S0960129518000014.
- [8] J. Robin B. Cockett and Geoff S. H. Cruttwell. Differential Structure, Tangent Structure, and SDG. Appl. Categorical Struct., 22(2):331–417, 2014. doi:10.1007/s10485-013-9312-0.
- [9] J. Robin B. Cockett, Geoff S. H. Cruttwell, Jonathan Gallagher, Jean-Simon Pacaud Lemay, Benjamin MacAdam, Gordon D. Plotkin, and Dorette Pronk. Reverse derivative categories. In Maribel Fernández and Anca Muscholl, editors, 28th EACSL Annual Conference on Computer Science Logic, CSL 2020, January 13-16, 2020, Barcelona, Spain, volume 152 of LIPIcs, pages 18:1–18:16. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2020. doi:10.4230/LIPIcs.CSL.2020.18.
- [10] Y. Dabrowski and M. Kerjean. Models of Linear Logic based on the Schwartz product. Theory and Applications of Categories, 2020. doi:10.48550/arXiv.1712.07344.
- [11] T. Ehrhard. An introduction to differential linear logic: proof-nets, models and antiderivatives. Math. Struct. Comput. Sci., 2017. doi:10.1017/S0960129516000372.
- [12] T. Ehrhard and L. Regnier. Differential interaction nets. Theoretical Computer Science, 364(2), 2006. doi:10.1016/j.tcs.2006.08.003.
- [13] Conal Elliott. The simple essence of automatic differentiation. In Proceedings of the ACM on Programming Languages (ICFP), 2018. doi:10.1145/3236765.
- [14] M. P. Fiore. Differential structure in models of multiplicative biadditive intuitionistic linear logic. In TLCA, 2007. doi:10.1007/978-3-540-73228-0_13.
- [15] J.-Y. Girard. Linear logic. Theoret. Comput. Sci., 50(1), 1987. doi:10.1016/0304-3975(87)90045-4.
- [16] Marie Kerjean. A logical account for linear partial differential equations. In Anuj Dawar and Erich Grädel, editors, Proceedings of the 33rd Annual ACM/IEEE Symposium on Logic in Computer Science, LICS 2018, Oxford, UK, July 09-12, 2018, pages 589–598. ACM, 2018. doi:10.1145/3209108.3209192.
- [17] Paul-André Mellies. Categorical semantics of linear logic. Société Mathématique de France, 2008.
- [18] L. Schwartz. Théorie des distributions. Hermann, Paris, 1966.
- [19] R. A. G. Seely. Linear Logic, *-Autonomous Categories and Cofree Coalgebras. In In Categories in Computer Science and Logic. American Mathematical Society, 1989.
