Apex Representatives
Abstract
Given a zigzag filtration, we want to find its barcode representatives, i.e., a compatible choice of bases for the homology groups that diagonalize the linear maps in the zigzag. To achieve this, we convert the input zigzag to a levelset zigzag of a real-valued function. This function generates a Mayer–Vietoris pyramid of spaces, which generates an infinite strip of homology groups. We call the origins of indecomposable (diamond) summands of this strip their apexes and give an algorithm to find representative cycles in these apexes from ordinary persistence computation. The resulting representatives map back to the levelset zigzag and thus yield barcode representatives for the input zigzag. Our algorithm for lifting a -dimensional cycle from ordinary persistence to an apex representative takes time. From this we can recover zigzag representatives in time , where is the size of the output.
Keywords and phrases:
zigzag persistent homology, Mayer–Vietoris pyramid, cycle representativesFunding:
Tamal K. Dey: partially supported by NSF grants CCF-2437030 and DMS-2301360.Copyright and License:
![[Uncaptioned image]](x1.png)
2012 ACM Subject Classification:
Theory of computation Computational geometry ; Mathematics of computing Algebraic topologyEditors:
Oswin Aichholzer and Haitao WangSeries and Publisher:

1 Introduction
In topological data analysis, one often wants to not only describe the distribution of topological features in data, but also to identify individual features directly in the input. For persistent homology, a natural starting point are the cycle representatives of homology classes. Specifically, one wants to find barcode representatives, a consistent choice of bases for the input sequence of homology groups. For ordinary persistence, this comes directly from the computation: a representative at the start of the bar includes into every consecutive space and therefore remains a valid choice of the basis element.
For zigzag persistence, the situation is more complicated. Because simplices can enter and leave the complex, a representative cycle at one step in the zigzag filtration may not exist at a later step. So even though zigzag persistence algorithms keep track of representative cycles, the specific choices that they make do not always produce compatible barcode representatives. To cope with this problem, Dey et al. [9] introduced an algorithm that keeps track of additional information that allows it to report consistent barcode representatives. For a zigzag filtration with insertions and deletions, the algorithm requires preprocessing and time to report one or all representatives for any single bar, and time to report all representatives.
On the surface, this is as well as one can hope to do: there are bars, over spaces, with each representative consisting of simplices. But we can do better. Every zigzag filtration is isomorphic to a levelset zigzag filtration of some function. The function generates the Mayer–Vietoris pyramid [5], which decomposes into diamond summands [2]. Each diamond has a single space as its origin, its apex: this space maps into every other space in the diamond. By finding the diamond’s representative in the apex – an apex representative – we can quickly recover its representative in every other space, including those that fall in the levelset zigzag, and therefore in the original zigzag; see Figure 1. All the information can be recovered from an ordinary persistence computation. It takes matrix multiplication time, , to compute persistence [16], and an extra time to recover a -dimensional apex representative of size . After the preprocessing, zigzag representatives can be recovered in time, where is the size of the output.
2 Background
We assume familiarity with simplicial and cell complexes, homology, persistent homology, including zigzag persistence. We only recap select topics to establish notation. Any gaps can be filled with the standard literature [14, 12, 13].
Throughout the paper, we take care to work with arbitrary field coefficients, but abuse notation as follows. Cell means the coefficient of in is non-zero, . If is a chain in some cell complex , then means that is supported on the subcomplex , i.e., . Similarly, refers to the restriction of chain to , i.e., .
Persistence.
Given a filtered cell complex , let be a matrix that represents its boundary operator, with rows and columns ordered according to the filtration. To find ordinary persistence pairing, one can compute a decomposition, , where matrix is reduced, meaning its pivots – lowest non-zero elements in its columns – appear in unique rows, and matrix is full-rank upper-triangular. We index the columns and rows of the matrices by the (totally ordered) cells of the input complex. The persistence pairing is given by the pivots of matrix : a -dimensional homology class created by the addition of simplex is destroyed by the addition of simplex iff , where returns the pivot in the column . By definition, the corresponding column stores a -dimensional chain whose boundary is the cycle .
The decomposition is not unique: multiple matrices and satisfy it. However, the original persistence algorithm [10] performs operations in a lazy fashion, subtracting columns from left to right only if their pivots collide. We call this a lazy reduction. It has the following special properties that simplify our algorithms.
Lemma 1 (Lemma 1 in [17]).
Assume decomposition is obtained via the lazy reduction, and and are such that . If , then entry .
Proof.
Induction on ; see [17].
Remark 2.
The authors of [17] simplify the statement of the preceding lemma by abusing the notation. They assume that if , then , a special imaginary cell that precedes every cell in the complex. In particular, it follows that in a lazy reduction, entry if .
The contrapositive of the lemma is the following corollary.
Corollary 3.
Assume decomposition is obtained via the lazy reduction, and and are such that . If , then .
Zigzag persistence.
We start with a zigzag of simplicial complexes,
(1) |
where every consecutive pair of complexes vary by at most one simplex, i.e., can be equal to in the sequence. It is convenient to assume that every simplex appears and disappears exactly once in this sequence, and therefore is present during some interval , i.e., iff . This assumption can be made without loss of generality by assuming that the union of all simplices across all times, forms a -complex. The full definition of this object is too verbose – we refer the reader to [7], which adapts the classical concept of -complexes in [14, p. 103] to our case of zigzag filtration – but informally it relaxes the requirement that in a simplicial complex two simplices intersect in a common face. All the familiar notions of simplicial homology work the same, but now instead of having a simplex supported over multiple intervals in the zigzag, we can have multiple simplices, with the same boundary, each supported on a single interval. We use to denote the size of the input complex.
Remark.
To simplify the notation we let and .
Remark 4.
We assume that , i.e., is not a single point. We can assume this without loss of generality because we can always pad the zigzag with extra copies of a space.
Applying homology, we get a zigzag of homology groups, where we suppress dimension for simplicity,
(2) |
Just like with ordinary persistence, this zigzag decomposes into bars or persistence intervals [3]. Let be a choice of elements in such that the non-zero elements form a basis for . We say that such bases are compatible across the zigzag if the maps in Equation 2 diagonalize, i.e., . For every , are non-zero over a single persistence interval . Cycles that give a set of compatible bases are called zigzag representatives, with being the representatives of the -th bar in the zigzag barcode.
Real-valued function.
A convenient setting for persistence is that of a Morse-like [4] real-valued function . We denote with the preimage of an interval and allow the endpoints to be infinite, , in which case the interval is understood to be open at the infinite ends. We denote the following pairs of spaces:
Let be the critical values of the function . Let be regular values interleaved with the critical values: . The following constructions play an important role in the theory of persistent homology:
-
Extended persistence (EP):
-
Levelset zigzag (LZZ):
Carlsson et al. [5] showed that these two sequences contain the same information by arranging the four types of spaces into a Mayer–Vietoris Pyramid, see Figure 2(left). Once homology is applied, the pyramid unrolls into an infinite Mayer–Vietoris strip of homology groups, where every diamond belongs to the Mayer–Vietoris long exact sequence, making it exact in the terminology of [3], see Figure 2(right). This allows one to translate the decomposition between any two paths that differ by a single diamond (and therefore between any pair of paths via composition).
Bendich et al. [2] further showed that the translation rules in [5] hold at the level of the basis elements of the individual paths, which means the entire pyramid decomposes as a direct sum of (indecomposable) pointwise -dimensional diamond summands, flush with its boundaries, shown in blue in Figure 2(right). The decomposition of the levelset zigzag and extended persistence are simply slices through these flush diamond summands. Bauer et al. [1] study the Mayer–Vietoris Pyramid in the cohomological setting.
We call the bottom-most space in the support of a diamond its apex. The homology class assigned to the diamond in that space, an apex class, and any cycle that belongs to such a class, an apex representative. We note that just like the choice of the basis for ordinary persistence is not unique, neither is the choice of the apex classes. An apex class is characterized by lying outside the image of the maps into the apex; specifically, for the four types of apexes their classes satisfy:
s.t. | and | |||
s.t. | and | |||
s.t. | and | |||
s.t. | and |
Because there is a map from the apex to every space in the diamond, we can recover a representative in any space by simply mapping an apex representative forward. All the maps in the pyramid are inclusions, except for the boundary homomorphism connecting two consecutive dimensions of homology in the strip. We say more about this translation in Section 7, but the overarching point is that given an apex representative, it is straightforward to recover every other representative, and levelset zigzag representatives in particular.
3 Setup
Prism.
Given a zigzag in Equation 1, we define a larger cell complex , called a prism:
(3) |
together with a function on its underlying space, defined by the projection onto the second component, . See Figure 3. We observe that the prism consists of two types of cells: horizontal cells and vertical cells .
We define an integer interlevel set, for , We note that its underlying space . Then for any pair of integers , we get four types of pairs of subspaces,
, | , | , | . |
Applying homology to each pair, we get relative homology groups that appear in the four quadrants of the Mayer–Vietoris pyramid: .
Because even-indexed spaces in Equation 1 include into odd-indexed spaces, , we have an isomorophism of homology groups , induced by deformation retractions. It follows that the levelset zigzag [5] of function ,
is isomorphic to our starting zigzag in Equation 2. Explicitly, the isomorphism is given by the following bijection between the four types of intervals:
(open-open) | |||||
(open-closed) | |||||
(closed-open) | |||||
(closed-closed) |
Intervals.
Because we perform computation on the input complex , rather than the prism, we need a notation for different intervals and pairs of intervals. The connection between these and the corresponding intervals in will be made clear in Section 6. We denote the sub- and super-level sets:
and define four pairs of spaces:
Because every step of the input zigzag is a simplicial complex, it follows that for any ,
(4) |
The following relationships between sub- and super-levelsets follow immediately:
(5) | ||||
(6) |
Cone.
To compute the persistence decomposition of the zigzag, we follow the algorithm of Dey and Hou [7], who use a construction similar to extended persistence [6] on the cone over the input complex , where indicates a join with the cone vertex . Every simplex in the base space gets value from the input zigzag. Every simplex in the cone gets value from the input zigzag. We then define a filtration, where the base space simplices come first, ordered by increasing value, and the cone simplices come second, ordered by decreasing value. Specifically,
which in reduced homology is isomorphic to
(7) |
The latter is an ordinary filtration and we compute its persistence via the decomposition of the boundary map of the cone . Crucially, the resulting bars are in one-to-one correspondence with the bars in the decomposition of the input zigzag in Equation 2.
4 Lifting
Because prism has so many cells, it is expensive to process directly. We want to perform computation on the cone instead. To recover the apex representatives for the prism, we need to lift cone cycles in to prism cycles in . In Section 6, we explain what exact cycles to lift to get different apex representatives. Meanwhile, we first describe an algorithm that given a (relative) cycle in produces a cycle in , where is one of the four types of intervals, , , , . Algorithm 1 in this section is a reinterpretation of an inefficient, but easier-to-understand Lift-Cycle-easy algorithm in the full version of this paper [8]. We encourage the reader to go through through that algorithm first.
Remark.
A -chain in consists of two types of -cells: , where is a -simplex, and , where is a -simplex.
Remark.
We use an abridged notation, , to represent the chain . This choice is guided by computational efficiency: the former requires constant space vs. the space required for the latter.
Algorithm 1 takes a -cycle , a direction expressed as a pair of values , and an initial -cycle . We call the pair a direction because the order of the values specifies whether we process the cycle in increasing () or decreasing () order. The algorithm stretches the cycle from to , covering ; see Figure 1.
Correctness.
The correctness of Algorithm 1 follows from the following claim about the boundary structure, which will be important in Section 6. We let be the restriction of cycle to the simplices whose times lie in the interval , i.e., .
Claim 5.
Given input cycle , the boundary of the lifted cycle satisfies where
Proof.
Simplex generates a cell iff (Line 5). This cell contributes to the boundary of . Simplex generates a cell iff . By definition this is the case only for simplices in . It follows that if , then is a cycle in . What is not immediate is why the chains and in Lines 9 and 13 are in the prism .
Claim 6.
Chains and added in Lines 9 and 13 of Algorithm 1 are present in .
Proof.
Without loss of generality, assume . There are three types of chains:
-
1.
. has a non-zero coefficient on this interval iff (Line 5), in which case . Existence of coface implies .
-
2.
. From Equation 4, .
-
3.
. Suppose that this chain is not in . This means . Equation 4 implies that has no cofaces in . Therefore, and . Therefore, . Since we assumed is a relative cycle in , it cannot have any boundary inside the interval itself – a contradiction.
Remark.
It may seem strange that we state nothing about the relationship between and . This is because we exploit additional properties of the algorithm in Section 6.
Running time.
Let be the size of the input -cycle . Algorithm 1 requires time. It goes through all simplices that are faces of some simplex ; there are at most such -simplices. For each one, the algorithm sorts its cofaces by their times . Although there is no bound on the number of cofaces of one simplex, the total number of the cofaces to sort is again at most . Individual operations in the update take constant time, so the overall running time of Algorithm 1 is . The size of the lifted cycle is .
5 Apex Representatives
We give a characterization of apex representatives in the prism.
Lemma 7 (Closed endpoint).
-
1.
If a relative cycle , with , contains with , then
-
2.
If a relative cycle , with , contains with , then
-
3.
If an absolute cycle , with , contains and with and , then
Proof.
All the statements follow the same proof, except for the endpoints of the intervals, so we only show the first. Suppose there is some class that maps to . Then for some chain . Since and (since ), must be in . But from Equation 4, has no cofaces in – a contradiction.
Lemma 8 (Open endpoint).
-
1.
If the boundary of a relative cycle , with , contains with , then
-
2.
If the boundary of a relative cycle , with contains with , then
-
3.
If the boundary of a relative cycle , with , contains with and with , then
Proof.
All the statements follow the same proof, except for the endpoints of the intervals, so we only show the first. From the long exact sequence of the triple [14, p. 118], , we know that the image in the claim is equal to the kernel of the map induced by the boundary,
For to be in the kernel, needs to be a relative boundary in , i.e., for some and . Since and , must be in . But from Equation 4, has no cofaces in – a contradiction.
Theorem 9.
-
1.
If a relative cycle , with , contains with , and its boundary contains with , then is an apex representative.
-
2.
If a relative cycle , with , contains with , and its boundary contains with , then is an apex representative.
-
3.
If an absolute cycle , with , contains and with and , then is an apex representative.
-
4.
If the boundary of a relative cycle , with , contains with and with , then is an apex representative.
6 Four Intervals
We assume is the decomposition of the boundary matrix of the cone filtration in Equation 7. We use to refer to simplices in the base space , and for the simplices in the cone .
Summary.
The main content of this section is summarized in Table 1: lifting the stated cycles in , derived from the lazy reduction, with the given arguments to the algorithm Lift-Cycle, produces apex representatives in . The results of the section are more general, but also more verbose; they derive the expression for the cycles for any reduction.
EP type | LZZ type | Apex | |||
---|---|---|---|---|---|
Ordinary | closed-open | ||||
Relative | open-closed | ||||
Extended | open-open | ||||
Extended | closed-closed |
6.1 Ordinary (closed-open)
Throughout this subsection, we assume the following setting:
a pair in the cone filtration, born at and dying at , i.e., . |
Claim 10.
Let . Then . Furthermore, and .
Proof.
Since is upper-triangular with all diagonal entries non-zero, is the latest simplex in , i.e., . Therefore, . By definition, the boundary . is its latest simplex, i.e., . Therefore, . It follows that if , then .
Claim 11.
Relative cycle produced by the call is an apex representative in .
Proof.
6.2 Relative (open-closed)
Throughout this subsection, we assume the following setting:
a pair in the cone filtration, born at and dying at , i.e., . |
Claim 12.
Let chain consist of the boundary of the cone simplices in restricted to the base space,
Then . Furthermore, and .
Proof.
Let , i.e., .
-
1.
, .
Because consists of only cone simplices, every simplex is in the boundary of some cone simplex . Because is the latest simplex in , we have that is added before . Therefore, . Because is the only simplex in with in its boundary, .
-
2.
.
Let
be the shared boundary between the base space and the cone parts of . We note that because cone simplices can only be in the boundary of cone simplices, . Therefore, all the cone simplices are such that . Therefore, any simplex in the shared boundary is in .
Putting the first two sub-claims together, .
-
3.
.
is a persistence pair, therefore, . Because cone simplices can only be in the boundaries of cone simplices, . Because is the only simplex in that has in its boundary, it follows that
Remark 13 (Lazy reduction).
If is obtained by lazy reduction, it follows from Corollary 3 that consists only of cone simplices. In other words, the chain in the prior claim simplifies to
Claim 14.
Relative cycle produced by the call is an apex representative in .
Proof.
6.3 Extended (open-open)
Throughout this subsection, we assume the following setting:
a pair in the cone filtration born at , dying at , i.e., . |
Claim 15.
Let chain consist of those simplices in that are either in the cone, or in , i.e.,
Let ; then . Furthermore, .
Proof.
We split the chain into three parts:
and re-write its boundary accordingly: .
-
1.
, .
because is the latest simplex in . Putting together Equations 6 and 5, implies . Therefore, .
Because , but , .
-
2.
, .
Since chain , its boundary . Since boundaries , so is . Every simplex in has the form , where . Therefore, . It follows that .
Because , it has no cofaces in . Therefore, . Because is the only simplex in that has in its boundary, . Therefore, .
Therefore, .
Remark 16 (Lazy reduction).
Furthermore, if comes from the lazy reduction, then the next claim proves that in the previous claim, chain . In other words, we can use directly.
Claim 17 (Lazy reduction).
If is obtained from the lazy reduction, then all the base space simplices in are in , i.e., .
Proof.
Let be a simplex in . In this case, for the lazy reduction, because , from Corollary 3, taking and , we get
Claim 18.
Absolute cycle produced by the call is an apex representative in .
Proof.
6.4 Extended (closed-closed)
Throughout this subsection, we assume the following setting:
a pair in the cone filtration, born at , dying at , i.e., . |
Claim 19.
Let . Then . Furthermore,
and .
Proof.
We split the chain into three parts:
and re-write its boundary accordingly:
-
1.
.
Putting together Equations 6 and 5, implies . Because is the latest simplex in and , . It follows that . Since , we have . Therefore, .
-
2.
We already saw that . To show that it is also in , we note that since , its boundary . . Therefore, .
Because , Equation 4 implies that cannot be in the boundary of any simplex since . Since and , it follows that .
-
3.
Because is the only simplex in that has in its boundary, .
Claim 20 (Lazy reduction).
If is obtained from the lazy reduction, then is in , , and .
Proof.
We only need to show that, in the case of the lazy reduction, , i.e., . Then the claim follows from the previous Claim 19.
The proof is analogous to the proof of Claim 17. Let be a simplex in . In this case, for the lazy reduction, because , from Corollary 3, taking and , we get
Remark 21.
can be empty. For example, if is a single vertex that appears at and then disappears at . This is the reason why we need to bootstrap Algorithm 1 with an initial cycle .
Claim 22.
Relative cycle produced by the call is an apex representative in .
Proof.
7 Zigzag Representatives
To solve our original problem – to find zigzag representatives – all that remains is to map an apex representative into an appropriate space in the zigzag. Because we assumed in Remark 4 that no simplex is supported on a single space, the death and the birth for any given interval are distinct. Suppose we want to map an apex representative , with , into a zigzag representative in .
If is odd, then the -cycle may contain a vertical cell , which does not map into -cells in the levelset. To sidestep this complication, we perturb . Let be a real value near that lies inside our birth-death interval. We assume ; the other case is symmetric. We implicitly subdivide at and extend our previous notation to allow for the interlevel sets ending at , e.g., . We note that deformation retracts onto , with the homotopy following the second (time) coordinate, and includes into once the cell times are shifted from to . We want to compose three maps:
Because is an apex of a diamond, and is a space in the diamond, the first map is an inclusion (of pairs), so remains a relative cycle in . Let
then the second (boundary) map takes to
which maps into by dropping the second coordinate,
By storing the apex cycle in an interval tree [11, 15], we can retrieve any zigzag representative of size in time .
References
- [1] Ulrich Bauer, Magnus Bakke Botnan, and Benedikt Fluhr. Structure and interleavings of relative interlevel set cohomology. arXiv [math.AT], August 2021. arXiv:2108.09298.
- [2] Paul Bendich, Herbert Edelsbrunner, Dmitriy Morozov, and Amit Patel. Homology and robustness of level and interlevel sets. Homology, Homotopy and Applications, 15(1):51–72, 2013.
- [3] Gunnar Carlsson and Vin de Silva. Zigzag persistence. Foundations of computational mathematics, 10(4):367–405, August 2010. doi:10.1007/s10208-010-9066-0.
- [4] Gunnar Carlsson, Vin de Silva, Sara Kališnik, and Dmitriy Morozov. Parametrized homology via zigzag persistence. Algebraic & Geometric Topology, 19(2):657–700, March 2019. doi:10.2140/agt.2019.19.657.
- [5] Gunnar Carlsson, Vin de Silva, and Dmitriy Morozov. Zigzag persistent homology and real-valued functions. In Proceedings of the Twenty-fifth Annual Symposium on Computational Geometry, SCG ’09, pages 247–256, New York, NY, USA, 2009. ACM. doi:10.1145/1542362.1542408.
- [6] David Cohen-Steiner, Herbert Edelsbrunner, and John Harer. Extending persistence using Poincaré and Lefschetz duality. Foundations of computational mathematics, 9(1):79–103, February 2009. doi:10.1007/s10208-008-9027-z.
- [7] Tamal K. Dey and Tao Hou. Fast Computation of Zigzag Persistence. In Shiri Chechik, Gonzalo Navarro, Eva Rotenberg, and Grzegorz Herman, editors, 30th Annual European Symposium on Algorithms (ESA 2022), volume 244 of Leibniz International Proceedings in Informatics (LIPIcs), pages 43:1–43:15, Dagstuhl, Germany, 2022. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.ESA.2022.43.
- [8] Tamal K Dey, Tao Hou, and Dmitriy Morozov. Apex representatives. arXiv [cs.CG], February 2025. arXiv:2502.17704.
- [9] Tamal K. Dey, Tao Hou, and Dmitriy Morozov. A fast algorithm for computing zigzag representatives. In Yossi Azar and Debmalya Panigrahi, editors, Proceedings of the 2025 Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2025, New Orleans, LA, USA, January 12-15, 2025, pages 3530–3546. SIAM, 2025. doi:10.1137/1.9781611978322.116.
- [10] Herber Edelsbrunner, David Letscher, and Afra Zomorodian. Topological persistence and simplification. Discrete & computational geometry, 28(4):511–533, November 2002. doi:10.1007/s00454-002-2885-2.
- [11] Herbert Edelsbrunner. Dynamic data structures for orthogonal intersection queries. Technical Report F59, Inst. Information Process., Graz University of Technology, Austria, 1980. URL: https://pub.ista.ac.at/˜edels/Papers/1980-01-OrthogonalIntersectionQueries.pdf.
- [12] Herbert Edelsbrunner and John Harer. Computational Topology: An Introduction. American Mathematical Soc., 2010.
- [13] Herbert Edelsbrunner and Dmitriy Morozov. Persistent homology. In Jacob E Goodman, Joseph O’Rourke, and Csaba D Tóth, editors, Handbook of Discrete and Computational Geometry. CRC Press, 2017. URL: https://www.csun.edu/˜ctoth/Handbook/chap24.pdf.
- [14] Allen Hatcher. Algebraic Topology. Cambridge University Press, 2002.
- [15] Edward M. McCreight. Efficient algorithms for enumerating intersecting intervals and rectangles. Technical Report CSL-80-9, Xeox Palo Alto Research Center, 1980.
- [16] Dmitriy Morozov and Primoz Skraba. Persistent (co)homology in matrix multiplication time. arXiv [math.AT], December 2024. doi:10.48550/arXiv.2412.02591.
- [17] Arnur Nigmetov and Dmitriy Morozov. Topological optimization with big steps. Discrete & computational geometry, 72(1):310–344, July 2024. doi:10.1007/s00454-023-00613-x.