On Solving Asymmetric Diagonally Dominant Linear Systems in Sublinear Time
Abstract
We initiate a study of solving a row/column diagonally dominant (RDD/CDD) linear system in sublinear time, with the goal of estimating for a given vector and a specific solution . This setting naturally generalizes the study of sublinear-time solvers for symmetric diagonally dominant (SDD) systems [Andoni-Krauthgamer-Pogrow, ITCS 2019] to the asymmetric case, which has remained underexplored despite extensive work on nearly-linear-time solvers for RDD/CDD systems.
Our first contributions are characterizations of the problem’s mathematical structure. We express a solution via a Neumann series, prove its convergence, and upper bound the truncation error on this series through a novel quantity of , termed the maximum -norm gap. This quantity generalizes the spectral gap of symmetric matrices and captures how the structure of governs the problem’s computational difficulty.
For systems with bounded maximum -norm gap, we develop a collection of algorithmic results for locally approximating under various scenarios and error measures. We derive these results by adapting the techniques of random-walk sampling, local push, and their bidirectional combination, which have proved powerful for special cases of solving RDD/CDD systems, particularly estimating PageRank and effective resistance on graphs. Our general framework yields deeper insights, extended results, and improved complexity bounds for these problems. Notably, our perspective provides a unified understanding of Forward Push and Backward Push, two fundamental approaches for estimating random-walk probabilities on graphs.
Our framework also inherits the hardness results for sublinear-time SDD solvers and local PageRank computation, establishing lower bounds on the maximum -norm gap or the accuracy parameter. We hope that our work opens the door for further study into sublinear solvers, local graph algorithms, and directed spectral graph theory.
Keywords and phrases:
Spectral Graph Theory, Linear Systems, Sublinear AlgorithmsCopyright and License:
2012 ACM Subject Classification:
Mathematics of computing Spectra of graphs ; Theory of computation Graph algorithms analysis ; Theory of computation Streaming, sublinear and near linear time algorithmsAcknowledgements:
We thank the anonymous reviewers for their valuable comments. Mingji Yang wishes to thank Prof. Shang-Hua Teng for inspiring discussions and encouragement, and Guanyu Cui for helpful discussions on effective resistance.Funding:
This research was supported by National Natural Science Foundation of China (No. 92470128, No. U2241212).Editor:
Shubhangi SarafSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
Solving systems of linear equations is one of the most fundamental problems in numerical linear algebra and theoretical computer science. In the classic version of this problem, we are given a matrix and a vector in , and the goal is to compute a solution vector satisfying . Beyond general-purpose solvers for arbitrary linear systems (e.g., using fast matrix multiplication or the conjugate gradient method), extensive research has focused on developing efficient solvers for special classes of systems.
In particular, for the important classes of Laplacian and symmetric diagonally dominant (SDD) systems, the breakthrough work of Spielman and Teng [39, 40] established the first nearly-linear-time (in ) solvers. This gave rise to the influential Laplacian Paradigm, which revolutionized algorithmic graph theory and numerical linear algebra, with widespread applications ranging from network science to machine learning (see, e.g., [42, 44]). Among subsequent efforts to generalize the SDD solvers, a line of work [13, 12, 11] developed nearly-linear-time solvers for asymmetric row/column diagonally dominant (RDD/CDD) systems, which significantly expanded the scope of the Laplacian Paradigm.
On the other hand, partly motivated by the advances in quantum algorithms for solving linear systems in sublinear time [24], Andoni, Krauthgamer, and Pogrow [4] pioneered the study of classical algorithms for approximately solving a single entry of SDD systems in sublinear time. Under specific access models and error measures, they established:
-
a lower bound for general SDD systems, where is the condition number of .
The second result demonstrates the necessity of a quadratic dependence on the condition number (equivalently, the reciprocal of the spectral gap) for sublinear-time SDD solvers.
In light of the previous research for nearly-linear-time RDD/CDD solvers and sublinear-time SDD solvers, it is natural to ask whether the sublinear-time SDD solvers can be extended to the more general RDD/CDD cases. In this paper, we initiate a study in this direction and give partial positive answers to this question. We show that the sublinear-time solvers for well-conditioned SDD systems can be extended to “well-structured” RDD/CDD systems, provided that we first define an appropriate generalization of the key structural quantity, the spectral gap for symmetric matrices, to asymmetric matrices. We achieve this by re-characterizing the problem’s mathematical structure and introducing a new concept called the maximum -norm gap.
Algorithmically, the sublinear SDD solver in [4] works by solely generating random-walk samplings based on to approximate a truncated Neumann series of the solution. In contrast, we conduct a deeper investigation of the complexity upper bounds by applying two techniques in addition to random-walk sampling: the local push method, which performs local exploration in ; and the bidirectional method, which integrates random-walk sampling with local push. Together, we derive a suite of upper bounds for solving RDD/CDD systems under diverse access models and error measures. For instance, we extend the algorithmic result in [4] to RDD systems and derive new results for RCDD systems with smaller dependence on some parameters.
Our algorithmic toolkit and investigation of the diverse upper bounds are inspired by recent advances in local algorithms for estimating PageRank [9] and effective resistance [17] on graphs [50, 49, 47, 45, 6, 43, 14, 52], which are important special cases of solving RDD/CDD systems. The techniques of random-walk sampling [39, 19], local push [2, 1], and the bidirectional method [32] have been extensively studied for these problems, and recent works have further uncovered their new properties and optimality in certain settings. Nonetheless, previous works typically analyze their applications to PageRank and effective resistance computation separately. Our perspective of formulating these problems as solving linear systems, however, provides a more general and unified framework for understanding these techniques and problems, revealing their deeper connections. As we shall see, this bigger picture yields novel insights, extended results, and improved complexity bounds.
Notably, our perspective reveals a connection between two fundamental local push algorithms on graphs, namely ForwardPush [2] and BackwardPush [1]222Their original names are ApproximatePageRank and ApproxContributions, respectively.. These algorithms iteratively perform local push operations to explore the graph in opposite directions. Although both algorithms share similar approaches, they have been treated as distinct methods for different problems, with each analyzed separately. In contrast, by abstracting both methods as a single algebraic primitive, we demonstrate that ForwardPush and BackwardPush are equivalent to applying this primitive to different linear systems. This characterization helps to explain their distinct properties and enables unified analysis of both approaches.
On the lower-bound side, our framework inherits the hardness result for sublinear-time SDD solvers, establishing the necessity of our assumption on the maximum -norm gap; also, known lower bounds for local PageRank computation imply lower bounds on the accuracy parameter for our setting. As our work bridges the study of sublinear-time solvers and local graph algorithms, we believe that further investigation could uncover more connections and results for these topics.
In the remainder of this section, we formally define the problem, present our main contributions, and provide a technical overview.
1.1 Basic Notations
For , we define . We call a matrix RDD (row diagonally dominant) if it satisfies for all and its diagonal entries are positive. We call a matrix CDD (column diagonally dominant) if its transpose is RDD, and call a matrix SDD (symmetric diagonally dominant) if it is symmetric and RDD. It is well-known that any SDD matrix is PSD (positive semidefinite). We call a square matrix Z-matrix if its off-diagonal entries are nonpositive. We use to denote the -th canonical unit vector and to denote the all-one vector. For any matrix or vector, we use to denote taking the entrywise absolute value.
We call two real numbers Hölder conjugates (or is conjugate to ) if they satisfy . By convention, we also formally let and view and as Hölder conjugates. For any , we use to denote the -norm of a vector , and to denote the matrix norm induced by vector -norm of a matrix .
Restriction and Pseudoinverse.
For a subspace , we use to denote the restriction of the linear map to , with induced norm for any . We write the pseudoinverse (a.k.a. Moore–Penrose inverse) of as .
Spectral Gap.
For an SDD matrix , we define its spectral gap as half the smallest nonzero eigenvalue of , where is the diagonal matrix that satisfies for each . The (effective) condition number of , denoted by , is defined as the ratio between the largest and smallest nonzero eigenvalues of . It holds that .
Graphs.
We consider directed graphs , with and . We assume that . If , we write . We denote the (possibly weighted) adjacency matrix as . For each , we define its indegree and outdegree . The outdegree matrix is the diagonal matrix with for each . We denote ’s minimum and maximum outdegree as and , respectively.
Eulerian Graphs and Laplacian.
We call a graph Eulerian if for all . On Eulerian graphs, we simply write , , and . Undirected graphs constitute a special case of Eulerian graphs where each edge corresponds to two directed edges in opposite directions. The directed Laplacian matrix is defined as , which satisfies and is CDDZ. is RCDDZ for Eulerian graphs and is SDDZ for undirected graphs.
1.2 Problem Formulation
We consider a linear system , where is an RDD/CDD matrix and , and a coefficient vector . We assume that and all nonzero entries in , , and have absolute values in . We assume that the algorithms are given the dimension and have oracle access to , , and via the following basic queries:
-
Diagonal queries for : return in time for a given index ;
-
Row/column queries for : return the indices and corresponding values of nonzero entries for a specified row/column of , in time linear in the number of returned indices;
-
Entrywise queries for and : return or in time for a given index .
Our results will assume additional access operations, which will be specified in the statements.
Following the concept of local computation algorithms [36] and the previous work [4], we consider a fixed solution that is determined by and , and require invoking the algorithm with different and accuracy parameters returns estimates of that are all consistent with the “global” solution . Our choice of will be given in Theorem 1.
We also consider the problems of computing (Personalized) PageRank [9] and effective resistance [17] on graphs, viewing them as special cases of our formulation of solving RDD/CDD systems. For these graph problems, we assume the standard adjacency-list model [22], where each degree query takes time and each neighbor query returns a neighbor index along with the edge weight in time.
For Personalized PageRank (PPR), we consider a directed graph , a decay factor , and a source distribution . To ensure that PPR is well-defined, we assume that . The PPR vector is defined as the unique solution to the following two equivalent forms of the PPR equation:
| (1) | |||
| (2) |
Both equations can be viewed as linear systems of the form , where the coefficient matrices and are both CDDZ and invertible. Note that for the second form, the solution to the corresponding system is , an outdegree-scaled version of the PPR vector. We define the PPR value from to as , and the PageRank vector as . It holds that for all . We also consider the following two equivalent forms of the PageRank contribution equation:
| (3) | |||
| (4) |
where is a specified target node and is called the PageRank contribution vector to . It holds that for all . Similarly, both equations can be viewed as linear systems of the form , where the coefficient matrices and are both RDDZ and invertible. Note that for the second form, the corresponding vector is .
For effective resistance, we consider a connected undirected graph and two distinct nodes . The effective resistance (a.k.a. resistance distance) between and , denoted by , is defined as the equivalent resistance between if the graph is thought of as an electrical network with each edge having resistance . Algebraically, . As we will establish in Lemma 27, setting and in our formulation yields . Here, is SDDZ and .
1.3 Previous Work for SDD Systems
[4] studies the case when the linear system is for some SDD matrix . Define as the diagonal matrix that satisfies for each and . They formulate the fixed solution as and give a Neumann series expansion of . For the algorithmic results, they assume that the algorithm is given , an upper bound on (alternatively, as a lower bound on ) and set a truncation parameter for the Neumann series as .
Based on the truncated Neumann series, they present a randomized algorithm that, given a coordinate , computes an estimate satisfying with probability at least . The algorithm runs in time , where is the maximum time cost to simulate one step in the random walk defined by . This result is implicit in the proof of [3, Theorem 5.1].
On the negative side, [4] proves an query lower bound (in terms of probing ) for achieving a weaker absolute error bound of , for and . The matrix in their hard instance is a Laplacian matrix of a fixed unweighted undirected graph with maximum degree and thus satisfies . Therefore, this lower bound can also be written as . To our knowledge, no other work has explicitly studied sublinear-time SDD/RDD/CDD solvers.
1.4 Formulation of and the -Norm Gaps
Our first contribution is a Neumann-series-based characterization of a solution to , which is consistent with the solution considered by [4] for SDD systems.
We decompose uniquely as , where is a diagonal matrix and all diagonal entries of are . Define and .
The next theorem gives the definition of and its properties.
Theorem 1.
For any RDD/CDD and , define to be
Then is well-defined and satisfies . If is SDD, then .
Next, we study a truncated version of and upper bound the truncation error. As previous analysis based on eigendecomposition is not directly applicable to asymmetric matrices, we introduce a novel concept called the -norm gap of : for any , we define the -norm gap of as
where is conjugate to . We further define the maximum -norm gap of as .333One could alternatively define , which yields a possibly smaller quantity but our main results continue to hold with this definition. To our knowledge, no prior work has explicitly studied these quantities.
The following theorem guarantees that for any RDD/CDD matrix , the maximum -norm gap lies in , and when is SDD, coincides with the spectral gap . Thus, it is a natural generalization of the spectral gap to asymmetric matrices.
Theorem 2.
If is RDD/CDD, then ; if is SDD, then .
We will devise algorithms to estimate the truncated version of , defined as
| (5) |
for an integer truncation parameter . The next theorem upper bounds the truncation error in terms of a given lower bound on the maximum -norm gap .
Theorem 3.
Suppose . To ensure that , it suffices to set
| (6) |
where denotes the largest diagonal entry in .444Here and after, we use and to hide polylogarithmic factors in , , , and (reciprocals of) quantities in , , and .
Relationship with the Formulation in [4].
Our formulations of and the maximum -norm gap generalize the ones in [4], in the sense that when is SDD, matches their solution and equals , the spectral gap of . Therefore, the quadratic lower bound on for SDD systems in that paper translates into a quadratic lower bound on for general RDD/CDD systems. Also, our setting of the truncation parameter in Equation (6) matches theirs when is SDD and is a canonical unit vector.
1.5 Main Algorithmic Results
For our algorithmic results, we assume that the algorithm is given a quantity as a lower bound on and an accuracy parameter . We use , , and the specific terms in the accuracy guarantee to set the truncation parameter according to Equation (6), where we assume that suitable upper bounds on the quantities in the logarithmic factor are known.
The statements of our results will use the following definition. For RDD , we use to denote the maximum time cost to simulate a single step in the random walk defined by the row substochastic matrix . This quantity depends on the structure and representation of . For instance, if each row of has at most nonzero entries, then ; if the nonzero entries in have equal absolute values and we are allowed to sample a uniformly random index of the nonzero entries in each column of in time, then . For CDD , we define .
Each of our algorithmic results excels under different system types, access models, and parameter regimes. We present a selection of our results here, with additional results given in the full version of this paper [28]. Furthermore, a remark at the end of this subsection establishes that most of our results have “symmetric” counterparts (e.g., for CDD systems) that can be obtained by exchanging the roles of certain quantities.
We first present our results of using random-walk sampling for RDD systems.
Theorem 4.
Suppose that is RDD, we can sample from the distribution in time, and is known. Then there exists a randomized algorithm that computes an estimate such that in time .
This theorem subsumes the main algorithmic result of [4] for SDD systems and extends it to the RDD case while achieving a mild -factor improvement. Their original result is recovered as a special case when is SDD and is a canonical unit vector. Our algorithm is similar to theirs, but we achieve the improved complexity by adopting a different random-walk sampling scheme.
We also show that for RDDZ along with nonnegative vectors and , if we allow the relaxed error bound , the complexity can be improved to depend quadratically on via a variance analysis of the random-walk sampling process. We adopt this error measure since it provides a natural accuracy guarantee and has been previously studied in [4]. It is shown in [4] that for RDD systems.
Theorem 5.
Suppose that is RDDZ, , we can sample from the distribution in time, and is known. Then there exists a randomized algorithm that computes a such that in time .
The following theorem further considers the relative error guarantee of . The complexity depends quadratically on and linearly on and is also achieved using random-walk sampling.
Theorem 6.
Suppose that is RDDZ, , we can sample from the distribution in time, and are known, and . Then there exists a randomized algorithm that computes an estimate such that in expected time
Our next result leverages the local push method to derive a deterministic algorithm for special RCDD systems, whose complexity depends linearly on .
Theorem 7.
Suppose that is RCDD, its nonzero entries have absolute values of , and we can scan the nonzero entries of in time. Then there exists a deterministic algorithm that computes a such that in time plus .
Lastly, applying the bidirectional method to special RCDD systems yields complexity bounds with improved dependence on and , achieving either or using different parameter settings.
Theorem 8.
Suppose that is RCDD, its nonzero entries have absolute values of , we can sample from the distribution in time, and are known, and we can scan through the nonzero entries of in time. Then there exists a randomized algorithm that computes an estimate such that in time plus
Remark.
As we shall see, all theorems in this subsection except Theorem 5 still hold if we replace RDD/RDDZ by CDD/CDDZ, swap and (except in ), and replace by in the statements. Theorem 5 is an exception because its proof relies on the property that for RDD , which does not have a straightforward analog for the CDD case.
1.6 Connections to PageRank and Effective Resistance Computation
Our results can be directly applied to the PPR or PageRank contribution equations and the single-pair effective resistance problem to yield complexity bounds for different graph types under various access models and accuracy guarantees. Notably, as we shall see, for PageRank computation, half the decay factor serves as a lower bound on the maximum -norm gap of the corresponding systems; for effective resistance computation, the spectral gap of the graph Laplacian equals the maximum -norm gap of the system. Rather than exhaustively presenting all applications of our results, here we highlight selected results that generalize and improve upon the previously best complexity bounds.
The following theorem is derived by applying Theorem 6 to the PageRank equation (2) combined with tighter lower bounds on that we establish for Eulerian graphs.
Theorem 9.
For any unweighted Eulerian graph , given the decay factor , a target node , and , there exists a randomized algorithm that estimates the PageRank value within relative error with probability at least in time
This improves over the previously best upper bounds of , which was given by [45] and stated for unweighted undirected graphs. Our algorithm is essentially the same as that in [45], which generates random walks from the target node , and the improvement comes from our tighter lower bounds on for Eulerian graphs.
For effective resistance computation, recall that we consider connected undirected graphs . Lemma 27 establishes that the value that our algorithms approximate equals when setting and . Thus, Theorems 4 and 8 directly imply the following complexity bounds for estimating effective resistance.
Corollary 10.
For any connected unweighted undirected graph , given nodes and as a lower bound on , there exists a randomized algorithm that estimates the effective resistance within absolute error with probability at least in time
where .
This subsumes the previous bounds of given by [14] with essentially the same setting of . Moreover, since can be lower bounded by [34, Corollary 3.3], by setting the absolute error parameter to be , the last bound in our result implies that an estimate of within relative error can be computed in time . This improves over the previous bound of given by [52]. Our algorithms are essentially the same as those in [14, 52], i.e., using random-walk sampling and the bidirectional method, and the improvements stem from our simpler sampling scheme that avoids using extra data structures and a refined analysis.
On the other hand, known hardness results for local PageRank and effective resistance computation can potentially yield lower bounds for sublinear-time solvers. We highlight the following result, derived by establishing a reduction from estimating single-node PageRank on undirected graphs to solving SDD systems and applying the lower bound for PageRank computation from [45].
Theorem 11.
For any large enough and , there exist and that satisfy the following. Every randomized algorithm that, given access to an invertible SDD matrix whose spectral gap is , succeeds with probability at least to approximate within absolute error , must probe coordinates of in the worst case. Here, .
1.7 Understanding ForwardPush and BackwardPush on Graphs
Inspired by the local push algorithms ForwardPush and BackwardPush, we formulate our Push algorithm as a unified primitive for estimating a summation of matrix powers applied to a vector, which is closely related to our approach to solving RDD/CDD systems. This abstraction reveals that ForwardPush and BackwardPush, despite appearing as distinct algorithms for two problems with different propagation strategies, are equivalent to applying Push to different linear systems, modulo variable scaling. The apparent differences arise from the scaling and the distinct behaviors that Push exhibits on different types of linear systems.
Specifically, for PageRank computation, ForwardPush from node corresponds to applying Push to Equation (2) with and outdegree scaling, while BackwardPush from node applies Push to the contribution equations (3) or (4). Our analysis demonstrates that Push provides closed-form complexity bounds on certain CDD systems and accuracy guarantees on RDD systems, which explains the known computational properties of ForwardPush and BackwardPush on directed graphs.
For RCDD systems, Push inherits both complexity and accuracy advantages, clarifying why ForwardPush and BackwardPush perform well on undirected graphs. This perspective further reveals that on Eulerian graphs, ForwardPush from any node is equivalent to BackwardPush from that node on the transpose graph, establishing a basic connection previously unrecognized.
1.8 Technical Overview
Our characterizations of the solution and the -norm gaps are based on fundamental properties of Neumann series and restricted linear maps. Since asymmetric matrices may be non-diagonalizable, the eigendecomposition techniques used for symmetric matrices become inapplicable. We address this challenge by analyzing the operator norms of restricted linear maps to establish series convergence and derive truncation error bounds.
Our algorithms estimate by adapting three techniques for random-walk probability estimation on graphs: random-walk sampling [39, 19], local push [2, 1], and the bidirectional method [32].
When is RDD, the matrix is row substochastic, enabling us to interpret as an expectation over random walks that start from the distribution and transition according to . Such random walks may terminate early, and the algorithm needs to record the signs of the entries in along the walk. This method extends the approach in [4], and we adopt a different sampling scheme and conduct variance analysis in some special cases to reduce the dependence on in the complexity.
Based on local push methods, we formulate a Push primitive that estimates the vector through deterministic local computation. Push maintains coordinate variables initialized to and iteratively applies the linear operator to selected coordinates. Our analysis relies on invariant properties preserved by these operations, including an inequality variant that helps to handle negative entries. This approach yields closed-form accuracy guarantees for RDD systems and complexity bounds for special CDD systems. Combining these two aspects yield our result for special RCDD systems.
The bidirectional method combines random-walk sampling and local push from two directions. We adapt the BiPPR framework [32], performing Push from and exploiting the invariant property to construct an unbiased estimator for , which can be sampled via random walks from when is RDD. By balancing the costs of both components, we achieve improved dependence on and , particularly for RCDD systems.
Crucially, we can transpose the expression for to obtain an equivalent summation with replaced by and the roles of and exchanged. This allows us to alternatively apply Push from and random-walk sampling from when is CDD, leading to symmetric algorithmic procedures and complexity results.
1.9 Paper Organization
The remainder of this paper is organized as follows. Section 2 introduces more related work. Section 3 elaborates on our formulation of and -norm gaps and proves Theorems 1 to 3. In Sections 4, 5, and 6, we present our algorithms based on random-walk sampling, local push, and the bidirectional method, respectively, and prove Theorems 4 to 8. After that, Sections 7 and 8 relate our problem and results to local computation of PageRank and effective resistance, respectively. In the full version of this paper [28], we provide more preliminaries, discussions on future directions, detailed explanations of the relationship between our Push primitive and the ForwardPush and BackwardPush algorithms, additional results, and all omitted proofs.
2 Other Related Work
A vast literature exists on nearly-linear-time Laplacian solvers and their extensions, including solvers for undirected Laplacian/SDD systems (e.g., [39, 40, 26]) and directed Laplacian/RDD/CDD systems (e.g., [13, 11, 25]). These solvers achieve nearly linear time complexity in with polylogarithmic dependence on and the condition number. The development of global RDD/CDD solvers relies on reductions from solving RDD/CDD systems to solving Eulerian Laplacian systems, combined with efficient methods for computing PPR vectors with small and the stationary distribution of random walks on graphs [13]. However, the global techniques from algorithmic linear algebra and known reductions to the Eulerian case do not directly apply to the local setting, revealing a fundamental distinction between global and local RDD/CDD solvers. In fact, the lower bounds established in [3] and this work demonstrate that local SDD solvers require polynomial dependence on and , indicating a separation between global and local SDD solvers.
[16] develops probabilistic logspace solvers for certain classes of directed Laplacian systems. Their method also relies on approximating truncated Neumann series, but they bound the truncation error using spectral radius and Jordan normal form, which yield truncation parameters of at least . Such a huge truncation parameter makes their algorithm and analysis inapplicable to the sublinear-time setting. As an aside, in the quantum regime, [41] presents algorithms for inverting well-conditioned matrices in quantum logspace, but their approaches are not directly applicable to our classical sublinear-time framework.
The idea of using random-walk sampling to solve linear systems dates back to the von Neumann-Ulam algorithm for approximating matrix inversion [20, 48]. The bidirectional method for estimating random-walk probabilities on graphs is first proposed in [33], which is inspired by property testing techniques [23, 27] and later simplified by the BiPPR framework [32].
The bidirectional idea has been widely applied to compute PageRank [5, 31, 32, 8, 49, 47, 6, 43], effective resistance [14, 52], heat kernel [8], and Markov Chain transition probability [5]. Among them, [47] proves that the simple BiPPR framework computes single-node PageRank on unweighted directed graphs in optimal time complexity (in terms of and ). Their analysis relies on a new complexity bound of BackwardPush, which is parameterized by the PageRank value of the target node. Recently, [52] shows that the bidirectional technique can yield faster algorithms for constructing effective resistance sketch (as defined in [18]) on expander graphs, and [43] combines random-walk sampling with a novel randomized local push technique to improve the complexity of estimating single-node PageRank on directed graphs with bounded in-degree.
[37] uses the bidirectional method to estimate a single element in the product of a matrix power and a vector, which relates to our estimation of . However, they only derive an average-case complexity bound under some bounded-norm conditions and discuss its applications to solving PSD systems. In contrast, we conduct a more comprehensive study of this problem and apply it to solving RDD/CDD systems.
PageRank and PPR have been extensively studied and widely applied; we refer interested readers to the surveys [29, 21, 50]. More lower bounds for PageRank computation can be found in [8, 50, 6] and references therein. The recent work [6] conducts a comprehensive study of various types of PPR estimation problems using different graph access queries under both worst-case and average-case settings. For constant decay factors, they provide nearly tight complexity upper and lower bounds for achieving constant relative error guarantees when the target value is above a given threshold.
Effective resistance is ubiquitous in spectral graph theory [17, 34, 38, 25]. A line of work [35, 51, 14, 52] focuses on locally estimating single-pair effective resistance through multi-step random-walk probabilities. [10] studies this problem on non-expander graphs and establishes strong complexity lower bounds, though it does not explicitly give a lower bound on the spectral gap of the graph. Recently, [52] provides a lower bound on the relative error parameter for this problem. Besides, a line of work [30, 18, 52] studies the problem of constructing effective resistance sketches. Technically, [30] leverages random-walk sampling, [18] uses count sketches and SDD solvers, and [52] employs a bidirectional approach.
3 Formulation of and the -Norm Gaps
In this section, we prove Theorems 1, 2, and 3 and give further explanations on our formulations of and the -norm gaps.
First, we need the following lemma that upper bounds for certain and Hölder conjugates . Its proof is given in the full version of this paper [28].
Lemma 12.
The following hold:
-
1.
If is RDD, then .
-
2.
If is CDD, then .
-
3.
If is RCDD, then for any Hölder conjugates .
To establish our formulation of in Theorem 1, we use the following lemma, which characterizes the operator norm and Neumann series of certain restricted linear maps. The proof of this lemma is given in the full version of this paper [28].
Lemma 13.
Suppose and is the operator norm induced by some vector norm . If , then, for , we have and
Proof of Theorem 1.
First assume that is RDD. Applying Lemmas 12 and 13, we have and
As , we have , so converges to
Thus, we can check that
Next, observe that for any Hölder conjugates , we have
for each , so
for any Hölder conjugates .
When is CDD, we consider the expression . We have by Lemma 12, so applying the above arguments shows that converges to and
So we have proved that is well-defined and satisfies when is RDD or CDD.
Next, assume that is SDD. Then is RCDD and Lemma 12 implies that . Repeating the above arguments with gives
Since is symmetric, is also symmetric, so . By the property of the pseudoinverse, we have , finishing the proof.
Recall that if is RDD, then , and this quantity measures how strongly the diagonal entries dominate the off-diagonal entries in each row. However, this quantity can equal zero, making it unsuitable as a useful notion of “gap.” In contrast, our Theorem 2 shows that the maximum -norm gap is always strictly positive when is RDD/CDD. As an example, , which refines the quantity by replacing with and restricting the operator to .
4 Random-Walk Sampling
This section presents a Monte Carlo algorithm for estimating via random-walk sampling. All our algorithms in this paper aim to estimate by approximating . By Theorem 3 and our setting of , it suffices to ensure that the estimate satisfies that is at most half the desired accuracy guarantee with probability at least , and we will omit this matter in the following proofs.
We first focus on RDD systems and transfer the results to CDD systems by transposing the expression of at the end of this section. When is RDD, is row substochastic and we can estimate
by generating random-walk sampling from . In each sample, we first sample a walk length from uniformly at random and sample a source coordinate from the distribution . Then we simulate a lazy random walk for steps starting from according to the transition matrix . Specifically, at each step from , with probability the walk stays put at , with probability the walk moves to each , and with the remaining probability the walk terminates, where . Note that these probabilities lie in since is RDD. Additionally, we keep track of the product of the signs of the initial entry in and the entries in along the walk (where stay-put steps have sign ), which is denoted as . After steps, if the walk has not terminated and is at coordinate , we take the value as the estimate of this sample. We repeat this process for independent samples and return the average as the final estimate . We give a pseudocode for this approach in the full version of this paper [28].
We emphasize that our sampling scheme is different from the framework in [4], in that we first sample the walk length and then perform steps of the random walk, while they perform steps in each sample and take the quantities obtained in each step into account. As it turns out, our scheme is easier to analyze and will save a factor of in the number of samples.
We first establish the unbiasedness of the sampling scheme, whose proof is given in the full version of this paper [28].
Lemma 14.
Each sample described above gives an unbiased estimate of .
We can now prove Theorem 4 by applying the Hoeffding bound.
Proof of Theorem 4.
We use random-walk sampling as described above. The algorithm returns as the average of independent samples, where each sampled value has absolute value at most . Thus, by Lemma 14 and the Hoeffding bound, is upper bounded by
To guarantee that this probability is at most , we set .
As each sample simulates at most steps of random walk, and each step takes time, the time complexity is , as desired.
The proof of Theorem 5 relies on a variance analysis when , , and are nonnegative. Additionally, Theorem 6 assumes that is known and provides a relative error guarantee. Its proof relies on the Stopping Rule Theorem given in [15] for adaptively setting the number of samples in Monte Carlo estimation. The proofs of Theorems 5 and 6 are given in the full version of this paper [28].
On the other hand, if is CDD, then is RDD and is row substochastic. As we can transpose the expression of to obtain
our algorithms and results for RDD systems (except Theorem 5) applies to CDD systems by interchanging with and replacing with . This justifies our claim that we can derive symmetric results by replacing RDD/RDDZ by CDD/CDDZ, swapping and (except in ), and replacing by in the theorem statements. This argument also straightforwardly applies to our subsequent algorithms and results based on local push and the bidirectional method.
5 The Local Push Method
In this section, we adapt the local push methods to estimate . We first describe our Push algorithm as a primitive that can be applied to both RDD and CDD systems. After that, we establish different properties of Push for RDD and CDD systems. This leads to our proof for Theorem 7.
For both RDD and CDD systems , we describe our Push algorithm as a primitive that can be used for approximating the vector . The pseudocode is given in Algorithm 1.
In the Push algorithm, the initialization step sets the reserve and residue vectors to , except that is set to be , which requires time if we assume that we can scan through the nonzero entries of in time. Next, the main loop iterates over levels from to . At each level , the algorithm performs a local push operation on each coordinate whose residue exceeds the threshold in absolute value. The push operation on at level sets the reserve to , increments by , and sets to . In effect, the second step in the push operation increases by and increases by for each with , which can be done in time given oracle row/column access to .
The following lemma gives the key invariant property of the Push algorithm. Its proof is based on induction and can be found in the full version of this paper [28].
Lemma 15.
The push operations preserve the following invariant:
In light of this invariant, we use as an estimate of . Note that this quantity can be computed during the push process. The next lemma shows that if is RDD, then the absolute error between this quantity and can be bounded.
Lemma 16.
If is RDD, then .
Proof.
By Lemma 15 and Equation (5), we have
where we used for RDD and for any as guaranteed by the process of Push.
We will also use the following inequality version of the invariant to bound the running time of the Push algorithm, whose proof is similar to that of the invariant equation and can be found in the full version of this paper [28].
Lemma 17.
The push operations preserve the following inequality:
The next lemma bounds the complexity of the Push algorithm by a convoluted expression. We will shortly see how to simplify this expression for some special systems and settings.
Lemma 18.
Suppose that we can scan through the nonzero entries of in time. Then the complexity of the Push algorithm is bounded by
Proof.
Observe that each time a push operation is performed on a coordinate , the value of is increased by at least . However, by Lemma 17, is always upper bounded by . Therefore, the total number of push operations performed on is at most . Since each push operation on takes time, the lemma follows by summing the cost of the push operations over all and adding the time for initialization.
If is CDD and the nonzero entries of have absolute values of , the next lemma shows that the complexity of Push can be simplified. Its proof is given in the full version of this paper [28].
Lemma 19.
Suppose that is CDD, the nonzero entries of have absolute values of , and we can scan through the nonzero entries of in time. Then the complexity of the Push algorithm is .
Having established the properties of the Push algorithm for RDD and CDD systems, we can readily combine them to prove Theorem 7 for RCDD systems.
Proof of Theorem 7.
6 The Bidirectional Method
This section combines the techniques of random-walk sampling and local push to develop bidirectional algorithms for estimating , which leads to Theorem 8.
The framework of the bidirectional method for RDD systems is presented in Algorithm 2. First, we invoke the Push algorithm to obtain the reserve and residue vectors. Recall that the invariant equation of Push (Lemma 15) implies that
So, instead of directly using as an estimate of , we estimate the right-hand side of the above equation using random-walk samplings from to reduce approximation error. We employ the same sampling scheme as described in Section 4 to obtain a walk length and the coordinate reached by the random walk after steps, but each sampled value now involves the summation . We take the average of the estimates across independent samples and add it to to obtain the final estimate .
We note that, compared to the sampling scheme in [14], our approach eliminates the need for using additional data structures to maintain the prefix sums of residues, since we can directly compute in time per sample without increasing the asymptotic complexity.
The next lemma establishes the unbiasedness of the bidirectional estimator.
Lemma 20.
The sum of and each sampled value in the bidirectional method described above gives an unbiased estimate of .
Proof.
Following the proof of Lemma 14, we can show that the expectation of each sampled value is . Combining this with the invariant equation in Lemma 15 completes the proof.
Next, we prove Theorem 8 by proving the two stated complexity bounds separately in the following two lemmas. Their proofs are partly inspired by [14] and [52], respectively. The proof of Lemma 22 uses variance analysis and is given in the full version of this paper [28].
Lemma 21.
Suppose the same assumptions as in Theorem 8. Then there exists a randomized algorithm that computes a such that in time plus
Proof.
We use the bidirectional method as in Algorithm 2. Note that each sampled value equals for some and and the Push algorithm ensures that for each . Thus, the absolute value of each sampled value is at most . Using the Hoeffding bound, it follows that
To guarantee that this probability is at most , we set , where will be determined shortly.
By Lemma 19, the push cost is . We set , where can be computed in time given our assumptions. Consequently, the cost for random-walk sampling and push both becomes , completing the proof.
Lemma 22.
Suppose the same assumptions as in Theorem 8. Then there exists a randomized algorithm that computes a such that in time plus
Proof of Theorem 8.
7 Connections with PageRank Computation
This section discusses the connections between our framework and PageRank computation and presents proofs for Theorems 9 and 11.
As mentioned in Section 1, the PPR equations (1) and (2) and PageRank contribution equations (3) and (4) can be formulated as RDD/CDD systems. For example, by Equations (1) and (3), for a node , setting or in our formulation both yield ; by Equation (2), for nodes , setting yields . It is worth noting that on Eulerian graphs, Equations (2) and (4) are RCDD systems, and on undirected graphs, they are SDD systems.
By the definition of the -norm gaps, we have
where we used since is invertible. Similarly, we have . Thus, serves as a lower bound on the maximum -norm gap of all these matrices involved in the PPR and PageRank contribution equations.
7.1 Results for PageRank Computation when is RCDD
Our framework provides new insights and results for PageRank computation, in particular when the involved system is RCDD. Theorem 9 stated in the introduction is one such example, which shows that previous results for single-node PageRank computation on undirected graphs can be improved and generalized to Eulerian graphs.
To prove Theorem 9, we investigate the case when the matrix in Equation (2) is RCDD. This matrix is CDD, and it is also RDD if holds for all . In particular, this condition holds when is Eulerian or is large enough. Now, by applying Theorem 6, we directly obtain the following result.
Theorem 23.
For any unweighted graph and decay factor , suppose that holds for all . Then there exists a randomized algorithm that, given , , and accuracy parameter , computes an estimate of within relative error with success probability at least in time
where hides factors.
Proof.
Consider applying Theorem 6 to Equation (2) with and . Note that the corresponding equals , , , , and the obtained -multiplicative approximation of directly yields a -multiplicative approximation of . Therefore, the time complexity is
as desired.
To prove Theorem 9, we establish some lower bounds on PageRank values in the next lemma, which may be of independent interest. This lemma is partly inspired by [8, Lemma 5.13], [46, 45], and [47, Theorem 1.1], and we give its proof in the full version of this paper [28].
Lemma 24.
For any weighted directed graph and , we have
where and denote the entrywise -norm and the Frobenius norm, respectively. If is Eulerian, we further have ; if is unweighted Eulerian, we further have .
Proof of Theorem 9.
Theorem 23 gives the complexity bound . By Lemma 24, on unweighted Eulerian graphs, we have
By plugging these lower bounds on into the complexity bound, we obtain the desired results up to factors (where we omit the terms of since we often consider the case when ). These factors can be removed by using non-truncated random walks for sampling (cf. [45]), leading to the stated complexity bounds.
7.2 A Lower Bound on the Accuracy Parameter for SDD Solvers
This subsection proves Theorem 11. To this end, we establish the following reduction from single-node PageRank computation on undirected graphs to solving SDD systems.
Lemma 25.
Suppose that there exists a randomized algorithm that computes an estimate such that for any SDD system in time. Then there exists a randomized algorithm that, given , estimates on unweighted undirected graphs within constant relative error and with success probability at least in time .
To prove this lemma, we use the following upper bound on on Eulerian graphs, whose proof is given in the full version of this paper [28].
Lemma 26.
On any Eulerian graph and , we have .
Proof of Lemma 25.
Consider the PageRank equation (2) with . When is undirected, the matrix is SDD. By setting and , the supposed algorithm can compute an estimate such that w.p. at least in time . Using Lemma 26 and , we have . Thus, with probability at least , , so is an estimate of within constant relative error, completing the proof.
Proof of Theorem 11.
[45] establishes a complexity lower bound of for estimating within constant relative error with constant success probability on unweighted undirected graphs, where is constant and the bound holds for any possible combination of and . This lower bound applies to the number of queries to the graph structure. Therefore, combining this lower bound with Lemma 25 and noting that the reduction uses yield the desired lower bound of for .
8 Connections with Effective Resistance Computation
This section justifies the relationship between our framework and effective resistance computation on graphs in Lemma 27 and proves Corollary 10.
Recall that in the context of computing effective resistances, we assume that is undirected and connected. In our framework, we set and . By Theorem 2, , so a lower bound on the spectral gap serves as a lower bound on the maximum -norm gap . The following lemma states that with this setting, the quantity that our algorithms approximate equals the effective resistance .
Lemma 27.
When and , we have .
Lemma 27 can be proved using the results in [7], and we provide a different self-contained proof in the full version of this paper [28].
Now we can directly apply Theorems 4 and 8 to prove Corollary 10. The only remaining detail in the proof is to derive a better setting of for the case .
Proof of Corollary 10.
References
- [1] Reid Andersen, Christian Borgs, Jennifer T. Chayes, John E. Hopcroft, Vahab S. Mirrokni, and Shang-Hua Teng. Local computation of PageRank contributions. Internet Mathematics, 5(1):23–45, 2008. doi:10.1080/15427951.2008.10129302.
- [2] Reid Andersen, Fan R. K. Chung, and Kevin J. Lang. Using PageRank to locally partition a graph. Internet Mathematics, 4(1):35–64, 2007. doi:10.1080/15427951.2007.10129139.
- [3] Alexandr Andoni, Robert Krauthgamer, and Yosef Pogrow. On solving linear systems in sublinear time. CoRR, abs/1809.02995, 2018. doi:10.48550/arXiv.1809.02995.
- [4] Alexandr Andoni, Robert Krauthgamer, and Yosef Pogrow. On solving linear systems in sublinear time. In Proceedings of the 10th Innovations in Theoretical Computer Science Conference, volume 124, pages 3:1–3:19, 2019. doi:10.4230/LIPIcs.ITCS.2019.3.
- [5] Siddhartha Banerjee and Peter Lofgren. Fast bidirectional probability estimation in Markov models. In Advances in Neural Information Processing Systems 28, pages 1423–1431, 2015. URL: https://proceedings.neurips.cc/paper/2015/hash/ede7e2b6d13a41ddf9f4bdef84fdc737-Abstract.html.
- [6] Christian Bertram, Mads Vestergaard Jensen, Mikkel Thorup, Hanzhi Wang, and Shuyi Yan. Estimating random-walk probabilities in directed graphs. CoRR, abs/2504.16481, 2025. doi:10.48550/arXiv.2504.16481.
- [7] Enrico Bozzo. The Moore-Penrose inverse of the normalized graph Laplacian. Linear Algebra and its Applications, 439(10):3038–3043, 2013. doi:10.1016/j.laa.2013.08.039.
- [8] Marco Bressan, Enoch Peserico, and Luca Pretto. Sublinear algorithms for local graph-centrality estimation. SIAM Journal on Computing, 52(4):968–1008, 2023. doi:10.1137/19M1266976.
- [9] Sergey Brin and Lawrence Page. The anatomy of a large-scale hypertextual web search engine. Computer Networks, 30(1-7):107–117, 1998. doi:10.1016/S0169-7552(98)00110-X.
- [10] Dongrun Cai, Xue Chen, and Pan Peng. Effective resistances in non-expander graphs. In Proceedings of the 31st Annual European Symposium on Algorithms, volume 274, pages 29:1–29:18, 2023. doi:10.4230/LIPIcs.ESA.2023.29.
- [11] Michael B. Cohen, Jonathan A. Kelner, Rasmus Kyng, John Peebles, Richard Peng, Anup B. Rao, and Aaron Sidford. Solving directed Laplacian systems in nearly-linear time through sparse LU factorizations. In Proceedings of the 59th IEEE Symposium on Foundations of Computer Science, pages 898–909, 2018. doi:10.1109/FOCS.2018.00089.
- [12] Michael B. Cohen, Jonathan A. Kelner, John Peebles, Richard Peng, Anup B. Rao, Aaron Sidford, and Adrian Vladu. Almost-linear-time algorithms for Markov chains and new spectral primitives for directed graphs. In Proceedings of the 49th Annual ACM Symposium on Theory of Computing, pages 410–419, 2017. doi:10.1145/3055399.3055463.
- [13] Michael B. Cohen, Jonathan A. Kelner, John Peebles, Richard Peng, Aaron Sidford, and Adrian Vladu. Faster algorithms for computing the stationary distribution, simulating random walks, and more. In Proceedings of the 57th IEEE Symposium on Foundations of Computer Science, pages 583–592, 2016. doi:10.1109/FOCS.2016.69.
- [14] Guanyu Cui, Hanzhi Wang, and Zhewei Wei. Mixing time matters: Accelerating effective resistance estimation via bidirectional method. In Proceedings of the 31st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 177–188, 2025. doi:10.1145/3690624.3709298.
- [15] Paul Dagum, Richard M. Karp, Michael Luby, and Sheldon M. Ross. An optimal algorithm for Monte Carlo estimation. SIAM Journal on Computing, 29(5):1484–1496, 2000. doi:10.1137/S0097539797315306.
- [16] Dean Doron, François Le Gall, and Amnon Ta-Shma. Probabilistic logarithmic-space algorithms for Laplacian solvers. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, volume 81, pages 41:1–41:20, 2017. doi:10.4230/LIPIcs.APPROX-RANDOM.2017.41.
- [17] Peter G. Doyle and J. Laurie Snell. Random walks and electric networks, volume 22. The Mathematical Association of America, 1984.
- [18] Rajat Vadiraj Dwaraknath, Ishani Karmarkar, and Aaron Sidford. Towards optimal effective resistance estimation. In Advances in Neural Information Processing Systems 36, 2023. URL: http://papers.nips.cc/paper_files/paper/2023/hash/b8e2046160a568145af6d42eeef199f4-Abstract-Conference.html.
- [19] Dániel Fogaras, Balázs Rácz, Károly Csalogány, and Tamás Sarlós. Towards scaling fully Personalized PageRank: Algorithms, lower bounds, and experiments. Internet Mathematics, 2(3):333–358, 2005. doi:10.1080/15427951.2005.10129104.
- [20] George E. Forsythe and Richard A. Leibler. Matrix inversion by a Monte Carlo method. Mathematics of Computation, 4(31):127–129, 1950. URL: https://www.ams.org/journals/mcom/1950-04-031/S0025-5718-1950-0038138-X/home.html.
- [21] David F. Gleich. Pagerank beyond the web. SIAM Review, 57(3):321–363, 2015. doi:10.1137/140976649.
- [22] Oded Goldreich and Dana Ron. Property testing in bounded degree graphs. Algorithmica, 32(2):302–343, 2002. doi:10.1007/S00453-001-0078-7.
- [23] Oded Goldreich and Dana Ron. On testing expansion in bounded-degree graphs. In Studies in Complexity and Cryptography, volume 6650, pages 68–75. Springer, 2011. doi:10.1007/978-3-642-22670-0_9.
- [24] Aram W. Harrow, Avinatan Hassidim, and Seth Lloyd. Quantum algorithm for linear systems of equations. Physical Review Letters, 103(15):150502, 2009. doi:10.1103/PhysRevLett.103.150502.
- [25] Arun Jambulapati, Sushant Sachdeva, Aaron Sidford, Kevin Tian, and Yibin Zhao. Eulerian graph sparsification by effective resistance decomposition. In Proceedings of the 2025 ACM-SIAM Symposium on Discrete Algorithms, pages 1607–1650, 2025. doi:10.1137/1.9781611978322.50.
- [26] Arun Jambulapati and Aaron Sidford. Ultrasparse ultrasparsifiers and faster Laplacian system solvers. In Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms, pages 540–559, 2021. doi:10.1137/1.9781611976465.33.
- [27] Satyen Kale, Yuval Peres, and C. Seshadhri. Noise tolerance of expanders and sublinear expansion reconstruction. SIAM Journal on Computing, 42(1):305–323, 2013. doi:10.1137/110837863.
- [28] Tsz Chiu Kwok, Zhewei Wei, and Mingji Yang. On solving asymmetric diagonally dominant linear systems in sublinear time. CoRR, abs/2509.13891, 2025. doi:10.48550/arXiv.2509.13891.
- [29] Amy Nicole Langville and Carl Dean Meyer. Survey: Deeper inside PageRank. Internet Mathematics, 1(3):335–380, 2003. doi:10.1080/15427951.2004.10129091.
- [30] Lawrence Li and Sushant Sachdeva. A new approach to estimating effective resistances and counting spanning trees in expander graphs. In Proceedings of the 2023 ACM-SIAM Symposium on Discrete Algorithms, pages 2728–2745, 2023. doi:10.1137/1.9781611977554.ch102.
- [31] Peter Lofgren, Siddhartha Banerjee, and Ashish Goel. Bidirectional PageRank estimation: from average-case to worst-case. In Proceedings of the 12th International Workshop on Algorithms and Models for the Web-Graph, volume 9479, pages 164–176, 2015. doi:10.1007/978-3-319-26784-5_13.
- [32] Peter Lofgren, Siddhartha Banerjee, and Ashish Goel. Personalized PageRank estimation and search: a bidirectional approach. In Proceedings of the 9th ACM International Conference on Web Search and Data Mining, pages 163–172, 2016. doi:10.1145/2835776.2835823.
- [33] Peter Lofgren, Siddhartha Banerjee, Ashish Goel, and Seshadhri Comandur. FAST-PPR: Scaling Personalized PageRank estimation for large graphs. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1436–1445, 2014. doi:10.1145/2623330.2623745.
- [34] László Lovász. Random walks on graphs: a survey. Combinatorics, Paul Erdös is Eighty, 2(1-46):4, 1993.
- [35] Pan Peng, Daniel Lopatta, Yuichi Yoshida, and Gramoz Goranci. Local algorithms for estimating effective resistance. In Proceedings of the 27th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1329–1338, 2021. doi:10.1145/3447548.3467361.
- [36] Ronitt Rubinfeld, Gil Tamir, Shai Vardi, and Ning Xie. Fast local computation algorithms. In Proceedings of Innovations in Computer Science, pages 223–238, 2011. URL: http://conference.iiis.tsinghua.edu.cn/ICS2011/content/papers/36.html.
- [37] Nitin Shyamkumar, Siddhartha Banerjee, and Peter Lofgren. Sublinear estimation of a single element in sparse linear systems. In Proceedings of the 54th Annual Allerton Conference on Communication, Control, and Computing, pages 856–860, 2016. doi:10.1109/ALLERTON.2016.7852323.
- [38] Daniel A. Spielman and Nikhil Srivastava. Graph sparsification by effective resistances. SIAM Journal on Computing, 40(6):1913–1926, 2011. doi:10.1137/080734029.
- [39] Daniel A. Spielman and Shang-Hua Teng. Nearly-linear time algorithms for graph partitioning, graph sparsification, and solving linear systems. In Proceedings of the 36th Annual ACM Symposium on Theory of Computing, pages 81–90, 2004. doi:10.1145/1007352.1007372.
- [40] Daniel A. Spielman and Shang-Hua Teng. Nearly linear time algorithms for preconditioning and solving symmetric, diagonally dominant linear systems. SIAM Journal on Matrix Analysis and Applications, 35(3):835–885, 2014. doi:10.1137/090771430.
- [41] Amnon Ta-Shma. Inverting well conditioned matrices in quantum logspace. In Proceedings of the 45th Annual ACM Symposium on Theory of Computing, pages 881–890, 2013. doi:10.1145/2488608.2488720.
- [42] Shang-Hua Teng. The Laplacian Paradigm: Emerging algorithms for massive graphs. In Proceedings of the 7th Annual Conference on Theory and Applications of Models of Computation, volume 6108, pages 2–14, 2010. doi:10.1007/978-3-642-13562-0_2.
- [43] Mikkel Thorup, Hanzhi Wang, Zhewei Wei, and Mingji Yang. PageRank centrality in directed graphs with bounded in-degree. In Proceedings of the 2026 ACM-SIAM Symposium on Discrete Algorithms, 2026. To appear. arXiv preprint at https://arxiv.org/abs/2508.01257. doi:10.48550/arXiv.2508.01257.
- [44] Nisheeth K. Vishnoi. . Foundations and Trends in Theoretical Computer Science, 8(1-2):1–141, 2013. doi:10.1561/0400000054.
- [45] Hanzhi Wang. Revisiting local PageRank estimation on undirected graphs: Simple and optimal. In Proceedings of the 30th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 3036–3044, 2024. doi:10.1145/3637528.3671820.
- [46] Hanzhi Wang and Zhewei Wei. Estimating single-node PageRank in time. Proceedings of the VLDB Endowment, 16(11):2949–2961, 2023. doi:10.14778/3611479.3611500.
- [47] Hanzhi Wang, Zhewei Wei, Ji-Rong Wen, and Mingji Yang. Revisiting local computation of PageRank: Simple and optimal. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, pages 911–922, 2024. doi:10.1145/3618260.3649661.
- [48] Wolfgang R. Wasow. A note on the inversion of matrices by random walks. Mathematical Tables and Other Aids to Computation, 6(38):78–81, 1952. doi:10.2307/2002546.
- [49] Zhewei Wei, Ji-Rong Wen, and Mingji Yang. Approximating single-source Personalized PageRank with absolute error guarantees. In Proceedings of the 27th International Conference on Database Theory, volume 290, pages 9:1–9:19, 2024. doi:10.4230/LIPIcs.ICDT.2024.9.
- [50] Mingji Yang, Hanzhi Wang, Zhewei Wei, Sibo Wang, and Ji-Rong Wen. Efficient algorithms for Personalized PageRank computation: a survey. IEEE Transactions on Knowledge and Data Engineering, 36(9):4582–4602, 2024. doi:10.1109/TKDE.2024.3376000.
- [51] Renchi Yang and Jing Tang. Efficient estimation of pairwise effective resistance. Proceedings of the ACM International Conference on Management of Data, 1(1):16:1–16:27, 2023. doi:10.1145/3588696.
- [52] Yichun Yang, Rong-Hua Li, Meihao Liao, and Guoren Wang. Improved algorithms for effective resistance computation on graphs. In Proceedings of the 38th Conference on Learning Theory, volume 291, pages 5892–5920, 2025. URL: https://proceedings.mlr.press/v291/yichun25a.html.
