Abstract 1 Introduction 2 Other Related Work 3 Formulation of 𝒙 and the 𝒑-Norm Gaps 4 Random-Walk Sampling 5 The Local Push Method 6 The Bidirectional Method 7 Connections with PageRank Computation 8 Connections with Effective Resistance Computation References

On Solving Asymmetric Diagonally Dominant Linear Systems in Sublinear Time

Tsz Chiu Kwok ORCID Shanghai University of Finance and Economics, China Zhewei Wei ORCID Renmin University of China, Beijing, China Mingji Yang ORCID Renmin University of China, Beijing, China
Abstract

We initiate a study of solving a row/column diagonally dominant (RDD/CDD) linear system 𝐌𝒙=𝒃 in sublinear time, with the goal of estimating 𝒕𝒙 for a given vector 𝒕n and a specific solution 𝒙. This setting naturally generalizes the study of sublinear-time solvers for symmetric diagonally dominant (SDD) systems [Andoni-Krauthgamer-Pogrow, ITCS 2019] to the asymmetric case, which has remained underexplored despite extensive work on nearly-linear-time solvers for RDD/CDD systems.

Our first contributions are characterizations of the problem’s mathematical structure. We express a solution 𝒙 via a Neumann series, prove its convergence, and upper bound the truncation error on this series through a novel quantity of 𝐌, termed the maximum p-norm gap. This quantity generalizes the spectral gap of symmetric matrices and captures how the structure of 𝐌 governs the problem’s computational difficulty.

For systems with bounded maximum p-norm gap, we develop a collection of algorithmic results for locally approximating 𝒕𝒙 under various scenarios and error measures. We derive these results by adapting the techniques of random-walk sampling, local push, and their bidirectional combination, which have proved powerful for special cases of solving RDD/CDD systems, particularly estimating PageRank and effective resistance on graphs. Our general framework yields deeper insights, extended results, and improved complexity bounds for these problems. Notably, our perspective provides a unified understanding of Forward Push and Backward Push, two fundamental approaches for estimating random-walk probabilities on graphs.

Our framework also inherits the hardness results for sublinear-time SDD solvers and local PageRank computation, establishing lower bounds on the maximum p-norm gap or the accuracy parameter. We hope that our work opens the door for further study into sublinear solvers, local graph algorithms, and directed spectral graph theory.

Keywords and phrases:
Spectral Graph Theory, Linear Systems, Sublinear Algorithms
Copyright and License:
[Uncaptioned image] © Tsz Chiu Kwok, Zhewei Wei, and Mingji Yang; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Mathematics of computing Spectra of graphs
; Theory of computation Graph algorithms analysis ; Theory of computation Streaming, sublinear and near linear time algorithms
Related Version:
Full Version: https://arxiv.org/abs/2509.13891 [28]
Acknowledgements:
We thank the anonymous reviewers for their valuable comments. Mingji Yang wishes to thank Prof. Shang-Hua Teng for inspiring discussions and encouragement, and Guanyu Cui for helpful discussions on effective resistance.
Funding:
This research was supported by National Natural Science Foundation of China (No. 92470128, No. U2241212).
Editor:
Shubhangi Saraf

1 Introduction

Solving systems of linear equations is one of the most fundamental problems in numerical linear algebra and theoretical computer science. In the classic version of this problem, we are given a matrix 𝐌n×n and a vector 𝒃n in range(𝐌), and the goal is to compute a solution vector 𝒙n satisfying 𝐌𝒙=𝒃. Beyond general-purpose solvers for arbitrary linear systems (e.g., using fast matrix multiplication or the conjugate gradient method), extensive research has focused on developing efficient solvers for special classes of systems.

In particular, for the important classes of Laplacian and symmetric diagonally dominant (SDD) systems, the breakthrough work of Spielman and Teng [39, 40] established the first nearly-linear-time (in nnz(𝐌)) solvers. This gave rise to the influential Laplacian Paradigm, which revolutionized algorithmic graph theory and numerical linear algebra, with widespread applications ranging from network science to machine learning (see, e.g., [42, 44]). Among subsequent efforts to generalize the SDD solvers, a line of work [13, 12, 11] developed nearly-linear-time solvers for asymmetric row/column diagonally dominant (RDD/CDD) systems, which significantly expanded the scope of the Laplacian Paradigm.

On the other hand, partly motivated by the advances in quantum algorithms for solving linear systems in sublinear time [24], Andoni, Krauthgamer, and Pogrow [4] pioneered the study of classical algorithms for approximately solving a single entry of SDD systems in sublinear time. Under specific access models and error measures, they established:

  • a polylog(n)-time solver for well-conditioned SDD systems; 111In fact, [4] considers the (effective) condition number of a normalized version of the involved SDD matrix. See Section 1.3 for details.

  • a Ω~(κ2) lower bound for general SDD systems, where κ is the condition number of 𝐌.

The second result demonstrates the necessity of a quadratic dependence on the condition number (equivalently, the reciprocal of the spectral gap) for sublinear-time SDD solvers.

In light of the previous research for nearly-linear-time RDD/CDD solvers and sublinear-time SDD solvers, it is natural to ask whether the sublinear-time SDD solvers can be extended to the more general RDD/CDD cases. In this paper, we initiate a study in this direction and give partial positive answers to this question. We show that the sublinear-time solvers for well-conditioned SDD systems can be extended to “well-structured” RDD/CDD systems, provided that we first define an appropriate generalization of the key structural quantity, the spectral gap for symmetric matrices, to asymmetric matrices. We achieve this by re-characterizing the problem’s mathematical structure and introducing a new concept called the maximum p-norm gap.

Algorithmically, the sublinear SDD solver in [4] works by solely generating random-walk samplings based on 𝐌 to approximate a truncated Neumann series of the solution. In contrast, we conduct a deeper investigation of the complexity upper bounds by applying two techniques in addition to random-walk sampling: the local push method, which performs local exploration in 𝐌; and the bidirectional method, which integrates random-walk sampling with local push. Together, we derive a suite of upper bounds for solving RDD/CDD systems under diverse access models and error measures. For instance, we extend the algorithmic result in [4] to RDD systems and derive new results for RCDD systems with smaller dependence on some parameters.

Our algorithmic toolkit and investigation of the diverse upper bounds are inspired by recent advances in local algorithms for estimating PageRank [9] and effective resistance [17] on graphs [50, 49, 47, 45, 6, 43, 14, 52], which are important special cases of solving RDD/CDD systems. The techniques of random-walk sampling [39, 19], local push [2, 1], and the bidirectional method [32] have been extensively studied for these problems, and recent works have further uncovered their new properties and optimality in certain settings. Nonetheless, previous works typically analyze their applications to PageRank and effective resistance computation separately. Our perspective of formulating these problems as solving linear systems, however, provides a more general and unified framework for understanding these techniques and problems, revealing their deeper connections. As we shall see, this bigger picture yields novel insights, extended results, and improved complexity bounds.

Notably, our perspective reveals a connection between two fundamental local push algorithms on graphs, namely ForwardPush [2] and BackwardPush [1]222Their original names are ApproximatePageRank and ApproxContributions, respectively.. These algorithms iteratively perform local push operations to explore the graph in opposite directions. Although both algorithms share similar approaches, they have been treated as distinct methods for different problems, with each analyzed separately. In contrast, by abstracting both methods as a single algebraic primitive, we demonstrate that ForwardPush and BackwardPush are equivalent to applying this primitive to different linear systems. This characterization helps to explain their distinct properties and enables unified analysis of both approaches.

On the lower-bound side, our framework inherits the hardness result for sublinear-time SDD solvers, establishing the necessity of our assumption on the maximum p-norm gap; also, known lower bounds for local PageRank computation imply lower bounds on the accuracy parameter for our setting. As our work bridges the study of sublinear-time solvers and local graph algorithms, we believe that further investigation could uncover more connections and results for these topics.

In the remainder of this section, we formally define the problem, present our main contributions, and provide a technical overview.

1.1 Basic Notations

For n+, we define [n]:={1,2,,n}. We call a matrix 𝐌n×n RDD (row diagonally dominant) if it satisfies 𝐌(j,j)kj|𝐌(j,k)| for all j[n] and its diagonal entries are positive. We call a matrix CDD (column diagonally dominant) if its transpose is RDD, and call a matrix SDD (symmetric diagonally dominant) if it is symmetric and RDD. It is well-known that any SDD matrix is PSD (positive semidefinite). We call a square matrix Z-matrix if its off-diagonal entries are nonpositive. We use 𝒆k to denote the k-th canonical unit vector and 𝟏 to denote the all-one vector. For any matrix or vector, we use || to denote taking the entrywise absolute value.

We call two real numbers p,q>1 Hölder conjugates (or q is conjugate to p) if they satisfy 1/p+1/q=1. By convention, we also formally let 1/:=0 and view and 1 as Hölder conjugates. For any p[1,], we use 𝒙p to denote the p-norm of a vector 𝒙, and 𝐌p to denote the matrix norm induced by vector p-norm of a matrix 𝐌n×n.

Restriction and Pseudoinverse.

For a subspace Un, we use 𝐌|U to denote the restriction of the linear map 𝐌 to U, with induced norm 𝐌|Up:=max𝒙U,𝒙p=1𝐌𝒙p for any p[1,]. We write the pseudoinverse (a.k.a. Moore–Penrose inverse) of 𝐌 as 𝐌+.

Spectral Gap.

For an SDD matrix 𝐒n×n, we define its spectral gap γ(𝐒) as half the smallest nonzero eigenvalue of 𝐒~:=𝐃𝐒1/2𝐒𝐃𝐒1/2, where 𝐃𝐒 is the diagonal matrix that satisfies 𝐃(k,k)=𝐒(k,k) for each k[n]. The (effective) condition number of 𝐒, denoted by κ(𝐒), is defined as the ratio between the largest and smallest nonzero eigenvalues of 𝐒. It holds that κ(𝐒~)=Θ(1/γ(𝐒)).

Graphs.

We consider directed graphs G=(V,E), with n:=|V| and m:=|E|. We assume that V=[n]. If (u,v)E, we write uv. We denote the (possibly weighted) adjacency matrix as 𝐀Gn×n. For each vV, we define its indegree dG(v):=uv𝐀G(u,v) and outdegree dG+(v):=vu𝐀G(v,u). The outdegree matrix 𝐃Gn×n is the diagonal matrix with 𝐃G(v,v)=dG+(v) for each vV. We denote G’s minimum and maximum outdegree as δG+:=minvV{dG+(v)} and ΔG+:=maxvV{dG+(v)}, respectively.

Eulerian Graphs and Laplacian.

We call a graph Eulerian if dG(v)=dG+(v) for all vV. On Eulerian graphs, we simply write dG(v):=dG+(v), δG:=δG+, and ΔG:=ΔG+. Undirected graphs constitute a special case of Eulerian graphs where each edge corresponds to two directed edges in opposite directions. The directed Laplacian matrix is defined as 𝐋G:=𝐃G𝐀G, which satisfies 𝟏𝐋G=𝟎 and is CDDZ. 𝐋G is RCDDZ for Eulerian graphs and is SDDZ for undirected graphs.

1.2 Problem Formulation

We consider a linear system 𝐌𝒙=𝒃, where 𝐌n×n is an RDD/CDD matrix and 𝒃range(𝐌), and a coefficient vector 𝒕n. We assume that 𝒃,𝒕𝟎 and all nonzero entries in 𝐌, 𝒃, and 𝒕 have absolute values in [1/poly(n),poly(n)]. We assume that the algorithms are given the dimension n and have oracle access to 𝐌, 𝒃, and 𝒕 via the following basic queries:

  • Diagonal queries for 𝐌: return 𝐌(k,k) in O(1) time for a given index k[n];

  • Row/column queries for 𝐌: return the indices and corresponding values of nonzero entries for a specified row/column of 𝐌, in time linear in the number of returned indices;

  • Entrywise queries for 𝒃 and 𝒕: return 𝒃(k) or 𝒕(k) in O(1) time for a given index k[n].

Our results will assume additional access operations, which will be specified in the statements.

Following the concept of local computation algorithms [36] and the previous work [4], we consider a fixed solution 𝒙 that is determined by 𝐌 and 𝒃, and require invoking the algorithm with different 𝒕 and accuracy parameters returns estimates of 𝒕𝒙 that are all consistent with the “global” solution 𝒙. Our choice of 𝒙 will be given in Theorem 1.

We also consider the problems of computing (Personalized) PageRank [9] and effective resistance [17] on graphs, viewing them as special cases of our formulation of solving RDD/CDD systems. For these graph problems, we assume the standard adjacency-list model [22], where each degree query takes O(1) time and each neighbor query returns a neighbor index along with the edge weight in O(1) time.

For Personalized PageRank (PPR), we consider a directed graph G, a decay factor α(0,1), and a source distribution 𝒔{𝒚0n:𝒚1=1}. To ensure that PPR is well-defined, we assume that δG+>0. The PPR vector 𝝅G,α,𝒔 is defined as the unique solution to the following two equivalent forms of the PPR equation:

(𝐈(1α)𝐀G𝐃G1)𝝅G,α,𝒔=α𝒔, (1)
(𝐃G(1α)𝐀G)(𝐃G1𝝅G,α,𝒔)=α𝒔. (2)

Both equations can be viewed as linear systems of the form 𝐌𝒙=𝒃, where the coefficient matrices 𝐈(1α)𝐀G𝐃G1 and 𝐃G(1α)𝐀G are both CDDZ and invertible. Note that for the second form, the solution to the corresponding system is 𝐃G1𝝅G,α,𝒔, an outdegree-scaled version of the PPR vector. We define the PPR value from s to t as 𝝅G,α(s,t):=𝝅G,α,𝒆s(t), and the PageRank vector 𝝅G,α as 𝝅G,α:=𝝅G,α,1/n𝟏. It holds that 𝝅G,α(t)=1nsV𝝅G,α(s,t) for all tV. We also consider the following two equivalent forms of the PageRank contribution equation:

(𝐈(1α)𝐃G1𝐀G)𝝅G,α,t1=α𝒆t, (3)
(𝐃G(1α)𝐀G)𝝅G,α,t1=α𝐃G𝒆t, (4)

where tV is a specified target node and 𝝅G,α,t1 is called the PageRank contribution vector to t. It holds that 𝝅G,α,t1(s)=𝝅G,α(s,t) for all sV. Similarly, both equations can be viewed as linear systems of the form 𝐌𝒙=𝒃, where the coefficient matrices 𝐈(1α)𝐃G1𝐀G and 𝐃G(1α)𝐀G are both RDDZ and invertible. Note that for the second form, the corresponding vector 𝒃 is α𝐃G𝒆t.

For effective resistance, we consider a connected undirected graph G and two distinct nodes s,tV. The effective resistance (a.k.a. resistance distance) between s and t, denoted by RG(s,t), is defined as the equivalent resistance between s,t if the graph is thought of as an electrical network with each edge (u,v)E having resistance 1/𝐀G(u,v). Algebraically, RG(s,t)=(𝒆s𝒆t)𝐋G+(𝒆s𝒆t). As we will establish in Lemma 27, setting 𝐌=𝐋G and 𝒃=𝒕=𝒆s𝒆t in our formulation yields 𝒕𝒙=RG(s,t). Here, 𝐌=𝐋G is SDDZ and 𝒃=𝒆s𝒆trange(𝐌).

1.3 Previous Work for SDD Systems

[4] studies the case when the linear system is 𝐒𝒙=𝒃 for some SDD matrix 𝐒. Define 𝐃𝐒 as the diagonal matrix that satisfies 𝐃(k,k)=𝐒(k,k) for each k[n] and 𝐒~:=𝐃𝐒1/2𝐒𝐃𝐒1/2. They formulate the fixed solution as 𝒙:=𝐃𝐒1/2𝐒~+𝐃𝐒1/2𝒃 and give a Neumann series expansion of 𝒙. For the algorithmic results, they assume that the algorithm is given κ, an upper bound on κ(𝐒~) (alternatively, γ as a lower bound on γ(𝐒)) and set a truncation parameter for the Neumann series as L:=Θ(κlog(κκ(𝐃𝐒)𝒃0/ε))=Θ~(κ).

Based on the truncated Neumann series, they present a randomized algorithm that, given a coordinate t[n], computes an estimate x^t satisfying |x^t𝒙(t)|ε𝐃𝐒1𝒃 with probability at least 3/4. The algorithm runs in time O(f(𝐒)L3logL/ε2)=O~(f(𝐒)κ3ε2)=O~(f(𝐒)γ3ε2), where f(𝐒) is the maximum time cost to simulate one step in the random walk defined by 𝐒. This result is implicit in the proof of [3, Theorem 5.1].

On the negative side, [4] proves an Ω(κ(𝐒)2/log3n)=Ω~(κ(𝐒)2) query lower bound (in terms of probing 𝒃) for achieving a weaker absolute error bound of ε𝒙, for κ(𝐒)=O(n/logn) and ε=Θ(1/logn). The matrix 𝐒 in their hard instance is a Laplacian matrix of a fixed unweighted undirected graph with maximum degree 4 and thus satisfies κ(𝐒~)=Θ(κ(𝐒)). Therefore, this lower bound can also be written as Ω~(1/γ(𝐒)2). To our knowledge, no other work has explicitly studied sublinear-time SDD/RDD/CDD solvers.

1.4 Formulation of 𝒙 and the 𝒑-Norm Gaps

Our first contribution is a Neumann-series-based characterization of a solution 𝒙 to 𝐌𝒙=𝒃, which is consistent with the solution considered by [4] for SDD systems.

We decompose 𝐌 uniquely as 𝐌=𝐃𝐌𝐀𝐌, where 𝐃𝐌 is a diagonal matrix and all diagonal entries of 𝐀𝐌 are 0. Define 𝐌~:=𝐃𝐌1/2𝐌𝐃𝐌1/2 and 𝐀~𝐌:=𝐃𝐌1/2𝐀𝐌𝐃𝐌1/2.

The next theorem gives the definition of 𝒙 and its properties.

Theorem 1.

For any RDD/CDD 𝐌 and 𝐛range(𝐌), define 𝐱 to be

𝒙:=12=0(12(𝐈+𝐃𝐌1𝐀𝐌))𝐃𝐌1𝒃.

Then 𝐱 is well-defined and satisfies 𝐌𝐱=𝐛. If 𝐌 is SDD, then 𝐱=𝐃𝐌1/2𝐌~+𝐃𝐌1/2𝐛.

Next, we study a truncated version of 𝒙 and upper bound the truncation error. As previous analysis based on eigendecomposition is not directly applicable to asymmetric matrices, we introduce a novel concept called the p-norm gap of 𝐌: for any p[1,], we define the p-norm gap of 𝐌 as

γp(𝐌):=112(𝐈+𝐃𝐌1/q𝐀𝐌𝐃𝐌1/p)|range(𝐈𝐃𝐌1/q𝐀𝐌𝐃𝐌1/p)p,

where q is conjugate to p. We further define the maximum p-norm gap of 𝐌 as γmax(𝐌):=maxp[1,]γp(𝐌).333One could alternatively define γmax(𝐌):=maxp{1,2,}γp(𝐌), which yields a possibly smaller quantity but our main results continue to hold with this definition. To our knowledge, no prior work has explicitly studied these quantities.

The following theorem guarantees that for any RDD/CDD matrix 𝐌, the maximum p-norm gap γmax(𝐌) lies in (0,1], and when 𝐌 is SDD, γmax(𝐌) coincides with the spectral gap γ(𝐌). Thus, it is a natural generalization of the spectral gap to asymmetric matrices.

Theorem 2.

If 𝐌 is RDD/CDD, then 0<γmax(𝐌)1; if 𝐌 is SDD, then γmax(𝐌)=γ2(𝐌)=γ(𝐌).

We will devise algorithms to estimate the truncated version of 𝒙, defined as

𝒙L:=12=0L1(12(𝐈+𝐃𝐌1𝐀𝐌))𝐃𝐌1𝒃 (5)

for an integer truncation parameter L. The next theorem upper bounds the truncation error in terms of a given lower bound on the maximum p-norm gap γmax(𝐌).

Theorem 3.

Suppose 0<γγmax(𝐌). To ensure that |𝐭𝐱L𝐭𝐱|12ε, it suffices to set

L:=Θ(1γlog(1γεdmax(𝐌)𝒕0𝐃𝐌1𝒕𝒃0𝐃𝐌1𝒃))=Θ~(1γ), (6)

where dmax(𝐌) denotes the largest diagonal entry in 𝐌.444Here and after, we use Θ~ and O~ to hide polylogarithmic factors in n, 1/γ, 1/ε, and (reciprocals of) quantities in 𝐌, 𝐛, and 𝐭.

Relationship with the Formulation in [4].

Our formulations of 𝒙 and the maximum p-norm gap generalize the ones in [4], in the sense that when 𝐌 is SDD, 𝒙=𝐃𝐌1/2𝐌~+𝐃𝐌1/2𝒃 matches their solution and γmax(𝐌) equals γ(𝐌), the spectral gap of 𝐌. Therefore, the quadratic lower bound on 1/γ(𝐌) for SDD systems in that paper translates into a quadratic lower bound on 1/γmax(𝐌) for general RDD/CDD systems. Also, our setting of the truncation parameter L in Equation (6) matches theirs when 𝐌 is SDD and 𝒕 is a canonical unit vector.

1.5 Main Algorithmic Results

For our algorithmic results, we assume that the algorithm is given a quantity γ>0 as a lower bound on γmax(𝐌) and an accuracy parameter ε>0. We use γ, ε, and the specific terms in the accuracy guarantee to set the truncation parameter L according to Equation (6), where we assume that suitable upper bounds on the quantities in the logarithmic factor are known.

The statements of our results will use the following definition. For RDD 𝐌, we use frow(𝐌) to denote the maximum time cost to simulate a single step in the random walk defined by the row substochastic matrix 12(𝐈+𝐃𝐌1|𝐀𝐌|). This quantity depends on the structure and representation of 𝐌. For instance, if each row of 𝐌 has at most d nonzero entries, then frow(𝐌)=O(d); if the nonzero entries in 𝐀𝐌 have equal absolute values and we are allowed to sample a uniformly random index of the nonzero entries in each column of 𝐀𝐌 in O(1) time, then frow(𝐌)=O(1). For CDD 𝐌, we define fcol(𝐌):=frow(𝐌).

Each of our algorithmic results excels under different system types, access models, and parameter regimes. We present a selection of our results here, with additional results given in the full version of this paper [28]. Furthermore, a remark at the end of this subsection establishes that most of our results have “symmetric” counterparts (e.g., for CDD systems) that can be obtained by exchanging the roles of certain quantities.

We first present our results of using random-walk sampling for RDD systems.

Theorem 4.

Suppose that 𝐌 is RDD, we can sample from the distribution |𝐭|/𝐭1 in O(1) time, and 𝐭1 is known. Then there exists a randomized algorithm that computes an estimate x^ such that Pr{|x^𝐭𝐱|ε𝐃𝐌1𝐛}34 in time O(frow(𝐌)𝐭12L3ε2)=O~(frow(𝐌)𝐭12γ3ε2).

This theorem subsumes the main algorithmic result of [4] for SDD systems and extends it to the RDD case while achieving a mild (logL)-factor improvement. Their original result is recovered as a special case when 𝐌 is SDD and 𝒕 is a canonical unit vector. Our algorithm is similar to theirs, but we achieve the improved complexity by adopting a different random-walk sampling scheme.

We also show that for RDDZ 𝐌 along with nonnegative vectors 𝒃 and 𝒕, if we allow the relaxed error bound ε𝒙, the complexity can be improved to depend quadratically on L via a variance analysis of the random-walk sampling process. We adopt this error measure since it provides a natural accuracy guarantee and has been previously studied in [4]. It is shown in [4] that 𝐃𝐌1𝒃2𝒙 for RDD systems.

Theorem 5.

Suppose that 𝐌 is RDDZ, 𝐛,𝐭𝟎, we can sample from the distribution 𝐭/𝐭1 in O(1) time, and 𝐭1 is known. Then there exists a randomized algorithm that computes a x^ such that Pr{|x^𝐭𝐱|ε𝐱}34 in time O(frow(𝐌)𝐭12L2ε2)=O~(frow(𝐌)𝐭12γ2ε2).

The following theorem further considers the relative error guarantee of ε𝒕𝒙. The complexity depends quadratically on L and linearly on 𝒕1𝐃𝐌1𝒃/𝒕𝒙 and is also achieved using random-walk sampling.

Theorem 6.

Suppose that 𝐌 is RDDZ, 𝐛,𝐭𝟎, we can sample from the distribution 𝐭/𝐭1 in O(1) time, 𝐭1 and 𝐃𝐌1𝐛 are known, and 𝐭𝐱>0. Then there exists a randomized algorithm that computes an estimate x^ such that Pr{|x^𝐭𝐱|ε𝐭𝐱}34 in expected time

O(frow(𝐌)𝒕1𝐃𝐌1𝒃L2ε2𝒕𝒙)=O~(frow(𝐌)𝒕1𝐃𝐌1𝒃γ2ε2𝒕𝒙).

Our next result leverages the local push method to derive a deterministic algorithm for special RCDD systems, whose complexity depends linearly on 1/ε.

Theorem 7.

Suppose that 𝐌 is RCDD, its nonzero entries have absolute values of Ω(1), and we can scan the nonzero entries of 𝐛 in O(𝐛0) time. Then there exists a deterministic algorithm that computes a x^ such that |x^𝐭𝐱|ε𝐭1 in time O(𝐛0) plus O(𝐛1L3ε1)=O~(𝐛1γ3ε1).

Lastly, applying the bidirectional method to special RCDD systems yields complexity bounds with improved dependence on L and ε, achieving either L7/3ε2/3 or L5/2ε1 using different parameter settings.

Theorem 8.

Suppose that 𝐌 is RCDD, its nonzero entries have absolute values of Ω(1), we can sample from the distribution |𝐭|/𝐭1 in O(1) time, 𝐭1 and frow(𝐌) are known, and we can scan through the nonzero entries of 𝐛 in O(𝐛0) time. Then there exists a randomized algorithm that computes an estimate x^ such that Pr{|x^𝐭𝐱|ε}34 in time O(𝐛0) plus

O(min(frow(𝐌)13𝒕123𝒃123L73ε23,frow(𝐌)12𝒕1𝒃112𝐃𝐌1𝒃12L52ε1))
=O~(min(frow(𝐌)1/3𝒕12/3𝒃12/3γ7/3ε2/3,frow(𝐌)1/2𝒕1𝒃11/2𝐃𝐌1𝒃1/2γ5/2ε)).
 Remark.

As we shall see, all theorems in this subsection except Theorem 5 still hold if we replace RDD/RDDZ by CDD/CDDZ, swap 𝐛 and 𝐭 (except in 𝐭𝐱), and replace frow(𝐌) by fcol(𝐌) in the statements. Theorem 5 is an exception because its proof relies on the property that 𝐃𝐌1𝐛2𝐱 for RDD 𝐌, which does not have a straightforward analog for the CDD case.

1.6 Connections to PageRank and Effective Resistance Computation

Our results can be directly applied to the PPR or PageRank contribution equations and the single-pair effective resistance problem to yield complexity bounds for different graph types under various access models and accuracy guarantees. Notably, as we shall see, for PageRank computation, half the decay factor serves as a lower bound on the maximum p-norm gap of the corresponding systems; for effective resistance computation, the spectral gap of the graph Laplacian equals the maximum p-norm gap of the system. Rather than exhaustively presenting all applications of our results, here we highlight selected results that generalize and improve upon the previously best complexity bounds.

The following theorem is derived by applying Theorem 6 to the PageRank equation (2) combined with tighter lower bounds on 𝝅G,α(t) that we establish for Eulerian graphs.

Theorem 9.

For any unweighted Eulerian graph G, given the decay factor α, a target node tV, and δG, there exists a randomized algorithm that estimates the PageRank value 𝛑G,α(t) within relative error ε with probability at least 3/4 in time

O(1αε2dG(t)δG1n𝝅G,α(t))=O(1ε2δGmin(dG(t)α2,m/dG(t)α2,ΔGα,mα)).

This improves over the previously best upper bounds of O(1ε2δGmin(dG(t)α2,mα2)), which was given by [45] and stated for unweighted undirected graphs. Our algorithm is essentially the same as that in [45], which generates random walks from the target node t, and the improvement comes from our tighter lower bounds on 𝝅G,α(t) for Eulerian graphs.

For effective resistance computation, recall that we consider connected undirected graphs G. Lemma 27 establishes that the value 𝒕𝒙 that our algorithms approximate equals RG(s,t) when setting 𝐌=𝐋G and 𝒃=𝒕=𝒆s𝒆t. Thus, Theorems 4 and 8 directly imply the following complexity bounds for estimating effective resistance.

Corollary 10.

For any connected unweighted undirected graph G, given nodes s,tV and γ>0 as a lower bound on γ(𝐋G), there exists a randomized algorithm that estimates the effective resistance RG(s,t) within absolute error ε with probability at least 3/4 in time

O(min(L3ε2min(dG(s),dG(t))2,L7/3ε2/3,L5/2εmin(dG(s),dG(t))1/2)),

where L:=Θ(1γlog(1γε(1dG(s)+1dG(t)))).

This subsumes the previous bounds of O(min(L3logLε2min(dG(s),dG(t))2,L7/3logLε2/3)) given by [14] with essentially the same setting of L. Moreover, since RG(s,t) can be lower bounded by 1/2/min(dG(s),dG(t)) [34, Corollary 3.3], by setting the absolute error parameter ε to be εr1/2/min(dG(s),dG(t)), the last bound in our result implies that an estimate of RG(s,t) within relative error εr can be computed in time O(min(dG(s),dG(t))1/2L5/2εr1). This improves over the previous bound of O(min(dG(s),dG(t))1/2L3logLεr1) given by [52]. Our algorithms are essentially the same as those in [14, 52], i.e., using random-walk sampling and the bidirectional method, and the improvements stem from our simpler sampling scheme that avoids using extra data structures and a refined analysis.

On the other hand, known hardness results for local PageRank and effective resistance computation can potentially yield lower bounds for sublinear-time solvers. We highlight the following result, derived by establishing a reduction from estimating single-node PageRank on undirected graphs to solving SDD systems and applying the lower bound for PageRank computation from [45].

Theorem 11.

For any large enough n and ε=Ω(1/n), there exist 𝐛n and t[n] that satisfy the following. Every randomized algorithm that, given access to an invertible SDD matrix 𝐒n×n whose spectral gap is Ω(1), succeeds with probability at least 3/4 to approximate 𝐱(t) within absolute error ε𝐱, must probe Ω(1/ε) coordinates of 𝐒 in the worst case. Here, 𝐱=𝐒1𝐛.

This result gives an Ω(1/ε) lower bound for local SDD solvers with accuracy guarantee ε𝒙, demonstrating the necessity of a linear dependence on 1/ε in our Theorem 4. This lower bound on the accuracy parameter complements the lower bound on the spectral gap given by [4].

1.7 Understanding ForwardPush and BackwardPush on Graphs

Inspired by the local push algorithms ForwardPush and BackwardPush, we formulate our Push algorithm as a unified primitive for estimating a summation of matrix powers applied to a vector, which is closely related to our approach to solving RDD/CDD systems. This abstraction reveals that ForwardPush and BackwardPush, despite appearing as distinct algorithms for two problems with different propagation strategies, are equivalent to applying Push to different linear systems, modulo variable scaling. The apparent differences arise from the scaling and the distinct behaviors that Push exhibits on different types of linear systems.

Specifically, for PageRank computation, ForwardPush from node s corresponds to applying Push to Equation (2) with 𝒔=𝒆s and outdegree scaling, while BackwardPush from node t applies Push to the contribution equations (3) or (4). Our analysis demonstrates that Push provides closed-form complexity bounds on certain CDD systems and accuracy guarantees on RDD systems, which explains the known computational properties of ForwardPush and BackwardPush on directed graphs.

For RCDD systems, Push inherits both complexity and accuracy advantages, clarifying why ForwardPush and BackwardPush perform well on undirected graphs. This perspective further reveals that on Eulerian graphs, ForwardPush from any node is equivalent to BackwardPush from that node on the transpose graph, establishing a basic connection previously unrecognized.

1.8 Technical Overview

Our characterizations of the solution 𝒙 and the p-norm gaps are based on fundamental properties of Neumann series and restricted linear maps. Since asymmetric matrices may be non-diagonalizable, the eigendecomposition techniques used for symmetric matrices become inapplicable. We address this challenge by analyzing the operator norms of restricted linear maps to establish series convergence and derive truncation error bounds.

Our algorithms estimate 𝒕𝒙L=12𝒕=0L1(12(𝐈+𝐃𝐌1𝐀𝐌))𝐃𝐌1𝒃 by adapting three techniques for random-walk probability estimation on graphs: random-walk sampling [39, 19], local push [2, 1], and the bidirectional method [32].

When 𝐌 is RDD, the matrix 12(𝐈+𝐃𝐌1|𝐀𝐌|) is row substochastic, enabling us to interpret 𝒕𝒙L as an expectation over random walks that start from the distribution |𝒕|/𝒕1 and transition according to 12(𝐈+𝐃𝐌1|𝐀𝐌|). Such random walks may terminate early, and the algorithm needs to record the signs of the entries in 𝐀𝐌 along the walk. This method extends the approach in [4], and we adopt a different sampling scheme and conduct variance analysis in some special cases to reduce the dependence on L in the complexity.

Based on local push methods, we formulate a Push primitive that estimates the vector 𝒙L==0L1(12(𝐈+𝐃𝐌1𝐀𝐌))𝐃𝐌1𝒃 through deterministic local computation. Push maintains coordinate variables initialized to 𝐃𝐌1𝒃 and iteratively applies the linear operator 12(𝐈+𝐃𝐌1𝐀𝐌) to selected coordinates. Our analysis relies on invariant properties preserved by these operations, including an inequality variant that helps to handle negative entries. This approach yields closed-form accuracy guarantees for RDD systems and complexity bounds for special CDD systems. Combining these two aspects yield our result for special RCDD systems.

The bidirectional method combines random-walk sampling and local push from two directions. We adapt the BiPPR framework [32], performing Push from 𝐃𝐌1𝒃 and exploiting the invariant property to construct an unbiased estimator for 𝒕𝒙L, which can be sampled via random walks from |𝒕|/𝒕1 when 𝐌 is RDD. By balancing the costs of both components, we achieve improved dependence on L and ε, particularly for RCDD systems.

Crucially, we can transpose the expression for 𝒕𝒙L to obtain an equivalent summation with 𝐀𝐌 replaced by 𝐀𝐌 and the roles of 𝒃 and 𝒕 exchanged. This allows us to alternatively apply Push from 𝐃𝐌1𝒕 and random-walk sampling from |𝒃|/𝒃1 when 𝐌 is CDD, leading to symmetric algorithmic procedures and complexity results.

1.9 Paper Organization

The remainder of this paper is organized as follows. Section 2 introduces more related work. Section 3 elaborates on our formulation of 𝒙 and p-norm gaps and proves Theorems 1 to 3. In Sections 4, 5, and 6, we present our algorithms based on random-walk sampling, local push, and the bidirectional method, respectively, and prove Theorems 4 to 8. After that, Sections 7 and 8 relate our problem and results to local computation of PageRank and effective resistance, respectively. In the full version of this paper [28], we provide more preliminaries, discussions on future directions, detailed explanations of the relationship between our Push primitive and the ForwardPush and BackwardPush algorithms, additional results, and all omitted proofs.

2 Other Related Work

A vast literature exists on nearly-linear-time Laplacian solvers and their extensions, including solvers for undirected Laplacian/SDD systems (e.g., [39, 40, 26]) and directed Laplacian/RDD/CDD systems (e.g., [13, 11, 25]). These solvers achieve nearly linear time complexity in nnz(𝐌) with polylogarithmic dependence on 1/ε and the condition number. The development of global RDD/CDD solvers relies on reductions from solving RDD/CDD systems to solving Eulerian Laplacian systems, combined with efficient methods for computing PPR vectors with small α and the stationary distribution of random walks on graphs [13]. However, the global techniques from algorithmic linear algebra and known reductions to the Eulerian case do not directly apply to the local setting, revealing a fundamental distinction between global and local RDD/CDD solvers. In fact, the lower bounds established in [3] and this work demonstrate that local SDD solvers require polynomial dependence on 1/ε and 1/γ, indicating a separation between global and local SDD solvers.

[16] develops probabilistic logspace solvers for certain classes of directed Laplacian systems. Their method also relies on approximating truncated Neumann series, but they bound the truncation error using spectral radius and Jordan normal form, which yield truncation parameters of at least n2. Such a huge truncation parameter makes their algorithm and analysis inapplicable to the sublinear-time setting. As an aside, in the quantum regime, [41] presents algorithms for inverting well-conditioned matrices in quantum logspace, but their approaches are not directly applicable to our classical sublinear-time framework.

The idea of using random-walk sampling to solve linear systems dates back to the von Neumann-Ulam algorithm for approximating matrix inversion [20, 48]. The bidirectional method for estimating random-walk probabilities on graphs is first proposed in [33], which is inspired by property testing techniques [23, 27] and later simplified by the BiPPR framework [32].

The bidirectional idea has been widely applied to compute PageRank [5, 31, 32, 8, 49, 47, 6, 43], effective resistance [14, 52], heat kernel [8], and Markov Chain transition probability [5]. Among them, [47] proves that the simple BiPPR framework computes single-node PageRank on unweighted directed graphs in optimal time complexity (in terms of n and m). Their analysis relies on a new complexity bound of BackwardPush, which is parameterized by the PageRank value of the target node. Recently, [52] shows that the bidirectional technique can yield faster algorithms for constructing effective resistance sketch (as defined in [18]) on expander graphs, and [43] combines random-walk sampling with a novel randomized local push technique to improve the complexity of estimating single-node PageRank on directed graphs with bounded in-degree.

[37] uses the bidirectional method to estimate a single element in the product of a matrix power and a vector, which relates to our estimation of 12𝒕=0L1(12(𝐈+𝐃𝐌1𝐀𝐌))𝐃𝐌1𝒃. However, they only derive an average-case complexity bound under some bounded-norm conditions and discuss its applications to solving PSD systems. In contrast, we conduct a more comprehensive study of this problem and apply it to solving RDD/CDD systems.

PageRank and PPR have been extensively studied and widely applied; we refer interested readers to the surveys [29, 21, 50]. More lower bounds for PageRank computation can be found in [8, 50, 6] and references therein. The recent work [6] conducts a comprehensive study of various types of PPR estimation problems using different graph access queries under both worst-case and average-case settings. For constant decay factors, they provide nearly tight complexity upper and lower bounds for achieving constant relative error guarantees when the target value is above a given threshold.

Effective resistance is ubiquitous in spectral graph theory [17, 34, 38, 25]. A line of work [35, 51, 14, 52] focuses on locally estimating single-pair effective resistance through multi-step random-walk probabilities. [10] studies this problem on non-expander graphs and establishes strong complexity lower bounds, though it does not explicitly give a lower bound on the spectral gap of the graph. Recently, [52] provides a lower bound on the relative error parameter for this problem. Besides, a line of work [30, 18, 52] studies the problem of constructing effective resistance sketches. Technically, [30] leverages random-walk sampling, [18] uses count sketches and SDD solvers, and [52] employs a bidirectional approach.

3 Formulation of 𝒙 and the 𝒑-Norm Gaps

In this section, we prove Theorems 1, 2, and 3 and give further explanations on our formulations of 𝒙 and the p-norm gaps.

First, we need the following lemma that upper bounds 𝐃𝐌1/q𝐀𝐌𝐃𝐌1/pp for certain 𝐌 and Hölder conjugates p,q. Its proof is given in the full version of this paper [28].

Lemma 12.

The following hold:

  1. 1.

    If 𝐌 is RDD, then 𝐃𝐌1𝐀𝐌1.

  2. 2.

    If 𝐌 is CDD, then 𝐀𝐌𝐃𝐌111.

  3. 3.

    If 𝐌 is RCDD, then 𝐃𝐌1/q𝐀𝐌𝐃𝐌1/pp1 for any Hölder conjugates p,q.

To establish our formulation of 𝒙 in Theorem 1, we use the following lemma, which characterizes the operator norm and Neumann series of certain restricted linear maps. The proof of this lemma is given in the full version of this paper [28].

Lemma 13.

Suppose 𝐗n×n and is the operator norm induced by some vector norm . If 𝐗1, then, for 𝐗¯:=12(𝐈+𝐗), we have 𝐗¯|range(𝐈𝐗)<1 and

((𝐈𝐗)|range(𝐈𝐗))1=12((𝐈𝐗¯)|range(𝐈𝐗))1=12=0(𝐗¯|range(𝐈𝐗)).

Proof of Theorem 1.

First assume that 𝐌 is RDD. Applying Lemmas 12 and 13, we have 𝐃𝐌1𝐀𝐌1 and

((𝐈𝐃𝐌1𝐀𝐌)|range(𝐈𝐃𝐌1𝐀𝐌))1=12=0(12(𝐈+𝐃𝐌1𝐀𝐌)|range(𝐈𝐃𝐌1𝐀𝐌)).

As 𝒃range(𝐌)=range(𝐃𝐌𝐀𝐌), we have 𝐃𝐌1𝒃range(𝐈𝐃𝐌1𝐀𝐌), so 𝒙 converges to

12=0(12(𝐈+𝐃𝐌1𝐀𝐌))𝐃𝐌1𝒃 =12=0(12(𝐈+𝐃𝐌1𝐀𝐌)|range(𝐈𝐃𝐌1𝐀𝐌))𝐃𝐌1𝒃
=((𝐈𝐃𝐌1𝐀𝐌)|range(𝐈𝐃𝐌1𝐀𝐌))1𝐃𝐌1𝒃.

Thus, we can check that

𝐌𝒙=𝐃𝐌(𝐈𝐃𝐌1𝐀𝐌)((𝐈𝐃𝐌1𝐀𝐌)|range(𝐈𝐃𝐌1𝐀𝐌))1𝐃𝐌1𝒃=𝐃𝐌𝐃𝐌1𝒃=𝒃.

Next, observe that for any Hölder conjugates p,q, we have

𝐃𝐌1/p(12(𝐈+𝐃𝐌1/q𝐀𝐌𝐃𝐌1/p))𝐃𝐌1/q
=𝐃𝐌1/p(12𝐃𝐌1/p(𝐈+𝐃𝐌1𝐀𝐌)𝐃𝐌1/p)𝐃𝐌1/q=(12(𝐈+𝐃𝐌1𝐀𝐌))𝐃𝐌1

for each 0, so

𝒙=12𝐃𝐌1/p=0(12(𝐈+𝐃𝐌1/q𝐀𝐌𝐃𝐌1/p))𝐃𝐌1/q𝒃

for any Hölder conjugates p,q.

When 𝐌 is CDD, we consider the expression 𝒙=12𝐃𝐌1=0(12(𝐈+𝐀𝐌𝐃𝐌1))𝒃. We have 𝐀𝐌𝐃𝐌111 by Lemma 12, so applying the above arguments shows that 𝒙 converges to 𝐃𝐌1((𝐈𝐀𝐌𝐃𝐌1)|range(𝐈𝐀𝐌𝐃𝐌1))1𝒃 and

𝐌𝒙=(𝐈𝐀𝐌𝐃𝐌1)𝐃𝐌𝐃𝐌1((𝐈𝐀𝐌𝐃𝐌1)|range(𝐈𝐀𝐌𝐃𝐌1))1𝒃=𝒃.

So we have proved that 𝒙 is well-defined and satisfies 𝐌𝒙=𝒃 when 𝐌 is RDD or CDD.

Next, assume that 𝐌 is SDD. Then 𝐌 is RCDD and Lemma 12 implies that 𝐀~𝐌21. Repeating the above arguments with 𝐃𝐌1/2𝒃range(𝐈𝐀~𝐌) gives

𝒙=𝐃𝐌1/2((𝐈𝐀~𝐌)|range(𝐈𝐀~𝐌))1𝐃𝐌1/2𝒃=𝐃𝐌1/2(𝐌~|range(𝐌~))1𝐃𝐌1/2𝒃.

Since 𝐌 is symmetric, 𝐌~ is also symmetric, so range(𝐌~)=ker(𝐌~). By the property of the pseudoinverse, we have 𝒙=𝐃𝐌1/2(𝐌~|ker(𝐌~))1𝐃𝐌1/2𝒃=𝐃𝐌1/2𝐌~+𝐃𝐌1/2𝒃, finishing the proof.

Recall that if 𝐌 is RDD, then 1𝐃𝐌1𝐀𝐌=minj[n]{d𝐌(j)kj|𝐌(j,k)|d𝐌(j)}0, and this quantity measures how strongly the diagonal entries dominate the off-diagonal entries in each row. However, this quantity can equal zero, making it unsuitable as a useful notion of “gap.” In contrast, our Theorem 2 shows that the maximum p-norm gap γmax(𝐌) is always strictly positive when 𝐌 is RDD/CDD. As an example, γ(𝐌)=112(𝐈+𝐃𝐌1𝐀𝐌)|range(𝐈𝐃𝐌1𝐀𝐌), which refines the quantity 1𝐃𝐌1𝐀𝐌 by replacing 𝐃𝐌1𝐀𝐌 with 12(𝐈+𝐃𝐌1𝐀𝐌) and restricting the operator to range(𝐈𝐃𝐌1𝐀𝐌).

Although the p-norm gaps only involve operator norms for the restricted linear maps, Theorem 3 shows that they are sufficient to bound the truncation error between 𝒕𝒙L and 𝒕𝒙. The proofs of Theorems 2 and 3 are given in the full version of this paper [28].

4 Random-Walk Sampling

This section presents a Monte Carlo algorithm for estimating 𝒕𝒙 via random-walk sampling. All our algorithms in this paper aim to estimate 𝒕𝒙 by approximating 𝒕𝒙L. By Theorem 3 and our setting of L, it suffices to ensure that the estimate x^ satisfies that |x^𝒕𝒙L| is at most half the desired accuracy guarantee with probability at least 3/4, and we will omit this matter in the following proofs.

We first focus on RDD systems and transfer the results to CDD systems by transposing the expression of 𝒕𝒙L at the end of this section. When 𝐌 is RDD, 12(𝐈+𝐃𝐌1|𝐀𝐌|) is row substochastic and we can estimate

𝒕𝒙L=12𝒕=0L1(12(𝐈+𝐃𝐌1𝐀𝐌))𝐃𝐌1𝒃

by generating random-walk sampling from |𝒕|/𝒕1. In each sample, we first sample a walk length from [0,L1] uniformly at random and sample a source coordinate v from the distribution |𝒕|/𝒕1. Then we simulate a lazy random walk for steps starting from v according to the transition matrix 12(𝐈+𝐃𝐌1|𝐀𝐌|). Specifically, at each step from v, with probability 12 the walk stays put at v, with probability |𝐀𝐌(u,v)|2d𝐌(v) the walk moves to each u[n], and with the remaining probability 12u[n]|𝐀𝐌(u,v)|2d𝐌(v) the walk terminates, where d𝐌(v):=𝐌(v,v). Note that these probabilities lie in [0,1] since 𝐌 is RDD. Additionally, we keep track of the product of the signs of the initial entry in 𝒕 and the entries in 𝐀𝐌 along the walk (where stay-put steps have sign 1), which is denoted as σ. After steps, if the walk has not terminated and is at coordinate v, we take the value σ12𝒕1𝒃(v)d𝐌(v)L as the estimate of this sample. We repeat this process for ns independent samples and return the average as the final estimate x^. We give a pseudocode for this approach in the full version of this paper [28].

We emphasize that our sampling scheme is different from the framework in [4], in that we first sample the walk length and then perform steps of the random walk, while they perform L steps in each sample and take the quantities obtained in each step into account. As it turns out, our scheme is easier to analyze and will save a factor of logL in the number of samples.

We first establish the unbiasedness of the sampling scheme, whose proof is given in the full version of this paper [28].

Lemma 14.

Each sample described above gives an unbiased estimate of 𝐭𝐱L.

We can now prove Theorem 4 by applying the Hoeffding bound.

Proof of Theorem 4.

We use random-walk sampling as described above. The algorithm returns x^ as the average of ns independent samples, where each sampled value has absolute value at most 12𝒕1𝐃𝐌1𝒃L. Thus, by Lemma 14 and the Hoeffding bound, Pr{|x^𝒕𝒙L|12ε𝐃𝐌1𝒃} is upper bounded by

2exp(2ns(12ε𝐃𝐌1𝒃)2(𝒕1𝐃𝐌1𝒃L)2)=2exp(nsε22𝒕12L2).

To guarantee that this probability is at most 1/4, we set ns:=Θ(𝒕12L2/ε2).

As each sample simulates at most L steps of random walk, and each step takes O(frow(𝐌)) time, the time complexity is O(frow(𝐌)Lns)=O(frow(𝐌)𝒕12L3/ε2), as desired.

The proof of Theorem 5 relies on a variance analysis when 𝐀𝐌, 𝒃, and 𝒕 are nonnegative. Additionally, Theorem 6 assumes that 𝐃𝐌1𝒃 is known and provides a relative error guarantee. Its proof relies on the Stopping Rule Theorem given in [15] for adaptively setting the number of samples in Monte Carlo estimation. The proofs of Theorems 5 and 6 are given in the full version of this paper [28].

On the other hand, if 𝐌 is CDD, then 𝐌 is RDD and 12(𝐈+𝐃𝐌1|𝐀𝐌|) is row substochastic. As we can transpose the expression of 𝒕𝒙L to obtain

𝒕𝒙L=12𝒃𝐃𝐌1=0L1(12(𝐈+𝐀𝐌𝐃𝐌1))𝒕=12𝒃=0L1(12(𝐈+𝐃𝐌1𝐀𝐌))𝐃𝐌1𝒕,

our algorithms and results for RDD systems (except Theorem 5) applies to CDD systems by interchanging 𝒕 with 𝒃 and replacing 𝐀𝐌 with 𝐀𝐌. This justifies our claim that we can derive symmetric results by replacing RDD/RDDZ by CDD/CDDZ, swapping 𝒃 and 𝒕 (except in 𝒕𝒙), and replacing frow(𝐌) by fcol(𝐌) in the theorem statements. This argument also straightforwardly applies to our subsequent algorithms and results based on local push and the bidirectional method.

5 The Local Push Method

In this section, we adapt the local push methods to estimate 𝒕𝒙L. We first describe our Push algorithm as a primitive that can be applied to both RDD and CDD systems. After that, we establish different properties of Push for RDD and CDD systems. This leads to our proof for Theorem 7.

For both RDD and CDD systems 𝐌𝒙=𝒃, we describe our Push algorithm as a primitive that can be used for approximating the vector 2𝒙L==0L1(12(𝐈+𝐃𝐌1𝐀𝐌))𝐃𝐌1𝒃. The pseudocode is given in Algorithm 1.

In the Push algorithm, the initialization step sets the reserve and residue vectors to 𝟎, except that 𝒓(0) is set to be 𝐃𝐌1𝒃, which requires O(𝒃0) time if we assume that we can scan through the nonzero entries of 𝒃 in O(𝒃0) time. Next, the main loop iterates over levels from 0 to L2. At each level , the algorithm performs a local push operation on each coordinate v whose residue 𝒓()(v) exceeds the threshold rmax in absolute value. The push operation on v at level sets the reserve 𝒑()(v) to 𝒓()(v), increments 𝒓(+1) by 12(𝐈+𝐃𝐌1𝐀𝐌)(𝒓()(v)𝒆v), and sets 𝒓()(v) to 0. In effect, the second step in the push operation increases 𝒓(+1)(v) by 12𝒓()(v) and increases 𝒓(+1)(u) by 𝐀(v,u)2d𝐌(u)𝒓()(v) for each u[n] with 𝐀(v,u)0, which can be done in 𝐌(,v)0 time given oracle row/column access to 𝐌.

Algorithm 1 Push(𝐌,𝒃,L,rmax).

The following lemma gives the key invariant property of the Push algorithm. Its proof is based on induction and can be found in the full version of this paper [28].

Lemma 15.

The push operations preserve the following invariant:

=0L1(12(𝐈+𝐃𝐌1𝐀𝐌))𝐃𝐌1𝒃==0L1𝒑()+=0L1=0L1(12(𝐈+𝐃𝐌1𝐀𝐌))𝒓().

In light of this invariant, we use 12𝒕(=0L1𝒑()+𝒓(L1)) as an estimate of 𝒕𝒙L. Note that this quantity can be computed during the push process. The next lemma shows that if 𝐌 is RDD, then the absolute error between this quantity and 𝒕𝒙L can be bounded.

Lemma 16.

If 𝐌 is RDD, then |12𝐭(=0L1𝐩()+𝐫(L1))𝐭𝐱L|12𝐭1L2rmax.

Proof.

By Lemma 15 and Equation (5), we have

|12𝒕(=0L1𝒑()+𝒓(L1))𝒕𝒙L|=12|=0L2=0L1𝒕(12(𝐈+𝐃𝐌1𝐀𝐌))𝒓()|
12=0L2=0L1𝒕1(12(𝐈+𝐃𝐌1𝐀𝐌))𝒓()12𝒕1L2rmax,

where we used 12(𝐈+𝐃𝐌1𝐀𝐌)12(1+𝐃𝐌1𝐀𝐌)1 for RDD 𝐌 and 𝒓()rmax for any [0,L2] as guaranteed by the process of Push.

We will also use the following inequality version of the invariant to bound the running time of the Push algorithm, whose proof is similar to that of the invariant equation and can be found in the full version of this paper [28].

Lemma 17.

The push operations preserve the following inequality:
=0L1(12(𝐈+𝐃𝐌1|𝐀𝐌|))𝐃𝐌1|𝐛|=0L1|𝐩()|+=0L1=0L1(12(𝐈+𝐃𝐌1|𝐀𝐌|))|𝐫()|.

The next lemma bounds the complexity of the Push algorithm by a convoluted expression. We will shortly see how to simplify this expression for some special systems and settings.

Lemma 18.

Suppose that we can scan through the nonzero entries of 𝐛 in O(𝐛0) time. Then the complexity of the Push algorithm is bounded by

O(𝒃0+1rmaxv[n]𝐌(,v)0𝒆v=0L1(12(𝐈+𝐃𝐌1|𝐀𝐌|))𝐃𝐌1|𝒃|).

Proof.

Observe that each time a push operation is performed on a coordinate v[n], the value of 𝒆v=0L1|𝒑()| is increased by at least rmax. However, by Lemma 17, 𝒆v=0L1|𝒑()| is always upper bounded by 𝒆v=0L1(12(𝐈+𝐃𝐌1|𝐀𝐌|))𝐃𝐌1|𝒃|. Therefore, the total number of push operations performed on v is at most 1rmax𝒆v=0L1(12(𝐈+𝐃𝐌1|𝐀𝐌|))𝐃𝐌1|𝒃|. Since each push operation on v takes O(𝐌(,v)0) time, the lemma follows by summing the cost of the push operations over all v[n] and adding the O(𝒃0) time for initialization.

If 𝐌 is CDD and the nonzero entries of 𝐌 have absolute values of Ω(1), the next lemma shows that the complexity of Push can be simplified. Its proof is given in the full version of this paper [28].

Lemma 19.

Suppose that 𝐌 is CDD, the nonzero entries of 𝐌 have absolute values of Ω(1), and we can scan through the nonzero entries of 𝐛 in O(𝐛0) time. Then the complexity of the Push algorithm is O(𝐛0+𝐛1L/rmax).

Having established the properties of the Push algorithm for RDD and CDD systems, we can readily combine them to prove Theorem 7 for RCDD systems.

Proof of Theorem 7.

We run Push with rmax:=ε/L2 and use 12𝒕(=0L1𝒑()+𝒓(L1)) as the result. Since 𝐌 is RDD, by Lemma 16, the absolute error between the estimate and 𝒕𝒙L is upper bounded by 12𝒕1L2rmax12ε𝒕1. Since 𝐌 is CDD and its nonzero entries have absolute values of Ω(1), by Lemma 19, the time complexity of the Push algorithm is O(𝒃0+𝒃1L/rmax)=O(𝒃0+𝒃1L3/ε). This finishes the proof.

6 The Bidirectional Method

This section combines the techniques of random-walk sampling and local push to develop bidirectional algorithms for estimating 𝒕𝒙, which leads to Theorem 8.

The framework of the bidirectional method for RDD systems is presented in Algorithm 2. First, we invoke the Push algorithm to obtain the reserve and residue vectors. Recall that the invariant equation of Push (Lemma 15) implies that

𝒕𝒙L12𝒕(=0L1𝒑()+𝒓(L1))=12𝒕=0L2=0L1(12(𝐈+𝐃𝐌1𝐀𝐌))𝒓()
=12𝒕=0L1(12(𝐈+𝐃𝐌1𝐀𝐌))(=0min(L1,L2)𝒓()).

So, instead of directly using 12𝒕(=0L1𝒑()+𝒓(L1)) as an estimate of 𝒕𝒙L, we estimate the right-hand side of the above equation using random-walk samplings from |𝒕|/𝒕1 to reduce approximation error. We employ the same sampling scheme as described in Section 4 to obtain a walk length and the coordinate v reached by the random walk after steps, but each sampled value now involves the summation =0min(L1,L2)𝒓()(v). We take the average of the estimates across ns independent samples and add it to 12𝒕(=0L1𝒑()+𝒓(L1)) to obtain the final estimate x^.

We note that, compared to the sampling scheme in [14], our approach eliminates the need for using additional data structures to maintain the prefix sums of residues, since we can directly compute =0min(L1,L2)𝒓()(v) in O(L) time per sample without increasing the asymptotic complexity.

Algorithm 2 bidirectional method for RDD systems.

The next lemma establishes the unbiasedness of the bidirectional estimator.

Lemma 20.

The sum of 12𝐭(=0L1𝐩()+𝐫(L1)) and each sampled value in the bidirectional method described above gives an unbiased estimate of 𝐭𝐱L.

Proof.

Following the proof of Lemma 14, we can show that the expectation of each sampled value is 12𝒕=0L1(12(𝐈+𝐃𝐌1𝐀𝐌))(=0min(L1,L2)𝒓()). Combining this with the invariant equation in Lemma 15 completes the proof.

Next, we prove Theorem 8 by proving the two stated complexity bounds separately in the following two lemmas. Their proofs are partly inspired by [14] and [52], respectively. The proof of Lemma 22 uses variance analysis and is given in the full version of this paper [28].

Lemma 21.

Suppose the same assumptions as in Theorem 8. Then there exists a randomized algorithm that computes a x^ such that Pr{|x^𝐭𝐱|ε}34 in time O(𝐛0) plus

O(frow(𝐌)1/3𝒕12/3𝒃12/3L7/3ε2/3).

Proof.

We use the bidirectional method as in Algorithm 2. Note that each sampled value equals σ12𝒕1L=0min(L1,L2)𝒓()(v) for some σ{0,±1} and v[n] and the Push algorithm ensures that 𝒓()rmax for each [0,L2]. Thus, the absolute value of each sampled value is at most 12𝒕1L2rmax. Using the Hoeffding bound, it follows that

Pr{|x^𝒕𝒙L|12ε}2exp(2ns(12ε)2(𝒕1L2rmax)2)=2exp(nsε22𝒕12L4rmax2).

To guarantee that this probability is at most 1/4, we set ns:=Θ(𝒕12L4rmax2/ε2), where rmax will be determined shortly.

By Lemma 19, the push cost is O(𝒃1L/rmax). We set rmax:=ε2/3𝒃11/3frow(𝐌)1/3𝒕12/3L4/3, where 𝒃1 can be computed in O(𝒃0) time given our assumptions. Consequently, the cost for random-walk sampling and push both becomes O(frow(𝐌)1/3𝒕12/3𝒃12/3L7/3ε2/3), completing the proof.

Lemma 22.

Suppose the same assumptions as in Theorem 8. Then there exists a randomized algorithm that computes a x^ such that Pr{|x^𝐭𝐱|ε}34 in time O(𝐛0) plus

O(frow(𝐌)1/2𝒕1𝒃11/2𝐃𝐌1𝒃1/2L5/2ε1).

Proof of Theorem 8.

The theorem follows from Lemmas 21 and 22.

7 Connections with PageRank Computation

This section discusses the connections between our framework and PageRank computation and presents proofs for Theorems 9 and 11.

As mentioned in Section 1, the PPR equations (1) and (2) and PageRank contribution equations (3) and (4) can be formulated as RDD/CDD systems. For example, by Equations (1) and (3), for a node tV, setting 𝐌=𝐈(1α)𝐀G𝐃G1,𝒃=αn𝟏,𝒕=𝒆t or 𝐌=𝐈(1α)𝐃G1𝐀G,𝒃=α𝒆t,𝒕=1n𝟏 in our formulation both yield 𝒕𝒙=𝝅G,α(t); by Equation (2), for nodes s,tV, setting 𝐌=𝐃G(1α)𝐀G,𝒃=α𝒆s,𝒕=𝒆t yields 𝒕𝒙=𝝅G,α(s,t)/dG+(t). It is worth noting that on Eulerian graphs, Equations (2) and (4) are RCDD systems, and on undirected graphs, they are SDD systems.

By the definition of the p-norm gaps, we have

γ1(𝐈(1α)𝐀G𝐃G1)=112(𝐈+(1α)𝐀G𝐃G1)|range(𝐈(1α)𝐀G𝐃G1)1
=112(𝐈+(1α)𝐀G𝐃G1)1=112(1+(1α))=12α,

where we used range(𝐈(1α)𝐀G𝐃G1)=n since 𝐈(1α)𝐀G𝐃G1 is invertible. Similarly, we have γ1(𝐃G(1α)𝐀G)=γ(𝐈(1α)𝐃G1𝐀G)=γ(𝐃G(1α)𝐀G)=12α. Thus, 12α serves as a lower bound on the maximum p-norm gap of all these matrices involved in the PPR and PageRank contribution equations.

7.1 Results for PageRank Computation when 𝐃𝑮(𝟏𝜶)𝐀𝑮 is RCDD

Our framework provides new insights and results for PageRank computation, in particular when the involved system is RCDD. Theorem 9 stated in the introduction is one such example, which shows that previous results for single-node PageRank computation on undirected graphs can be improved and generalized to Eulerian graphs.

To prove Theorem 9, we investigate the case when the matrix 𝐃G(1α)𝐀G in Equation (2) is RCDD. This matrix is CDD, and it is also RDD if dG+(v)(1α)dG(v) holds for all vV. In particular, this condition holds when G is Eulerian or α is large enough. Now, by applying Theorem 6, we directly obtain the following result.

Theorem 23.

For any unweighted graph G and decay factor α, suppose that dG+(v)(1α)dG(v) holds for all vV. Then there exists a randomized algorithm that, given tV, δG+, and accuracy parameter ε, computes an estimate of 𝛑G,α(t) within relative error ε with success probability at least 3/4 in time

O~(1αε2dG+(t)δG+1n𝝅G,α(t)),

where O~ hides polylog(nαε) factors.

Proof.

Consider applying Theorem 6 to Equation (2) with 𝒔=1n𝟏 and 𝒕=𝒆t. Note that the corresponding 𝐃𝐌1𝒃 equals αnδG+, L=O~(1/α), frow(𝐌)=O(1), 𝒕1=1, and the obtained (1±ε)-multiplicative approximation of 𝒕𝒙=𝝅G,α(t)/dG+(t) directly yields a (1±ε)-multiplicative approximation of 𝝅G,α(t). Therefore, the time complexity is

O~(α/(nδG+)α2ε2𝝅G,α(t)/dG+(t))=O~(1αε2dG+(t)δG+1n𝝅G,α(t)),

as desired.

To prove Theorem 9, we establish some lower bounds on PageRank values in the next lemma, which may be of independent interest. This lemma is partly inspired by [8, Lemma 5.13], [46, 45], and [47, Theorem 1.1], and we give its proof in the full version of this paper [28].

Lemma 24.

For any weighted directed graph G and tV, we have

𝝅G,α(t)max(αn,α(1α)dG(t)nΔG+,α(1α)dG(t)2n𝐀G(,t)𝐀G1,1,α(1α)dG(t)2nn𝐀G(,t)2𝐀GF),

where 𝐀G1,1:=u,v[n]|𝐀G(u,v)| and 𝐀GF:=u,vV𝐀G(u,v)2 denote the entrywise 1-norm and the Frobenius norm, respectively. If G is Eulerian, we further have 𝛑G,α(t)dG(t)nΔG; if G is unweighted Eulerian, we further have 𝛑G,α(t)1αdG(t)nm.

Proof of Theorem 9.

Theorem 23 gives the complexity bound O~(1αε2dG(t)δG1n𝝅G,α(t)). By Lemma 24, on unweighted Eulerian graphs, we have

𝝅G,α(t) max(αn,α(1α)dG(t)2n𝐀G(,t)𝐀G1,1,dG(t)nΔG,1αdG(t)nm)
=max(αn,α(1α)dG(t)2nm,dG(t)nΔG,1αdG(t)nm).

By plugging these lower bounds on 𝝅G,α(t) into the complexity bound, we obtain the desired results up to polylog(nαε) factors (where we omit the terms of 1/(1α) since we often consider the case when α0). These polylog(nαε) factors can be removed by using non-truncated random walks for sampling (cf. [45]), leading to the stated complexity bounds.

7.2 A Lower Bound on the Accuracy Parameter for SDD Solvers

This subsection proves Theorem 11. To this end, we establish the following reduction from single-node PageRank computation on undirected graphs to solving SDD systems.

Lemma 25.

Suppose that there exists a randomized algorithm that computes an estimate x^t such that Pr{|x^t𝐱(t)|ε𝐱}34 for any SDD system 𝐒𝐱=𝐛 in O(γνετ) time. Then there exists a randomized algorithm that, given δG, estimates 𝛑G,α(t) on unweighted undirected graphs G within constant relative error and with success probability at least 3/4 in time O((dG(t)/δG)τ/αν+τ).

To prove this lemma, we use the following upper bound on 𝝅G,α(t) on Eulerian graphs, whose proof is given in the full version of this paper [28].

Lemma 26.

On any Eulerian graph G and vV, we have 𝛑G,α(v)dG(v)nδG.

Proof of Lemma 25.

Consider the PageRank equation (2) with 𝒔=1/n𝟏. When G is undirected, the matrix 𝐃G(1α)𝐀G is SDD. By setting γ:=α/2 and ε:=Θ(αδG/dG(t)), the supposed algorithm can compute an estimate x^t such that |x^t𝝅G,α(t)dG(t)|εmaxvV{𝝅G,α(v)dG(v)} w.p. at least 3/4 in time O(γνετ)=O((dG(t)/δG)τ/αν+τ). Using Lemma 26 and 𝝅G,α(t)α/n, we have maxvV{𝝅G,α(v)dG(v)}1nδG𝝅G,α(t)αδG. Thus, with probability at least 3/4, |dG(t)x^t𝝅G,α(t)|εdG(t)𝝅G,α(t)αδG=Θ(𝝅G,α(t)), so dG(t)x^t is an estimate of 𝝅G,α(t) within constant relative error, completing the proof.

Proof of Theorem 11.

[45] establishes a complexity lower bound of Ω(dG(t)/δG) for estimating 𝝅G,α(t) within constant relative error with constant success probability on unweighted undirected graphs, where α is constant and the bound holds for any possible combination of δG and dG(t). This lower bound applies to the number of queries to the graph structure. Therefore, combining this lower bound with Lemma 25 and noting that the reduction uses ε:=Θ(αδG/dG(t))=Ω(1/n) yield the desired lower bound of Ω(1/ε) for ε=Ω(1/n).

8 Connections with Effective Resistance Computation

This section justifies the relationship between our framework and effective resistance computation on graphs in Lemma 27 and proves Corollary 10.

Recall that in the context of computing effective resistances, we assume that G is undirected and connected. In our framework, we set 𝐌=𝐋G and 𝒃=𝒕=𝒆s𝒆t. By Theorem 2, γmax(𝐋G)=γ(𝐋G), so a lower bound γ on the spectral gap γ(𝐋G) serves as a lower bound on the maximum p-norm gap γmax(𝐋G). The following lemma states that with this setting, the quantity 𝒕𝒙 that our algorithms approximate equals the effective resistance RG(s,t).

Lemma 27.

When 𝐌=𝐋G and 𝐛=𝐭=𝐞s𝐞t, we have 𝐭𝐱=RG(s,t).

Lemma 27 can be proved using the results in [7], and we provide a different self-contained proof in the full version of this paper [28].

Now we can directly apply Theorems 4 and 8 to prove Corollary 10. The only remaining detail in the proof is to derive a better setting of L for the case 𝒃=𝒕=𝒆s𝒆t.

Proof of Corollary 10.

Following the proof of Theorem 3, we have |𝒕𝒙L𝒕𝒙|12γeγL𝐃𝐌1/2𝒕2𝐃𝐌1/2𝒃2=12γeγL(1dG(s)+1dG(t)) when 𝐌=𝐋G and 𝒃=𝒕=𝒆s𝒆t. Thus, setting L:=Θ(1γlog(1γε(1dG(s)+1dG(t)))) ensures that |𝒕𝒙L𝒕𝒙|12ε. The corollary then follows by applying Theorems 4 and 8 with this setting of L and noting that 𝐃𝐌1𝒃=1/min(dG(s),dG(t)) and frow(𝐌)=O(1) in this case.

References

  • [1] Reid Andersen, Christian Borgs, Jennifer T. Chayes, John E. Hopcroft, Vahab S. Mirrokni, and Shang-Hua Teng. Local computation of PageRank contributions. Internet Mathematics, 5(1):23–45, 2008. doi:10.1080/15427951.2008.10129302.
  • [2] Reid Andersen, Fan R. K. Chung, and Kevin J. Lang. Using PageRank to locally partition a graph. Internet Mathematics, 4(1):35–64, 2007. doi:10.1080/15427951.2007.10129139.
  • [3] Alexandr Andoni, Robert Krauthgamer, and Yosef Pogrow. On solving linear systems in sublinear time. CoRR, abs/1809.02995, 2018. doi:10.48550/arXiv.1809.02995.
  • [4] Alexandr Andoni, Robert Krauthgamer, and Yosef Pogrow. On solving linear systems in sublinear time. In Proceedings of the 10th Innovations in Theoretical Computer Science Conference, volume 124, pages 3:1–3:19, 2019. doi:10.4230/LIPIcs.ITCS.2019.3.
  • [5] Siddhartha Banerjee and Peter Lofgren. Fast bidirectional probability estimation in Markov models. In Advances in Neural Information Processing Systems 28, pages 1423–1431, 2015. URL: https://proceedings.neurips.cc/paper/2015/hash/ede7e2b6d13a41ddf9f4bdef84fdc737-Abstract.html.
  • [6] Christian Bertram, Mads Vestergaard Jensen, Mikkel Thorup, Hanzhi Wang, and Shuyi Yan. Estimating random-walk probabilities in directed graphs. CoRR, abs/2504.16481, 2025. doi:10.48550/arXiv.2504.16481.
  • [7] Enrico Bozzo. The Moore-Penrose inverse of the normalized graph Laplacian. Linear Algebra and its Applications, 439(10):3038–3043, 2013. doi:10.1016/j.laa.2013.08.039.
  • [8] Marco Bressan, Enoch Peserico, and Luca Pretto. Sublinear algorithms for local graph-centrality estimation. SIAM Journal on Computing, 52(4):968–1008, 2023. doi:10.1137/19M1266976.
  • [9] Sergey Brin and Lawrence Page. The anatomy of a large-scale hypertextual web search engine. Computer Networks, 30(1-7):107–117, 1998. doi:10.1016/S0169-7552(98)00110-X.
  • [10] Dongrun Cai, Xue Chen, and Pan Peng. Effective resistances in non-expander graphs. In Proceedings of the 31st Annual European Symposium on Algorithms, volume 274, pages 29:1–29:18, 2023. doi:10.4230/LIPIcs.ESA.2023.29.
  • [11] Michael B. Cohen, Jonathan A. Kelner, Rasmus Kyng, John Peebles, Richard Peng, Anup B. Rao, and Aaron Sidford. Solving directed Laplacian systems in nearly-linear time through sparse LU factorizations. In Proceedings of the 59th IEEE Symposium on Foundations of Computer Science, pages 898–909, 2018. doi:10.1109/FOCS.2018.00089.
  • [12] Michael B. Cohen, Jonathan A. Kelner, John Peebles, Richard Peng, Anup B. Rao, Aaron Sidford, and Adrian Vladu. Almost-linear-time algorithms for Markov chains and new spectral primitives for directed graphs. In Proceedings of the 49th Annual ACM Symposium on Theory of Computing, pages 410–419, 2017. doi:10.1145/3055399.3055463.
  • [13] Michael B. Cohen, Jonathan A. Kelner, John Peebles, Richard Peng, Aaron Sidford, and Adrian Vladu. Faster algorithms for computing the stationary distribution, simulating random walks, and more. In Proceedings of the 57th IEEE Symposium on Foundations of Computer Science, pages 583–592, 2016. doi:10.1109/FOCS.2016.69.
  • [14] Guanyu Cui, Hanzhi Wang, and Zhewei Wei. Mixing time matters: Accelerating effective resistance estimation via bidirectional method. In Proceedings of the 31st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 177–188, 2025. doi:10.1145/3690624.3709298.
  • [15] Paul Dagum, Richard M. Karp, Michael Luby, and Sheldon M. Ross. An optimal algorithm for Monte Carlo estimation. SIAM Journal on Computing, 29(5):1484–1496, 2000. doi:10.1137/S0097539797315306.
  • [16] Dean Doron, François Le Gall, and Amnon Ta-Shma. Probabilistic logarithmic-space algorithms for Laplacian solvers. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, volume 81, pages 41:1–41:20, 2017. doi:10.4230/LIPIcs.APPROX-RANDOM.2017.41.
  • [17] Peter G. Doyle and J. Laurie Snell. Random walks and electric networks, volume 22. The Mathematical Association of America, 1984.
  • [18] Rajat Vadiraj Dwaraknath, Ishani Karmarkar, and Aaron Sidford. Towards optimal effective resistance estimation. In Advances in Neural Information Processing Systems 36, 2023. URL: http://papers.nips.cc/paper_files/paper/2023/hash/b8e2046160a568145af6d42eeef199f4-Abstract-Conference.html.
  • [19] Dániel Fogaras, Balázs Rácz, Károly Csalogány, and Tamás Sarlós. Towards scaling fully Personalized PageRank: Algorithms, lower bounds, and experiments. Internet Mathematics, 2(3):333–358, 2005. doi:10.1080/15427951.2005.10129104.
  • [20] George E. Forsythe and Richard A. Leibler. Matrix inversion by a Monte Carlo method. Mathematics of Computation, 4(31):127–129, 1950. URL: https://www.ams.org/journals/mcom/1950-04-031/S0025-5718-1950-0038138-X/home.html.
  • [21] David F. Gleich. Pagerank beyond the web. SIAM Review, 57(3):321–363, 2015. doi:10.1137/140976649.
  • [22] Oded Goldreich and Dana Ron. Property testing in bounded degree graphs. Algorithmica, 32(2):302–343, 2002. doi:10.1007/S00453-001-0078-7.
  • [23] Oded Goldreich and Dana Ron. On testing expansion in bounded-degree graphs. In Studies in Complexity and Cryptography, volume 6650, pages 68–75. Springer, 2011. doi:10.1007/978-3-642-22670-0_9.
  • [24] Aram W. Harrow, Avinatan Hassidim, and Seth Lloyd. Quantum algorithm for linear systems of equations. Physical Review Letters, 103(15):150502, 2009. doi:10.1103/PhysRevLett.103.150502.
  • [25] Arun Jambulapati, Sushant Sachdeva, Aaron Sidford, Kevin Tian, and Yibin Zhao. Eulerian graph sparsification by effective resistance decomposition. In Proceedings of the 2025 ACM-SIAM Symposium on Discrete Algorithms, pages 1607–1650, 2025. doi:10.1137/1.9781611978322.50.
  • [26] Arun Jambulapati and Aaron Sidford. Ultrasparse ultrasparsifiers and faster Laplacian system solvers. In Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms, pages 540–559, 2021. doi:10.1137/1.9781611976465.33.
  • [27] Satyen Kale, Yuval Peres, and C. Seshadhri. Noise tolerance of expanders and sublinear expansion reconstruction. SIAM Journal on Computing, 42(1):305–323, 2013. doi:10.1137/110837863.
  • [28] Tsz Chiu Kwok, Zhewei Wei, and Mingji Yang. On solving asymmetric diagonally dominant linear systems in sublinear time. CoRR, abs/2509.13891, 2025. doi:10.48550/arXiv.2509.13891.
  • [29] Amy Nicole Langville and Carl Dean Meyer. Survey: Deeper inside PageRank. Internet Mathematics, 1(3):335–380, 2003. doi:10.1080/15427951.2004.10129091.
  • [30] Lawrence Li and Sushant Sachdeva. A new approach to estimating effective resistances and counting spanning trees in expander graphs. In Proceedings of the 2023 ACM-SIAM Symposium on Discrete Algorithms, pages 2728–2745, 2023. doi:10.1137/1.9781611977554.ch102.
  • [31] Peter Lofgren, Siddhartha Banerjee, and Ashish Goel. Bidirectional PageRank estimation: from average-case to worst-case. In Proceedings of the 12th International Workshop on Algorithms and Models for the Web-Graph, volume 9479, pages 164–176, 2015. doi:10.1007/978-3-319-26784-5_13.
  • [32] Peter Lofgren, Siddhartha Banerjee, and Ashish Goel. Personalized PageRank estimation and search: a bidirectional approach. In Proceedings of the 9th ACM International Conference on Web Search and Data Mining, pages 163–172, 2016. doi:10.1145/2835776.2835823.
  • [33] Peter Lofgren, Siddhartha Banerjee, Ashish Goel, and Seshadhri Comandur. FAST-PPR: Scaling Personalized PageRank estimation for large graphs. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1436–1445, 2014. doi:10.1145/2623330.2623745.
  • [34] László Lovász. Random walks on graphs: a survey. Combinatorics, Paul Erdös is Eighty, 2(1-46):4, 1993.
  • [35] Pan Peng, Daniel Lopatta, Yuichi Yoshida, and Gramoz Goranci. Local algorithms for estimating effective resistance. In Proceedings of the 27th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1329–1338, 2021. doi:10.1145/3447548.3467361.
  • [36] Ronitt Rubinfeld, Gil Tamir, Shai Vardi, and Ning Xie. Fast local computation algorithms. In Proceedings of Innovations in Computer Science, pages 223–238, 2011. URL: http://conference.iiis.tsinghua.edu.cn/ICS2011/content/papers/36.html.
  • [37] Nitin Shyamkumar, Siddhartha Banerjee, and Peter Lofgren. Sublinear estimation of a single element in sparse linear systems. In Proceedings of the 54th Annual Allerton Conference on Communication, Control, and Computing, pages 856–860, 2016. doi:10.1109/ALLERTON.2016.7852323.
  • [38] Daniel A. Spielman and Nikhil Srivastava. Graph sparsification by effective resistances. SIAM Journal on Computing, 40(6):1913–1926, 2011. doi:10.1137/080734029.
  • [39] Daniel A. Spielman and Shang-Hua Teng. Nearly-linear time algorithms for graph partitioning, graph sparsification, and solving linear systems. In Proceedings of the 36th Annual ACM Symposium on Theory of Computing, pages 81–90, 2004. doi:10.1145/1007352.1007372.
  • [40] Daniel A. Spielman and Shang-Hua Teng. Nearly linear time algorithms for preconditioning and solving symmetric, diagonally dominant linear systems. SIAM Journal on Matrix Analysis and Applications, 35(3):835–885, 2014. doi:10.1137/090771430.
  • [41] Amnon Ta-Shma. Inverting well conditioned matrices in quantum logspace. In Proceedings of the 45th Annual ACM Symposium on Theory of Computing, pages 881–890, 2013. doi:10.1145/2488608.2488720.
  • [42] Shang-Hua Teng. The Laplacian Paradigm: Emerging algorithms for massive graphs. In Proceedings of the 7th Annual Conference on Theory and Applications of Models of Computation, volume 6108, pages 2–14, 2010. doi:10.1007/978-3-642-13562-0_2.
  • [43] Mikkel Thorup, Hanzhi Wang, Zhewei Wei, and Mingji Yang. PageRank centrality in directed graphs with bounded in-degree. In Proceedings of the 2026 ACM-SIAM Symposium on Discrete Algorithms, 2026. To appear. arXiv preprint at https://arxiv.org/abs/2508.01257. doi:10.48550/arXiv.2508.01257.
  • [44] Nisheeth K. Vishnoi. L𝐱=𝐛. Foundations and Trends in Theoretical Computer Science, 8(1-2):1–141, 2013. doi:10.1561/0400000054.
  • [45] Hanzhi Wang. Revisiting local PageRank estimation on undirected graphs: Simple and optimal. In Proceedings of the 30th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 3036–3044, 2024. doi:10.1145/3637528.3671820.
  • [46] Hanzhi Wang and Zhewei Wei. Estimating single-node PageRank in O~(min{dt,m}) time. Proceedings of the VLDB Endowment, 16(11):2949–2961, 2023. doi:10.14778/3611479.3611500.
  • [47] Hanzhi Wang, Zhewei Wei, Ji-Rong Wen, and Mingji Yang. Revisiting local computation of PageRank: Simple and optimal. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, pages 911–922, 2024. doi:10.1145/3618260.3649661.
  • [48] Wolfgang R. Wasow. A note on the inversion of matrices by random walks. Mathematical Tables and Other Aids to Computation, 6(38):78–81, 1952. doi:10.2307/2002546.
  • [49] Zhewei Wei, Ji-Rong Wen, and Mingji Yang. Approximating single-source Personalized PageRank with absolute error guarantees. In Proceedings of the 27th International Conference on Database Theory, volume 290, pages 9:1–9:19, 2024. doi:10.4230/LIPIcs.ICDT.2024.9.
  • [50] Mingji Yang, Hanzhi Wang, Zhewei Wei, Sibo Wang, and Ji-Rong Wen. Efficient algorithms for Personalized PageRank computation: a survey. IEEE Transactions on Knowledge and Data Engineering, 36(9):4582–4602, 2024. doi:10.1109/TKDE.2024.3376000.
  • [51] Renchi Yang and Jing Tang. Efficient estimation of pairwise effective resistance. Proceedings of the ACM International Conference on Management of Data, 1(1):16:1–16:27, 2023. doi:10.1145/3588696.
  • [52] Yichun Yang, Rong-Hua Li, Meihao Liao, and Guoren Wang. Improved algorithms for effective resistance computation on graphs. In Proceedings of the 38th Conference on Learning Theory, volume 291, pages 5892–5920, 2025. URL: https://proceedings.mlr.press/v291/yichun25a.html.