On Solving Asymmetric Diagonally Dominant Linear Systems in Sublinear Time

Kwok, Tsz Chiu; Wei, Zhewei; Yang, Mingji

doi:10.4230/LIPIcs.ITCS.2026.89

On Solving Asymmetric Diagonally Dominant Linear Systems in Sublinear Time

Tsz Chiu Kwok

Shanghai University of Finance and Economics, China Zhewei Wei

Renmin University of China, Beijing, China Mingji Yang

Renmin University of China, Beijing, China

Abstract

We initiate a study of solving a row/column diagonally dominant (RDD/CDD) linear system $\mathbf{M}\boldsymbol{x}=\boldsymbol{b}$ in sublinear time, with the goal of estimating $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}$ for a given vector $\boldsymbol{t}\in\mathbb{R}^{n}$ and a specific solution $\boldsymbol{x}^{\ast}$ . This setting naturally generalizes the study of sublinear-time solvers for symmetric diagonally dominant (SDD) systems [Andoni-Krauthgamer-Pogrow, ITCS 2019] to the asymmetric case, which has remained underexplored despite extensive work on nearly-linear-time solvers for RDD/CDD systems.

Our first contributions are characterizations of the problem’s mathematical structure. We express a solution $\boldsymbol{x}^{\ast}$ via a Neumann series, prove its convergence, and upper bound the truncation error on this series through a novel quantity of $\mathbf{M}$ , termed the maximum $p$ -norm gap. This quantity generalizes the spectral gap of symmetric matrices and captures how the structure of $\mathbf{M}$ governs the problem’s computational difficulty.

For systems with bounded maximum $p$ -norm gap, we develop a collection of algorithmic results for locally approximating $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}$ under various scenarios and error measures. We derive these results by adapting the techniques of random-walk sampling, local push, and their bidirectional combination, which have proved powerful for special cases of solving RDD/CDD systems, particularly estimating PageRank and effective resistance on graphs. Our general framework yields deeper insights, extended results, and improved complexity bounds for these problems. Notably, our perspective provides a unified understanding of Forward Push and Backward Push, two fundamental approaches for estimating random-walk probabilities on graphs.

Our framework also inherits the hardness results for sublinear-time SDD solvers and local PageRank computation, establishing lower bounds on the maximum $p$ -norm gap or the accuracy parameter. We hope that our work opens the door for further study into sublinear solvers, local graph algorithms, and directed spectral graph theory.

Keywords and phrases:

Spectral Graph Theory, Linear Systems, Sublinear Algorithms

Copyright and License:

2012 ACM Subject Classification:

Mathematics of computing

\rightarrow

Spectra of graphs ; Theory of computation

\rightarrow

Graph algorithms analysis ; Theory of computation

\rightarrow

Streaming, sublinear and near linear time algorithms

Related Version:

Full Version: https://arxiv.org/abs/2509.13891 [28]

Acknowledgements:

We thank the anonymous reviewers for their valuable comments. Mingji Yang wishes to thank Prof. Shang-Hua Teng for inspiring discussions and encouragement, and Guanyu Cui for helpful discussions on effective resistance.

Funding:

This research was supported by National Natural Science Foundation of China (No. 92470128, No. U2241212).

DOI:

10.4230/LIPIcs.ITCS.2026.89

Event:

17th Innovations in Theoretical Computer Science Conference (ITCS 2026)

Editor:

Shubhangi Saraf

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

Solving systems of linear equations is one of the most fundamental problems in numerical linear algebra and theoretical computer science. In the classic version of this problem, we are given a matrix $\mathbf{M}\in\mathbb{R}^{n\times n}$ and a vector $\boldsymbol{b}\in\mathbb{R}^{n}$ in $\operatorname{range}(\mathbf{M})$ , and the goal is to compute a solution vector $\boldsymbol{x}\in\mathbb{R}^{n}$ satisfying $\mathbf{M}\boldsymbol{x}=\boldsymbol{b}$ . Beyond general-purpose solvers for arbitrary linear systems (e.g., using fast matrix multiplication or the conjugate gradient method), extensive research has focused on developing efficient solvers for special classes of systems.

In particular, for the important classes of Laplacian and symmetric diagonally dominant (SDD) systems, the breakthrough work of Spielman and Teng [39, 40] established the first nearly-linear-time (in $\operatorname{nnz}(\mathbf{M})$ ) solvers. This gave rise to the influential Laplacian Paradigm, which revolutionized algorithmic graph theory and numerical linear algebra, with widespread applications ranging from network science to machine learning (see, e.g., [42, 44]). Among subsequent efforts to generalize the SDD solvers, a line of work [13, 12, 11] developed nearly-linear-time solvers for asymmetric row/column diagonally dominant (RDD/CDD) systems, which significantly expanded the scope of the Laplacian Paradigm.

On the other hand, partly motivated by the advances in quantum algorithms for solving linear systems in sublinear time [24], Andoni, Krauthgamer, and Pogrow [4] pioneered the study of classical algorithms for approximately solving a single entry of SDD systems in sublinear time. Under specific access models and error measures, they established:

$\blacksquare$

a $\operatorname{polylog}(n)$ -time solver for well-conditioned SDD systems; ¹¹1In fact, [4] considers the (effective) condition number of a normalized version of the involved SDD matrix. See Section 1.3 for details.
$\blacksquare$

a $\widetilde{\Omega}\left(\kappa^{2}\right)$ lower bound for general SDD systems, where $\kappa$ is the condition number of $\mathbf{M}$ .

The second result demonstrates the necessity of a quadratic dependence on the condition number (equivalently, the reciprocal of the spectral gap) for sublinear-time SDD solvers.

In light of the previous research for nearly-linear-time RDD/CDD solvers and sublinear-time SDD solvers, it is natural to ask whether the sublinear-time SDD solvers can be extended to the more general RDD/CDD cases. In this paper, we initiate a study in this direction and give partial positive answers to this question. We show that the sublinear-time solvers for well-conditioned SDD systems can be extended to “well-structured” RDD/CDD systems, provided that we first define an appropriate generalization of the key structural quantity, the spectral gap for symmetric matrices, to asymmetric matrices. We achieve this by re-characterizing the problem’s mathematical structure and introducing a new concept called the maximum $p$ -norm gap.

Algorithmically, the sublinear SDD solver in [4] works by solely generating random-walk samplings based on $\mathbf{M}$ to approximate a truncated Neumann series of the solution. In contrast, we conduct a deeper investigation of the complexity upper bounds by applying two techniques in addition to random-walk sampling: the local push method, which performs local exploration in $\mathbf{M}$ ; and the bidirectional method, which integrates random-walk sampling with local push. Together, we derive a suite of upper bounds for solving RDD/CDD systems under diverse access models and error measures. For instance, we extend the algorithmic result in [4] to RDD systems and derive new results for RCDD systems with smaller dependence on some parameters.

Our algorithmic toolkit and investigation of the diverse upper bounds are inspired by recent advances in local algorithms for estimating PageRank [9] and effective resistance [17] on graphs [50, 49, 47, 45, 6, 43, 14, 52], which are important special cases of solving RDD/CDD systems. The techniques of random-walk sampling [39, 19], local push [2, 1], and the bidirectional method [32] have been extensively studied for these problems, and recent works have further uncovered their new properties and optimality in certain settings. Nonetheless, previous works typically analyze their applications to PageRank and effective resistance computation separately. Our perspective of formulating these problems as solving linear systems, however, provides a more general and unified framework for understanding these techniques and problems, revealing their deeper connections. As we shall see, this bigger picture yields novel insights, extended results, and improved complexity bounds.

Notably, our perspective reveals a connection between two fundamental local push algorithms on graphs, namely ForwardPush [2] and BackwardPush [1]²²2Their original names are ApproximatePageRank and ApproxContributions, respectively.. These algorithms iteratively perform local push operations to explore the graph in opposite directions. Although both algorithms share similar approaches, they have been treated as distinct methods for different problems, with each analyzed separately. In contrast, by abstracting both methods as a single algebraic primitive, we demonstrate that ForwardPush and BackwardPush are equivalent to applying this primitive to different linear systems. This characterization helps to explain their distinct properties and enables unified analysis of both approaches.

On the lower-bound side, our framework inherits the hardness result for sublinear-time SDD solvers, establishing the necessity of our assumption on the maximum $p$ -norm gap; also, known lower bounds for local PageRank computation imply lower bounds on the accuracy parameter for our setting. As our work bridges the study of sublinear-time solvers and local graph algorithms, we believe that further investigation could uncover more connections and results for these topics.

In the remainder of this section, we formally define the problem, present our main contributions, and provide a technical overview.

1.1 Basic Notations

For $n\in\mathbb{Z}^{+}$ , we define $[n]:=\{1,2,\dots,n\}$ . We call a matrix $\mathbf{M}\in\mathbb{R}^{n\times n}$ RDD (row diagonally dominant) if it satisfies $\mathbf{M}(j,j)\geq\sum_{k\neq j}\big|\mathbf{M}(j,k)\big|$ for all $j\in[n]$ and its diagonal entries are positive. We call a matrix CDD (column diagonally dominant) if its transpose is RDD, and call a matrix SDD (symmetric diagonally dominant) if it is symmetric and RDD. It is well-known that any SDD matrix is PSD (positive semidefinite). We call a square matrix Z-matrix if its off-diagonal entries are nonpositive. We use $\boldsymbol{e}_{k}$ to denote the $k$ -th canonical unit vector and $\boldsymbol{1}$ to denote the all-one vector. For any matrix or vector, we use $|\cdot|$ to denote taking the entrywise absolute value.

We call two real numbers $p,q>1$ Hölder conjugates (or $q$ is conjugate to $p$ ) if they satisfy $1/p+1/q=1$ . By convention, we also formally let $1/\infty:=0$ and view $\infty$ and $1$ as Hölder conjugates. For any $p\in[1,\infty]$ , we use $\|\boldsymbol{x}\|_{p}$ to denote the $p$ -norm of a vector $\boldsymbol{x}$ , and $\|\mathbf{M}\|_{p}$ to denote the matrix norm induced by vector $p$ -norm of a matrix $\mathbf{M}\in\mathbb{R}^{n\times n}$ .

Restriction and Pseudoinverse.

For a subspace $U\subseteq\mathbb{R}^{n}$ , we use $\mathbf{M}|_{U}$ to denote the restriction of the linear map $\mathbf{M}$ to $U$ , with induced norm $\left\|\mathbf{M}|_{U}\right\|_{p}:=\max_{\boldsymbol{x}\in U,\|\boldsymbol{x}% \|_{p}=1}\|\mathbf{M}\boldsymbol{x}\|_{p}$ for any $p\in[1,\infty]$ . We write the pseudoinverse (a.k.a. Moore–Penrose inverse) of $\mathbf{M}$ as $\mathbf{M}^{+}$ .

Spectral Gap.

For an SDD matrix $\mathbf{S}\in\mathbb{R}^{n\times n}$ , we define its spectral gap $\gamma(\mathbf{S})$ as half the smallest nonzero eigenvalue of $\widetilde{\mathbf{S}}:=\mathbf{D}_{\mathbf{S}}^{-1/2}\mathbf{S}\mathbf{D}_{% \mathbf{S}}^{-1/2}$ , where $\mathbf{D}_{\mathbf{S}}$ is the diagonal matrix that satisfies $\mathbf{D}(k,k)=\mathbf{S}(k,k)$ for each $k\in[n]$ . The (effective) condition number of $\mathbf{S}$ , denoted by $\kappa(\mathbf{S})$ , is defined as the ratio between the largest and smallest nonzero eigenvalues of $\mathbf{S}$ . It holds that $\kappa(\widetilde{\mathbf{S}})=\Theta\big(1/\gamma(\mathbf{S})\big)$ .

Graphs.

We consider directed graphs $G=(V,E)$ , with $n:=|V|$ and $m:=|E|$ . We assume that $V=[n]$ . If $(u,v)\in E$ , we write $u\to v$ . We denote the (possibly weighted) adjacency matrix as $\mathbf{A}_{G}\in\mathbb{R}^{n\times n}$ . For each $v\in V$ , we define its indegree $d^{-}_{G}(v):=\sum_{u\to v}\mathbf{A}_{G}(u,v)$ and outdegree $d^{+}_{G}(v):=\sum_{v\to u}\mathbf{A}_{G}(v,u)$ . The outdegree matrix $\mathbf{D}_{G}\in\mathbb{R}^{n\times n}$ is the diagonal matrix with $\mathbf{D}_{G}(v,v)=d^{+}_{G}(v)$ for each $v\in V$ . We denote $G$ ’s minimum and maximum outdegree as $\delta^{+}_{G}:=\min_{v\in V}\big\{d^{+}_{G}(v)\big\}$ and $\Delta^{+}_{G}:=\max_{v\in V}\big\{d^{+}_{G}(v)\big\}$ , respectively.

Eulerian Graphs and Laplacian.

We call a graph Eulerian if $d^{-}_{G}(v)=d^{+}_{G}(v)$ for all $v\in V$ . On Eulerian graphs, we simply write $d_{G}(v):=d^{+}_{G}(v)$ , $\delta_{G}:=\delta^{+}_{G}$ , and $\Delta_{G}:=\Delta^{+}_{G}$ . Undirected graphs constitute a special case of Eulerian graphs where each edge corresponds to two directed edges in opposite directions. The directed Laplacian matrix is defined as $\mathbf{L}_{G}:=\mathbf{D}_{G}-\mathbf{A}_{G}^{\top}$ , which satisfies $\boldsymbol{1}^{\top}\mathbf{L}_{G}=\boldsymbol{0}^{\top}$ and is CDDZ. $\mathbf{L}_{G}$ is RCDDZ for Eulerian graphs and is SDDZ for undirected graphs.

1.2 Problem Formulation

We consider a linear system $\mathbf{M}\boldsymbol{x}=\boldsymbol{b}$ , where $\mathbf{M}\in\mathbb{R}^{n\times n}$ is an RDD/CDD matrix and $\boldsymbol{b}\in\operatorname{range}(\mathbf{M})$ , and a coefficient vector $\boldsymbol{t}\in\mathbb{R}^{n}$ . We assume that $\boldsymbol{b},\boldsymbol{t}\neq\boldsymbol{0}$ and all nonzero entries in $\mathbf{M}$ , $\boldsymbol{b}$ , and $\boldsymbol{t}$ have absolute values in $\big[1/\operatorname{poly}(n),\operatorname{poly}(n)\big]$ . We assume that the algorithms are given the dimension $n$ and have oracle access to $\mathbf{M}$ , $\boldsymbol{b}$ , and $\boldsymbol{t}$ via the following basic queries:

$\blacksquare$

Diagonal queries for $\mathbf{M}$ : return $\mathbf{M}(k,k)$ in $O(1)$ time for a given index $k\in[n]$ ;
$\blacksquare$

Row/column queries for $\mathbf{M}$ : return the indices and corresponding values of nonzero entries for a specified row/column of $\mathbf{M}$ , in time linear in the number of returned indices;
$\blacksquare$

Entrywise queries for $\boldsymbol{b}$ and $\boldsymbol{t}$ : return $\boldsymbol{b}(k)$ or $\boldsymbol{t}(k)$ in $O(1)$ time for a given index $k\in[n]$ .

Our results will assume additional access operations, which will be specified in the statements.

Following the concept of local computation algorithms [36] and the previous work [4], we consider a fixed solution $\boldsymbol{x}^{\ast}$ that is determined by $\mathbf{M}$ and $\boldsymbol{b}$ , and require invoking the algorithm with different $\boldsymbol{t}$ and accuracy parameters returns estimates of $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}$ that are all consistent with the “global” solution $\boldsymbol{x}^{\ast}$ . Our choice of $\boldsymbol{x}^{\ast}$ will be given in Theorem 1.

We also consider the problems of computing (Personalized) PageRank [9] and effective resistance [17] on graphs, viewing them as special cases of our formulation of solving RDD/CDD systems. For these graph problems, we assume the standard adjacency-list model [22], where each degree query takes $O(1)$ time and each neighbor query returns a neighbor index along with the edge weight in $O(1)$ time.

For Personalized PageRank (PPR), we consider a directed graph $G$ , a decay factor $\alpha\in(0,1)$ , and a source distribution $\boldsymbol{s}\in\big\{\boldsymbol{y}\in\mathbb{R}^{n}_{\geq 0}:\|\boldsymbol{% y}\|_{1}=1\big\}$ . To ensure that PPR is well-defined, we assume that $\delta^{+}_{G}>0$ . The PPR vector $\boldsymbol{\pi}_{G,\alpha,\boldsymbol{s}}$ is defined as the unique solution to the following two equivalent forms of the PPR equation:

	$\displaystyle\left(\mathbf{I}-(1-\alpha)\mathbf{A}_{G}^{\top}\mathbf{D}_{G}^{-% 1}\right)\boldsymbol{\pi}_{G,\alpha,\boldsymbol{s}}=\alpha\boldsymbol{s},$		(1)
	$\displaystyle\left(\mathbf{D}_{G}-(1-\alpha)\mathbf{A}_{G}^{\top}\right)\left(% \mathbf{D}_{G}^{-1}\boldsymbol{\pi}_{G,\alpha,\boldsymbol{s}}\right)=\alpha% \boldsymbol{s}.$		(2)

Both equations can be viewed as linear systems of the form $\mathbf{M}\boldsymbol{x}=\boldsymbol{b}$ , where the coefficient matrices $\mathbf{I}-(1-\alpha)\mathbf{A}_{G}^{\top}\mathbf{D}_{G}^{-1}$ and $\mathbf{D}_{G}-(1-\alpha)\mathbf{A}_{G}^{\top}$ are both CDDZ and invertible. Note that for the second form, the solution to the corresponding system is $\mathbf{D}_{G}^{-1}\boldsymbol{\pi}_{G,\alpha,\boldsymbol{s}}$ , an outdegree-scaled version of the PPR vector. We define the PPR value from $s$ to $t$ as $\boldsymbol{\pi}_{G,\alpha}(s,t):=\boldsymbol{\pi}_{G,\alpha,\boldsymbol{e}_{s% }}(t)$ , and the PageRank vector $\boldsymbol{\pi}_{G,\alpha}$ as $\boldsymbol{\pi}_{G,\alpha}:=\boldsymbol{\pi}_{G,\alpha,1/n\cdot\boldsymbol{1}}$ . It holds that $\boldsymbol{\pi}_{G,\alpha}(t)=\frac{1}{n}\sum_{s\in V}\boldsymbol{\pi}_{G,% \alpha}(s,t)$ for all $t\in V$ . We also consider the following two equivalent forms of the PageRank contribution equation:

	$\displaystyle\big(\mathbf{I}-(1-\alpha)\mathbf{D}_{G}^{-1}\mathbf{A}_{G}\big)% \boldsymbol{\pi}^{-1}_{G,\alpha,t}=\alpha\boldsymbol{e}_{t},$		(3)
	$\displaystyle\big(\mathbf{D}_{G}-(1-\alpha)\mathbf{A}_{G}\big)\boldsymbol{\pi}% ^{-1}_{G,\alpha,t}=\alpha\mathbf{D}_{G}\boldsymbol{e}_{t},$		(4)

where $t\in V$ is a specified target node and $\boldsymbol{\pi}^{-1}_{G,\alpha,t}$ is called the PageRank contribution vector to $t$ . It holds that $\boldsymbol{\pi}^{-1}_{G,\alpha,t}(s)=\boldsymbol{\pi}_{G,\alpha}(s,t)$ for all $s\in V$ . Similarly, both equations can be viewed as linear systems of the form $\mathbf{M}\boldsymbol{x}=\boldsymbol{b}$ , where the coefficient matrices $\mathbf{I}-(1-\alpha)\mathbf{D}_{G}^{-1}\mathbf{A}_{G}$ and $\mathbf{D}_{G}-(1-\alpha)\mathbf{A}_{G}$ are both RDDZ and invertible. Note that for the second form, the corresponding vector $\boldsymbol{b}$ is $\alpha\mathbf{D}_{G}\boldsymbol{e}_{t}$ .

For effective resistance, we consider a connected undirected graph $G$ and two distinct nodes $s,t\in V$ . The effective resistance (a.k.a. resistance distance) between $s$ and $t$ , denoted by $R_{G}(s,t)$ , is defined as the equivalent resistance between $s, t$ if the graph is thought of as an electrical network with each edge $(u,v)\in E$ having resistance $1/\mathbf{A}_{G}(u,v)$ . Algebraically, $R_{G}(s,t)=(\boldsymbol{e}_{s}-\boldsymbol{e}_{t})^{\top}\mathbf{L}_{G}^{+}(% \boldsymbol{e}_{s}-\boldsymbol{e}_{t})$ . As we will establish in Lemma 27, setting $\mathbf{M}=\mathbf{L}_{G}$ and $\boldsymbol{b}=\boldsymbol{t}=\boldsymbol{e}_{s}-\boldsymbol{e}_{t}$ in our formulation yields $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}=R_{G}(s,t)$ . Here, $\mathbf{M}=\mathbf{L}_{G}$ is SDDZ and $\boldsymbol{b}=\boldsymbol{e}_{s}-\boldsymbol{e}_{t}\in\operatorname{range}(% \mathbf{M})$ .

1.3 Previous Work for SDD Systems

[4] studies the case when the linear system is $\mathbf{S}\boldsymbol{x}=\boldsymbol{b}$ for some SDD matrix $\mathbf{S}$ . Define $\mathbf{D}_{\mathbf{S}}$ as the diagonal matrix that satisfies $\mathbf{D}(k,k)=\mathbf{S}(k,k)$ for each $k\in[n]$ and $\widetilde{\mathbf{S}}:=\mathbf{D}_{\mathbf{S}}^{-1/2}\mathbf{S}\mathbf{D}_{% \mathbf{S}}^{-1/2}$ . They formulate the fixed solution as $\boldsymbol{x}^{\ast}:=\mathbf{D}_{\mathbf{S}}^{-1/2}\widetilde{\mathbf{S}}^{+% }\mathbf{D}_{\mathbf{S}}^{-1/2}\boldsymbol{b}$ and give a Neumann series expansion of $\boldsymbol{x}^{\ast}$ . For the algorithmic results, they assume that the algorithm is given $\kappa$ , an upper bound on $\kappa(\widetilde{\mathbf{S}})$ (alternatively, $\gamma$ as a lower bound on $\gamma(\mathbf{S})$ ) and set a truncation parameter for the Neumann series as $L:=\Theta\left(\kappa\log\big(\kappa\cdot\kappa(\mathbf{D}_{\mathbf{S}})\|% \boldsymbol{b}\|_{0}/\varepsilon\big)\right)=\widetilde{\Theta}(\kappa)$ .

Based on the truncated Neumann series, they present a randomized algorithm that, given a coordinate $t\in[n]$ , computes an estimate $\hat{x}_{t}$ satisfying $\big|\hat{x}_{t}-\boldsymbol{x}^{\ast}(t)\big|\leq\varepsilon\left\|\mathbf{D}% _{\mathbf{S}}^{-1}\boldsymbol{b}\right\|_{\infty}$ with probability at least $3/4$ . The algorithm runs in time $O\left(f(\mathbf{S})L^{3}\log L/\varepsilon^{2}\right)=\widetilde{O}\left(f(% \mathbf{S})\kappa^{3}\varepsilon^{-2}\right)=\widetilde{O}\left(f(\mathbf{S})% \gamma^{-3}\varepsilon^{-2}\right)$ , where $f(\mathbf{S})$ is the maximum time cost to simulate one step in the random walk defined by $\mathbf{S}$ . This result is implicit in the proof of [3, Theorem 5.1].

On the negative side, [4] proves an $\Omega\left(\kappa(\mathbf{S})^{2}/\log^{3}n\right)=\widetilde{\Omega}\left(% \kappa(\mathbf{S})^{2}\right)$ query lower bound (in terms of probing $\boldsymbol{b}$ ) for achieving a weaker absolute error bound of $\varepsilon\left\|\boldsymbol{x}^{\ast}\right\|_{\infty}$ , for $\kappa(\mathbf{S})=O(\sqrt{n}/\log n)$ and $\varepsilon=\Theta(1/\log n)$ . The matrix $\mathbf{S}$ in their hard instance is a Laplacian matrix of a fixed unweighted undirected graph with maximum degree $4$ and thus satisfies $\kappa(\widetilde{\mathbf{S}})=\Theta\big(\kappa(\mathbf{S})\big)$ . Therefore, this lower bound can also be written as $\widetilde{\Omega}\left(1/\gamma(\mathbf{S})^{2}\right)$ . To our knowledge, no other work has explicitly studied sublinear-time SDD/RDD/CDD solvers.

1.4 Formulation of $\boldsymbol{x}^{\ast}$ and the $𝒑$ -Norm Gaps

Our first contribution is a Neumann-series-based characterization of a solution $\boldsymbol{x}^{\ast}$ to $\mathbf{M}\boldsymbol{x}=\boldsymbol{b}$ , which is consistent with the solution considered by [4] for SDD systems.

We decompose $\mathbf{M}$ uniquely as $\mathbf{M}=\mathbf{D}_{\mathbf{M}}-\mathbf{A}_{\mathbf{M}}^{\top}$ , where $\mathbf{D}_{\mathbf{M}}$ is a diagonal matrix and all diagonal entries of $\mathbf{A}_{\mathbf{M}}^{\top}$ are $0$ . Define $\widetilde{\mathbf{M}}:=\mathbf{D}_{\mathbf{M}}^{-1/2}\mathbf{M}\mathbf{D}_{% \mathbf{M}}^{-1/2}$ and $\widetilde{\mathbf{A}}_{\mathbf{M}}^{\top}:=\mathbf{D}_{\mathbf{M}}^{-1/2}% \mathbf{A}_{\mathbf{M}}^{\top}\mathbf{D}_{\mathbf{M}}^{-1/2}$ .

The next theorem gives the definition of $\boldsymbol{x}^{\ast}$ and its properties.

Theorem 1.

For any RDD/CDD $\mathbf{M}$ and $\boldsymbol{b}\in\operatorname{range}(\mathbf{M})$ , define $\boldsymbol{x}^{\ast}$ to be

\displaystyle\boldsymbol{x}^{\ast}:=\frac{1}{2}\sum_{\ell=0}^{\infty}\left(% \frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}% }^{\top}\right)\right)^{\ell}\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}.

Then $\boldsymbol{x}^{\ast}$ is well-defined and satisfies $\mathbf{M}\boldsymbol{x}^{\ast}=\boldsymbol{b}$ . If $\mathbf{M}$ is SDD, then $\boldsymbol{x}^{\ast}=\mathbf{D}_{\mathbf{M}}^{-1/2}\widetilde{\mathbf{M}}^{+}% \mathbf{D}_{\mathbf{M}}^{-1/2}\boldsymbol{b}$ .

Next, we study a truncated version of $\boldsymbol{x}^{\ast}$ and upper bound the truncation error. As previous analysis based on eigendecomposition is not directly applicable to asymmetric matrices, we introduce a novel concept called the $p$ -norm gap of $\mathbf{M}$ : for any $p\in[1,\infty]$ , we define the $p$ -norm gap of $\mathbf{M}$ as

\displaystyle\gamma_{p}(\mathbf{M}):=1-\left\|\left.\frac{1}{2}\left(\mathbf{I% }+\mathbf{D}_{\mathbf{M}}^{-1/q}\mathbf{A}_{\mathbf{M}}^{\top}\mathbf{D}_{% \mathbf{M}}^{-1/p}\right)\right|_{\operatorname{range}\left(\mathbf{I}-\mathbf% {D}_{\mathbf{M}}^{-1/q}\mathbf{A}_{\mathbf{M}}^{\top}\mathbf{D}_{\mathbf{M}}^{% -1/p}\right)}\right\|_{p},

where $q$ is conjugate to $p$ . We further define the maximum $p$ -norm gap of $\mathbf{M}$ as $\gamma_{\max}(\mathbf{M}):=\max_{p\in[1,\infty]}\gamma_{p}(\mathbf{M})$ .³³3One could alternatively define $\gamma_{\max}(\mathbf{M}):=\max_{p\in\{1,2,\infty\}}\gamma_{p}(\mathbf{M})$ , which yields a possibly smaller quantity but our main results continue to hold with this definition. To our knowledge, no prior work has explicitly studied these quantities.

The following theorem guarantees that for any RDD/CDD matrix $\mathbf{M}$ , the maximum $p$ -norm gap $\gamma_{\max}(\mathbf{M})$ lies in $(0,1]$ , and when $\mathbf{M}$ is SDD, $\gamma_{\max}(\mathbf{M})$ coincides with the spectral gap $\gamma(\mathbf{M})$ . Thus, it is a natural generalization of the spectral gap to asymmetric matrices.

Theorem 2.

If $\mathbf{M}$ is RDD/CDD, then $0<\gamma_{\max}(\mathbf{M})\leq 1$ ; if $\mathbf{M}$ is SDD, then $\gamma_{\max}(\mathbf{M})=\gamma_{2}(\mathbf{M})=\gamma(\mathbf{M})$ .

We will devise algorithms to estimate the truncated version of $\boldsymbol{x}^{\ast}$ , defined as

\displaystyle\boldsymbol{x}^{\ast}_{L}:=\frac{1}{2}\sum_{\ell=0}^{L-1}\left(% \frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}% }^{\top}\right)\right)^{\ell}\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}

(5)

for an integer truncation parameter $L$ . The next theorem upper bounds the truncation error in terms of a given lower bound on the maximum $p$ -norm gap $\gamma_{\max}(\mathbf{M})$ .

Theorem 3.

Suppose $0<\gamma\leq\gamma_{\max}(\mathbf{M})$ . To ensure that $\left|\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}-\boldsymbol{t}^{\top}% \boldsymbol{x}^{\ast}\right|\leq\frac{1}{2}\varepsilon$ , it suffices to set

L:=\Theta\left(\frac{1}{\gamma}\log\left(\frac{1}{\gamma\varepsilon}\cdot d_{% \max}(\mathbf{M})\|\boldsymbol{t}\|_{0}\left\|\mathbf{D}_{\mathbf{M}}^{-1}% \boldsymbol{t}\right\|_{\infty}\|\boldsymbol{b}\|_{0}\left\|\mathbf{D}_{% \mathbf{M}}^{-1}\boldsymbol{b}\right\|_{\infty}\right)\right)=\widetilde{% \Theta}\left(\frac{1}{\gamma}\right),

(6)

where $d_{\max}(\mathbf{M})$ denotes the largest diagonal entry in $\mathbf{M}$ .⁴⁴4Here and after, we use $\widetilde{\Theta}$ and $\widetilde{O}$ to hide polylogarithmic factors in $n$ , $1/\gamma$ , $1/\varepsilon$ , and (reciprocals of) quantities in $\mathbf{M}$ , $\boldsymbol{b}$ , and $\boldsymbol{t}$ .

Relationship with the Formulation in [4].

Our formulations of $\boldsymbol{x}^{\ast}$ and the maximum $p$ -norm gap generalize the ones in [4], in the sense that when $\mathbf{M}$ is SDD, $\boldsymbol{x}^{\ast}=\mathbf{D}_{\mathbf{M}}^{-1/2}\widetilde{\mathbf{M}}^{+}% \mathbf{D}_{\mathbf{M}}^{-1/2}\boldsymbol{b}$ matches their solution and $\gamma_{\max}(\mathbf{M})$ equals $\gamma(\mathbf{M})$ , the spectral gap of $\mathbf{M}$ . Therefore, the quadratic lower bound on $1/\gamma(\mathbf{M})$ for SDD systems in that paper translates into a quadratic lower bound on $1/\gamma_{\max}(\mathbf{M})$ for general RDD/CDD systems. Also, our setting of the truncation parameter $L$ in Equation (6) matches theirs when $\mathbf{M}$ is SDD and $\boldsymbol{t}$ is a canonical unit vector.

1.5 Main Algorithmic Results

For our algorithmic results, we assume that the algorithm is given a quantity $\gamma>0$ as a lower bound on $\gamma_{\max}(\mathbf{M})$ and an accuracy parameter $\varepsilon>0$ . We use $\gamma$ , $\varepsilon$ , and the specific terms in the accuracy guarantee to set the truncation parameter $L$ according to Equation (6), where we assume that suitable upper bounds on the quantities in the logarithmic factor are known.

The statements of our results will use the following definition. For RDD $\mathbf{M}$ , we use $f_{\mathrm{row}}(\mathbf{M})$ to denote the maximum time cost to simulate a single step in the random walk defined by the row substochastic matrix $\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\left|\mathbf{A}_{% \mathbf{M}}^{\top}\right|\right)$ . This quantity depends on the structure and representation of $\mathbf{M}$ . For instance, if each row of $\mathbf{M}$ has at most $d$ nonzero entries, then $f_{\mathrm{row}}(\mathbf{M})=O(d)$ ; if the nonzero entries in $\mathbf{A}_{\mathbf{M}}$ have equal absolute values and we are allowed to sample a uniformly random index of the nonzero entries in each column of $\mathbf{A}_{\mathbf{M}}$ in $O(1)$ time, then $f_{\mathrm{row}}(\mathbf{M})=O(1)$ . For CDD $\mathbf{M}$ , we define $f_{\mathrm{col}}(\mathbf{M}):=f_{\mathrm{row}}\left(\mathbf{M}^{\top}\right)$ .

Each of our algorithmic results excels under different system types, access models, and parameter regimes. We present a selection of our results here, with additional results given in the full version of this paper [28]. Furthermore, a remark at the end of this subsection establishes that most of our results have “symmetric” counterparts (e.g., for CDD systems) that can be obtained by exchanging the roles of certain quantities.

We first present our results of using random-walk sampling for RDD systems.

Theorem 4.

Suppose that $\mathbf{M}$ is RDD, we can sample from the distribution $|\boldsymbol{t}|/\|\boldsymbol{t}\|_{1}$ in $O(1)$ time, and $\|\boldsymbol{t}\|_{1}$ is known. Then there exists a randomized algorithm that computes an estimate $\hat{x}$ such that $\Pr\left\{\left|\hat{x}-\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}\right|\leq% \varepsilon\left\|\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}\right\|_{\infty}% \right\}\geq\frac{3}{4}$ in time $O\left(f_{\mathrm{row}}(\mathbf{M})\|\boldsymbol{t}\|_{1}^{2}L^{3}\varepsilon^% {-2}\right)=\widetilde{O}\left(f_{\mathrm{row}}(\mathbf{M})\|\boldsymbol{t}\|_% {1}^{2}\gamma^{-3}\varepsilon^{-2}\right)$ .

This theorem subsumes the main algorithmic result of [4] for SDD systems and extends it to the RDD case while achieving a mild $(\log L)$ -factor improvement. Their original result is recovered as a special case when $\mathbf{M}$ is SDD and $\boldsymbol{t}$ is a canonical unit vector. Our algorithm is similar to theirs, but we achieve the improved complexity by adopting a different random-walk sampling scheme.

We also show that for RDDZ $\mathbf{M}$ along with nonnegative vectors $\boldsymbol{b}$ and $\boldsymbol{t}$ , if we allow the relaxed error bound $\varepsilon\|\boldsymbol{x}^{\ast}\|_{\infty}$ , the complexity can be improved to depend quadratically on $L$ via a variance analysis of the random-walk sampling process. We adopt this error measure since it provides a natural accuracy guarantee and has been previously studied in [4]. It is shown in [4] that $\left\|\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}\right\|_{\infty}\leq 2\|% \boldsymbol{x}^{\ast}\|_{\infty}$ for RDD systems.

Theorem 5.

Suppose that $\mathbf{M}$ is RDDZ, $\boldsymbol{b},\boldsymbol{t}\geq\boldsymbol{0}$ , we can sample from the distribution $\boldsymbol{t}/\|\boldsymbol{t}\|_{1}$ in $O(1)$ time, and $\|\boldsymbol{t}\|_{1}$ is known. Then there exists a randomized algorithm that computes a $\hat{x}$ such that $\Pr\left\{\left|\hat{x}-\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}\right|\leq% \varepsilon\left\|\boldsymbol{x}^{\ast}\right\|_{\infty}\right\}\geq\frac{3}{4}$ in time $O\left(f_{\mathrm{row}}(\mathbf{M})\|\boldsymbol{t}\|_{1}^{2}L^{2}\varepsilon^% {-2}\right)=\widetilde{O}\left(f_{\mathrm{row}}(\mathbf{M})\|\boldsymbol{t}\|_% {1}^{2}\gamma^{-2}\varepsilon^{-2}\right)$ .

The following theorem further considers the relative error guarantee of $\varepsilon\cdot\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}$ . The complexity depends quadratically on $L$ and linearly on $\|\boldsymbol{t}\|_{1}\left\|\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}\right% \|_{\infty}\big/\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}$ and is also achieved using random-walk sampling.

Theorem 6.

Suppose that $\mathbf{M}$ is RDDZ, $\boldsymbol{b},\boldsymbol{t}\geq\boldsymbol{0}$ , we can sample from the distribution $\boldsymbol{t}/\|\boldsymbol{t}\|_{1}$ in $O(1)$ time, $\|\boldsymbol{t}\|_{1}$ and $\left\|\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}\right\|_{\infty}$ are known, and $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}>0$ . Then there exists a randomized algorithm that computes an estimate $\hat{x}$ such that $\Pr\left\{\left|\hat{x}-\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}\right|\leq% \varepsilon\cdot\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}\right\}\geq\frac{3}% {4}$ in expected time

\displaystyle O\left(\frac{f_{\mathrm{row}}(\mathbf{M})\|\boldsymbol{t}\|_{1}% \left\|\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}\right\|_{\infty}L^{2}}{% \varepsilon^{2}\cdot\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}}\right)=% \widetilde{O}\left(\frac{f_{\mathrm{row}}(\mathbf{M})\|\boldsymbol{t}\|_{1}% \left\|\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}\right\|_{\infty}}{\gamma^{2}% \varepsilon^{2}\cdot\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}}\right).

Our next result leverages the local push method to derive a deterministic algorithm for special RCDD systems, whose complexity depends linearly on $1/\varepsilon$ .

Theorem 7.

Suppose that $\mathbf{M}$ is RCDD, its nonzero entries have absolute values of $\Omega(1)$ , and we can scan the nonzero entries of $\boldsymbol{b}$ in $O\big(\|\boldsymbol{b}\|_{0}\big)$ time. Then there exists a deterministic algorithm that computes a $\hat{x}$ such that $\left|\hat{x}-\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}\right|\leq\varepsilon% \|\boldsymbol{t}\|_{1}$ in time $O\big(\|\boldsymbol{b}\|_{0}\big)$ plus $O\left(\|\boldsymbol{b}\|_{1}L^{3}\varepsilon^{-1}\right)=\widetilde{O}\left(% \|\boldsymbol{b}\|_{1}\gamma^{-3}\varepsilon^{-1}\right)$ .

Lastly, applying the bidirectional method to special RCDD systems yields complexity bounds with improved dependence on $L$ and $\varepsilon$ , achieving either $L^{7/3}\varepsilon^{-2/3}$ or $L^{5/2}\varepsilon^{-1}$ using different parameter settings.

Theorem 8.

Suppose that $\mathbf{M}$ is RCDD, its nonzero entries have absolute values of $\Omega(1)$ , we can sample from the distribution $|\boldsymbol{t}|/\|\boldsymbol{t}\|_{1}$ in $O(1)$ time, $\|\boldsymbol{t}\|_{1}$ and $f_{\mathrm{row}}(\mathbf{M})$ are known, and we can scan through the nonzero entries of $\boldsymbol{b}$ in $O\big(\|\boldsymbol{b}\|_{0}\big)$ time. Then there exists a randomized algorithm that computes an estimate $\hat{x}$ such that $\Pr\left\{\left|\hat{x}-\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}\right|\leq% \varepsilon\right\}\geq\frac{3}{4}$ in time $O\big(\|\boldsymbol{b}\|_{0}\big)$ plus

	$\displaystyle\phantom{{}={}}O\left(\min\left(f_{\mathrm{row}}(\mathbf{M})^{% \frac{1}{3}}\\|\boldsymbol{t}\\|_{1}^{\frac{2}{3}}\\|\boldsymbol{b}\\|_{1}^{\frac{% 2}{3}}L^{\frac{7}{3}}\varepsilon^{-\frac{2}{3}},f_{\mathrm{row}}(\mathbf{M})^{% \frac{1}{2}}\\|\boldsymbol{t}\\|_{1}\\|\boldsymbol{b}\\|_{1}^{\frac{1}{2}}\left\\|% \mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}\right\\|_{\infty}^{\frac{1}{2}}L^{% \frac{5}{2}}\varepsilon^{-1}\right)\right)$
	$\displaystyle=\widetilde{O}\left(\min\left(\frac{f_{\mathrm{row}}(\mathbf{M})^% {1/3}\\|\boldsymbol{t}\\|_{1}^{2/3}\\|\boldsymbol{b}\\|_{1}^{2/3}}{\gamma^{7/3}% \varepsilon^{2/3}},\frac{f_{\mathrm{row}}(\mathbf{M})^{1/2}\\|\boldsymbol{t}\\|_% {1}\\|\boldsymbol{b}\\|_{1}^{1/2}\left\\|\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{% b}\right\\|_{\infty}^{1/2}}{\gamma^{5/2}\varepsilon}\right)\right).$

$\blacktriangleright$ Remark.

As we shall see, all theorems in this subsection except Theorem 5 still hold if we replace RDD/RDDZ by CDD/CDDZ, swap $\boldsymbol{b}$ and $\boldsymbol{t}$ (except in $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}$ ), and replace $f_{\mathrm{row}}(\mathbf{M})$ by $f_{\mathrm{col}}(\mathbf{M})$ in the statements. Theorem 5 is an exception because its proof relies on the property that $\left\|\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}\right\|_{\infty}\leq 2\|% \boldsymbol{x}^{\ast}\|_{\infty}$ for RDD $\mathbf{M}$ , which does not have a straightforward analog for the CDD case.

1.6 Connections to PageRank and Effective Resistance Computation

Our results can be directly applied to the PPR or PageRank contribution equations and the single-pair effective resistance problem to yield complexity bounds for different graph types under various access models and accuracy guarantees. Notably, as we shall see, for PageRank computation, half the decay factor serves as a lower bound on the maximum $p$ -norm gap of the corresponding systems; for effective resistance computation, the spectral gap of the graph Laplacian equals the maximum $p$ -norm gap of the system. Rather than exhaustively presenting all applications of our results, here we highlight selected results that generalize and improve upon the previously best complexity bounds.

The following theorem is derived by applying Theorem 6 to the PageRank equation (2) combined with tighter lower bounds on $\boldsymbol{\pi}_{G,\alpha}(t)$ that we establish for Eulerian graphs.

Theorem 9.

For any unweighted Eulerian graph $G$ , given the decay factor $\alpha$ , a target node $t\in V$ , and $\delta_{G}$ , there exists a randomized algorithm that estimates the PageRank value $\boldsymbol{\pi}_{G,\alpha}(t)$ within relative error $\varepsilon$ with probability at least $3/4$ in time

\displaystyle O\left(\frac{1}{\alpha\varepsilon^{2}}\cdot\frac{d_{G}(t)}{% \delta_{G}}\cdot\frac{1}{n\boldsymbol{\pi}_{G,\alpha}(t)}\right)=O\left(\frac{% 1}{\varepsilon^{2}\delta_{G}}\cdot\min\left(\frac{d_{G}(t)}{\alpha^{2}},\frac{% m/d_{G}(t)}{\alpha^{2}},\frac{\Delta_{G}}{\alpha},\frac{\sqrt{m}}{\alpha}% \right)\right).

This improves over the previously best upper bounds of $O\left(\frac{1}{\varepsilon^{2}\delta_{G}}\cdot\min\left(\frac{d_{G}(t)}{% \alpha^{2}},\frac{\sqrt{m}}{\alpha^{2}}\right)\right)$ , which was given by [45] and stated for unweighted undirected graphs. Our algorithm is essentially the same as that in [45], which generates random walks from the target node $t$ , and the improvement comes from our tighter lower bounds on $\boldsymbol{\pi}_{G,\alpha}(t)$ for Eulerian graphs.

For effective resistance computation, recall that we consider connected undirected graphs $G$ . Lemma 27 establishes that the value $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}$ that our algorithms approximate equals $R_{G}(s,t)$ when setting $\mathbf{M}=\mathbf{L}_{G}$ and $\boldsymbol{b}=\boldsymbol{t}=\boldsymbol{e}_{s}-\boldsymbol{e}_{t}$ . Thus, Theorems 4 and 8 directly imply the following complexity bounds for estimating effective resistance.

Corollary 10.

For any connected unweighted undirected graph $G$ , given nodes $s,t\in V$ and $\gamma>0$ as a lower bound on $\gamma(\mathbf{L}_{G})$ , there exists a randomized algorithm that estimates the effective resistance $R_{G}(s,t)$ within absolute error $\varepsilon$ with probability at least $3/4$ in time

\displaystyle O\left(\min\left(\frac{L^{3}}{\varepsilon^{2}\cdot\min\big(d_{G}% (s),d_{G}(t)\big)^{2}},\frac{L^{7/3}}{\varepsilon^{2/3}},\frac{L^{5/2}}{% \varepsilon\cdot\min\big(d_{G}(s),d_{G}(t)\big)^{1/2}}\right)\right),

where $L:=\Theta\left(\frac{1}{\gamma}\log\left(\frac{1}{\gamma\varepsilon}\left(% \frac{1}{d_{G}(s)}+\frac{1}{d_{G}(t)}\right)\right)\right)$ .

This subsumes the previous bounds of $O\left(\min\left(\frac{L^{3}\log L}{\varepsilon^{2}\cdot\min(d_{G}(s),d_{G}(t)% )^{2}},\frac{L^{7/3}\log L}{\varepsilon^{2/3}}\right)\right)$ given by [14] with essentially the same setting of $L$ . Moreover, since $R_{G}(s,t)$ can be lower bounded by $1/2\big/\min\big(d_{G}(s),d_{G}(t)\big)$ [34, Corollary 3.3], by setting the absolute error parameter $\varepsilon$ to be $\varepsilon_{r}\cdot 1/2\big/\min\big(d_{G}(s),d_{G}(t)\big)$ , the last bound in our result implies that an estimate of $R_{G}(s,t)$ within relative error $\varepsilon_{r}$ can be computed in time $O\left(\min\big(d_{G}(s),d_{G}(t)\big)^{1/2}L^{5/2}\varepsilon_{r}^{-1}\right)$ . This improves over the previous bound of $O\left(\min\big(d_{G}(s),d_{G}(t)\big)^{1/2}L^{3}\log L\cdot\varepsilon_{r}^{-% 1}\right)$ given by [52]. Our algorithms are essentially the same as those in [14, 52], i.e., using random-walk sampling and the bidirectional method, and the improvements stem from our simpler sampling scheme that avoids using extra data structures and a refined analysis.

On the other hand, known hardness results for local PageRank and effective resistance computation can potentially yield lower bounds for sublinear-time solvers. We highlight the following result, derived by establishing a reduction from estimating single-node PageRank on undirected graphs to solving SDD systems and applying the lower bound for PageRank computation from [45].

Theorem 11.

For any large enough $n$ and $\varepsilon=\Omega(1/n)$ , there exist $\boldsymbol{b}\in\mathbb{R}^{n}$ and $t\in[n]$ that satisfy the following. Every randomized algorithm that, given access to an invertible SDD matrix $\mathbf{S}\in\mathbb{R}^{n\times n}$ whose spectral gap is $\Omega(1)$ , succeeds with probability at least $3/4$ to approximate $\boldsymbol{x}^{\ast}(t)$ within absolute error $\varepsilon\|\boldsymbol{x}^{\ast}\|_{\infty}$ , must probe $\Omega(1/\varepsilon)$ coordinates of $\mathbf{S}$ in the worst case. Here, $\boldsymbol{x}^{\ast}=\mathbf{S}^{-1}\boldsymbol{b}$ .

This result gives an $\Omega(1/\varepsilon)$ lower bound for local SDD solvers with accuracy guarantee $\varepsilon\|\boldsymbol{x}^{\ast}\|_{\infty}$ , demonstrating the necessity of a linear dependence on $1/\varepsilon$ in our Theorem 4. This lower bound on the accuracy parameter complements the lower bound on the spectral gap given by [4].

1.7 Understanding ForwardPush and BackwardPush on Graphs

Inspired by the local push algorithms ForwardPush and BackwardPush, we formulate our Push algorithm as a unified primitive for estimating a summation of matrix powers applied to a vector, which is closely related to our approach to solving RDD/CDD systems. This abstraction reveals that ForwardPush and BackwardPush, despite appearing as distinct algorithms for two problems with different propagation strategies, are equivalent to applying Push to different linear systems, modulo variable scaling. The apparent differences arise from the scaling and the distinct behaviors that Push exhibits on different types of linear systems.

Specifically, for PageRank computation, ForwardPush from node $s$ corresponds to applying Push to Equation (2) with $\boldsymbol{s}=\boldsymbol{e}_{s}$ and outdegree scaling, while BackwardPush from node $t$ applies Push to the contribution equations (3) or (4). Our analysis demonstrates that Push provides closed-form complexity bounds on certain CDD systems and accuracy guarantees on RDD systems, which explains the known computational properties of ForwardPush and BackwardPush on directed graphs.

For RCDD systems, Push inherits both complexity and accuracy advantages, clarifying why ForwardPush and BackwardPush perform well on undirected graphs. This perspective further reveals that on Eulerian graphs, ForwardPush from any node is equivalent to BackwardPush from that node on the transpose graph, establishing a basic connection previously unrecognized.

1.8 Technical Overview

Our characterizations of the solution $\boldsymbol{x}^{\ast}$ and the $p$ -norm gaps are based on fundamental properties of Neumann series and restricted linear maps. Since asymmetric matrices may be non-diagonalizable, the eigendecomposition techniques used for symmetric matrices become inapplicable. We address this challenge by analyzing the operator norms of restricted linear maps to establish series convergence and derive truncation error bounds.

Our algorithms estimate $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}=\frac{1}{2}\boldsymbol{t}^{\top% }\sum_{\ell=0}^{L-1}\left(\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^% {-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)\right)^{\ell}\mathbf{D}_{\mathbf{M}}% ^{-1}\boldsymbol{b}$ by adapting three techniques for random-walk probability estimation on graphs: random-walk sampling [39, 19], local push [2, 1], and the bidirectional method [32].

When $\mathbf{M}$ is RDD, the matrix $\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\left|\mathbf{A}_{% \mathbf{M}}^{\top}\right|\right)$ is row substochastic, enabling us to interpret $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}$ as an expectation over random walks that start from the distribution $|\boldsymbol{t}|/\|\boldsymbol{t}\|_{1}$ and transition according to $\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\left|\mathbf{A}_{% \mathbf{M}}^{\top}\right|\right)$ . Such random walks may terminate early, and the algorithm needs to record the signs of the entries in $\mathbf{A}_{\mathbf{M}}$ along the walk. This method extends the approach in [4], and we adopt a different sampling scheme and conduct variance analysis in some special cases to reduce the dependence on $L$ in the complexity.

Based on local push methods, we formulate a Push primitive that estimates the vector $\boldsymbol{x}^{\ast}_{L}=\sum_{\ell=0}^{L-1}\left(\frac{1}{2}\left(\mathbf{I}% +\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)\right)^{% \ell}\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}$ through deterministic local computation. Push maintains coordinate variables initialized to $\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}$ and iteratively applies the linear operator $\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}% }^{\top}\right)$ to selected coordinates. Our analysis relies on invariant properties preserved by these operations, including an inequality variant that helps to handle negative entries. This approach yields closed-form accuracy guarantees for RDD systems and complexity bounds for special CDD systems. Combining these two aspects yield our result for special RCDD systems.

The bidirectional method combines random-walk sampling and local push from two directions. We adapt the BiPPR framework [32], performing Push from $\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}$ and exploiting the invariant property to construct an unbiased estimator for $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}$ , which can be sampled via random walks from $|\boldsymbol{t}|/\|\boldsymbol{t}\|_{1}$ when $\mathbf{M}$ is RDD. By balancing the costs of both components, we achieve improved dependence on $L$ and $\varepsilon$ , particularly for RCDD systems.

Crucially, we can transpose the expression for $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}$ to obtain an equivalent summation with $\mathbf{A}_{\mathbf{M}}^{\top}$ replaced by $\mathbf{A}_{\mathbf{M}}$ and the roles of $\boldsymbol{b}$ and $\boldsymbol{t}$ exchanged. This allows us to alternatively apply Push from $\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{t}$ and random-walk sampling from $|\boldsymbol{b}|/\|\boldsymbol{b}\|_{1}$ when $\mathbf{M}$ is CDD, leading to symmetric algorithmic procedures and complexity results.

1.9 Paper Organization

The remainder of this paper is organized as follows. Section 2 introduces more related work. Section 3 elaborates on our formulation of $\boldsymbol{x}^{\ast}$ and $p$ -norm gaps and proves Theorems 1 to 3. In Sections 4, 5, and 6, we present our algorithms based on random-walk sampling, local push, and the bidirectional method, respectively, and prove Theorems 4 to 8. After that, Sections 7 and 8 relate our problem and results to local computation of PageRank and effective resistance, respectively. In the full version of this paper [28], we provide more preliminaries, discussions on future directions, detailed explanations of the relationship between our Push primitive and the ForwardPush and BackwardPush algorithms, additional results, and all omitted proofs.

2 Other Related Work

A vast literature exists on nearly-linear-time Laplacian solvers and their extensions, including solvers for undirected Laplacian/SDD systems (e.g., [39, 40, 26]) and directed Laplacian/RDD/CDD systems (e.g., [13, 11, 25]). These solvers achieve nearly linear time complexity in $\operatorname{nnz}(\mathbf{M})$ with polylogarithmic dependence on $1/\varepsilon$ and the condition number. The development of global RDD/CDD solvers relies on reductions from solving RDD/CDD systems to solving Eulerian Laplacian systems, combined with efficient methods for computing PPR vectors with small $\alpha$ and the stationary distribution of random walks on graphs [13]. However, the global techniques from algorithmic linear algebra and known reductions to the Eulerian case do not directly apply to the local setting, revealing a fundamental distinction between global and local RDD/CDD solvers. In fact, the lower bounds established in [3] and this work demonstrate that local SDD solvers require polynomial dependence on $1/\varepsilon$ and $1/\gamma$ , indicating a separation between global and local SDD solvers.

[16] develops probabilistic logspace solvers for certain classes of directed Laplacian systems. Their method also relies on approximating truncated Neumann series, but they bound the truncation error using spectral radius and Jordan normal form, which yield truncation parameters of at least $n^{2}$ . Such a huge truncation parameter makes their algorithm and analysis inapplicable to the sublinear-time setting. As an aside, in the quantum regime, [41] presents algorithms for inverting well-conditioned matrices in quantum logspace, but their approaches are not directly applicable to our classical sublinear-time framework.

The idea of using random-walk sampling to solve linear systems dates back to the von Neumann-Ulam algorithm for approximating matrix inversion [20, 48]. The bidirectional method for estimating random-walk probabilities on graphs is first proposed in [33], which is inspired by property testing techniques [23, 27] and later simplified by the BiPPR framework [32].

The bidirectional idea has been widely applied to compute PageRank [5, 31, 32, 8, 49, 47, 6, 43], effective resistance [14, 52], heat kernel [8], and Markov Chain transition probability [5]. Among them, [47] proves that the simple BiPPR framework computes single-node PageRank on unweighted directed graphs in optimal time complexity (in terms of $n$ and $m$ ). Their analysis relies on a new complexity bound of BackwardPush, which is parameterized by the PageRank value of the target node. Recently, [52] shows that the bidirectional technique can yield faster algorithms for constructing effective resistance sketch (as defined in [18]) on expander graphs, and [43] combines random-walk sampling with a novel randomized local push technique to improve the complexity of estimating single-node PageRank on directed graphs with bounded in-degree.

[37] uses the bidirectional method to estimate a single element in the product of a matrix power and a vector, which relates to our estimation of $\frac{1}{2}\boldsymbol{t}^{\top}\sum_{\ell=0}^{L-1}\left(\frac{1}{2}\left(% \mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)% \right)^{\ell}\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}$ . However, they only derive an average-case complexity bound under some bounded-norm conditions and discuss its applications to solving PSD systems. In contrast, we conduct a more comprehensive study of this problem and apply it to solving RDD/CDD systems.

PageRank and PPR have been extensively studied and widely applied; we refer interested readers to the surveys [29, 21, 50]. More lower bounds for PageRank computation can be found in [8, 50, 6] and references therein. The recent work [6] conducts a comprehensive study of various types of PPR estimation problems using different graph access queries under both worst-case and average-case settings. For constant decay factors, they provide nearly tight complexity upper and lower bounds for achieving constant relative error guarantees when the target value is above a given threshold.

Effective resistance is ubiquitous in spectral graph theory [17, 34, 38, 25]. A line of work [35, 51, 14, 52] focuses on locally estimating single-pair effective resistance through multi-step random-walk probabilities. [10] studies this problem on non-expander graphs and establishes strong complexity lower bounds, though it does not explicitly give a lower bound on the spectral gap of the graph. Recently, [52] provides a lower bound on the relative error parameter for this problem. Besides, a line of work [30, 18, 52] studies the problem of constructing effective resistance sketches. Technically, [30] leverages random-walk sampling, [18] uses count sketches and SDD solvers, and [52] employs a bidirectional approach.

3 Formulation of $\boldsymbol{x}^{\ast}$ and the $𝒑$ -Norm Gaps

In this section, we prove Theorems 1, 2, and 3 and give further explanations on our formulations of $\boldsymbol{x}^{\ast}$ and the $p$ -norm gaps.

First, we need the following lemma that upper bounds $\left\|\mathbf{D}_{\mathbf{M}}^{-1/q}\mathbf{A}_{\mathbf{M}}^{\top}\mathbf{D}_% {\mathbf{M}}^{-1/p}\right\|_{p}$ for certain $\mathbf{M}$ and Hölder conjugates $p, q$ . Its proof is given in the full version of this paper [28].

Lemma 12.

The following hold:

1.

If $\mathbf{M}$ is RDD, then $\left\|\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right\|_{% \infty}\leq 1$ .
2.

If $\mathbf{M}$ is CDD, then $\left\|\mathbf{A}_{\mathbf{M}}^{\top}\mathbf{D}_{\mathbf{M}}^{-1}\right\|_{1}\leq 1$ .
3.

If $\mathbf{M}$ is RCDD, then $\left\|\mathbf{D}_{\mathbf{M}}^{-1/q}\mathbf{A}_{\mathbf{M}}^{\top}\mathbf{D}_% {\mathbf{M}}^{-1/p}\right\|_{p}\leq 1$ for any Hölder conjugates $p, q$ .

To establish our formulation of $\boldsymbol{x}^{\ast}$ in Theorem 1, we use the following lemma, which characterizes the operator norm and Neumann series of certain restricted linear maps. The proof of this lemma is given in the full version of this paper [28].

Lemma 13.

Suppose $\mathbf{X}\in\mathbb{R}^{n\times n}$ and $\|\cdot\|$ is the operator norm induced by some vector norm $\|\cdot\|$ . If $\|\mathbf{X}\|\leq 1$ , then, for $\bar{\mathbf{X}}:=\frac{1}{2}(\mathbf{I}+\mathbf{X})$ , we have $\left\|\left.\bar{\mathbf{X}}\right|_{\operatorname{range}(\mathbf{I}-\mathbf{% X})}\right\|<1$ and

\displaystyle\left(\left.(\mathbf{I}-\mathbf{X})\right|_{\operatorname{range}(% \mathbf{I}-\mathbf{X})}\right)^{-1}=\frac{1}{2}\left(\left.\left(\mathbf{I}-% \bar{\mathbf{X}}\right)\right|_{\operatorname{range}(\mathbf{I}-\mathbf{X})}% \right)^{-1}=\frac{1}{2}\sum_{\ell=0}^{\infty}\left(\left.\bar{\mathbf{X}}% \right|_{\operatorname{range}(\mathbf{I}-\mathbf{X})}\right)^{\ell}.

Proof of Theorem 1.

First assume that $\mathbf{M}$ is RDD. Applying Lemmas 12 and 13, we have $\left\|\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right\|_{% \infty}\leq 1$ and

\displaystyle\left(\left.\left(\mathbf{I}-\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{% A}_{\mathbf{M}}^{\top}\right)\right|_{\operatorname{range}\left(\mathbf{I}-% \mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)}\right)^{-1}% =\frac{1}{2}\sum_{\ell=0}^{\infty}\left(\left.\frac{1}{2}\left(\mathbf{I}+% \mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)\right|_{% \operatorname{range}\left(\mathbf{I}-\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{% \mathbf{M}}^{\top}\right)}\right)^{\ell}.

As $\boldsymbol{b}\in\operatorname{range}(\mathbf{M})=\operatorname{range}\left(% \mathbf{D}_{\mathbf{M}}-\mathbf{A}_{\mathbf{M}}^{\top}\right)$ , we have $\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}\in\operatorname{range}\left(\mathbf% {I}-\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)$ , so $\boldsymbol{x}^{\ast}$ converges to

	$\displaystyle\frac{1}{2}\sum_{\ell=0}^{\infty}\left(\frac{1}{2}\left(\mathbf{I% }+\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)\right)^{% \ell}\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}$	$\displaystyle=\frac{1}{2}\sum_{\ell=0}^{\infty}\left(\left.\frac{1}{2}\left(% \mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)% \right\|_{\operatorname{range}\left(\mathbf{I}-\mathbf{D}_{\mathbf{M}}^{-1}% \mathbf{A}_{\mathbf{M}}^{\top}\right)}\right)^{\ell}\mathbf{D}_{\mathbf{M}}^{-% 1}\boldsymbol{b}$
		$\displaystyle=\left(\left.\left(\mathbf{I}-\mathbf{D}_{\mathbf{M}}^{-1}\mathbf% {A}_{\mathbf{M}}^{\top}\right)\right\|_{\operatorname{range}\left(\mathbf{I}-% \mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)}\right)^{-1}% \mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}.$

Thus, we can check that

\displaystyle\mathbf{M}\boldsymbol{x}^{\ast}=\mathbf{D}_{\mathbf{M}}\left(% \mathbf{I}-\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)% \left(\left.\left(\mathbf{I}-\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M% }}^{\top}\right)\right|_{\operatorname{range}\left(\mathbf{I}-\mathbf{D}_{% \mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)}\right)^{-1}\mathbf{D}_{% \mathbf{M}}^{-1}\boldsymbol{b}=\mathbf{D}_{\mathbf{M}}\mathbf{D}_{\mathbf{M}}^% {-1}\boldsymbol{b}=\boldsymbol{b}.

Next, observe that for any Hölder conjugates $p, q$ , we have

	$\displaystyle\phantom{{}={}}\mathbf{D}_{\mathbf{M}}^{-1/p}\left(\frac{1}{2}% \left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1/q}\mathbf{A}_{\mathbf{M}}^{\top}% \mathbf{D}_{\mathbf{M}}^{-1/p}\right)\right)^{\ell}\mathbf{D}_{\mathbf{M}}^{-1% /q}$
	$\displaystyle=\mathbf{D}_{\mathbf{M}}^{-1/p}\left(\frac{1}{2}\mathbf{D}_{% \mathbf{M}}^{1/p}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{% \mathbf{M}}^{\top}\right)\mathbf{D}_{\mathbf{M}}^{-1/p}\right)^{\ell}\mathbf{D% }_{\mathbf{M}}^{-1/q}=\left(\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}% }^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)\right)^{\ell}\mathbf{D}_{\mathbf{M% }}^{-1}$

for each $\ell\geq 0$ , so

\displaystyle\boldsymbol{x}^{\ast}=\frac{1}{2}\mathbf{D}_{\mathbf{M}}^{-1/p}% \sum_{\ell=0}^{\infty}\left(\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}% }^{-1/q}\mathbf{A}_{\mathbf{M}}^{\top}\mathbf{D}_{\mathbf{M}}^{-1/p}\right)% \right)^{\ell}\mathbf{D}_{\mathbf{M}}^{-1/q}\boldsymbol{b}

for any Hölder conjugates $p, q$ .

When $\mathbf{M}$ is CDD, we consider the expression $\boldsymbol{x}^{\ast}=\frac{1}{2}\mathbf{D}_{\mathbf{M}}^{-1}\sum_{\ell=0}^{% \infty}\left(\frac{1}{2}\left(\mathbf{I}+\mathbf{A}_{\mathbf{M}}^{\top}\mathbf% {D}_{\mathbf{M}}^{-1}\right)\right)^{\ell}\boldsymbol{b}$ . We have $\left\|\mathbf{A}_{\mathbf{M}}^{\top}\mathbf{D}_{\mathbf{M}}^{-1}\right\|_{1}\leq 1$ by Lemma 12, so applying the above arguments shows that $\boldsymbol{x}^{\ast}$ converges to $\mathbf{D}_{\mathbf{M}}^{-1}\left(\left.\left(\mathbf{I}-\mathbf{A}_{\mathbf{M% }}^{\top}\mathbf{D}_{\mathbf{M}}^{-1}\right)\right|_{\operatorname{range}\left% (\mathbf{I}-\mathbf{A}_{\mathbf{M}}^{\top}\mathbf{D}_{\mathbf{M}}^{-1}\right)}% \right)^{-1}\boldsymbol{b}$ and

\displaystyle\mathbf{M}\boldsymbol{x}^{\ast}=\left(\mathbf{I}-\mathbf{A}_{% \mathbf{M}}^{\top}\mathbf{D}_{\mathbf{M}}^{-1}\right)\mathbf{D}_{\mathbf{M}}% \mathbf{D}_{\mathbf{M}}^{-1}\left(\left.\left(\mathbf{I}-\mathbf{A}_{\mathbf{M% }}^{\top}\mathbf{D}_{\mathbf{M}}^{-1}\right)\right|_{\operatorname{range}\left% (\mathbf{I}-\mathbf{A}_{\mathbf{M}}^{\top}\mathbf{D}_{\mathbf{M}}^{-1}\right)}% \right)^{-1}\boldsymbol{b}=\boldsymbol{b}.

So we have proved that $\boldsymbol{x}^{\ast}$ is well-defined and satisfies $\mathbf{M}\boldsymbol{x}^{\ast}=\boldsymbol{b}$ when $\mathbf{M}$ is RDD or CDD.

Next, assume that $\mathbf{M}$ is SDD. Then $\mathbf{M}$ is RCDD and Lemma 12 implies that $\left\|\widetilde{\mathbf{A}}_{\mathbf{M}}^{\top}\right\|_{2}\leq 1$ . Repeating the above arguments with $\mathbf{D}_{\mathbf{M}}^{-1/2}\boldsymbol{b}\in\operatorname{range}\left(% \mathbf{I}-\widetilde{\mathbf{A}}_{\mathbf{M}}^{\top}\right)$ gives

\displaystyle\boldsymbol{x}^{\ast}=\mathbf{D}_{\mathbf{M}}^{-1/2}\left(\left.% \left(\mathbf{I}-\widetilde{\mathbf{A}}_{\mathbf{M}}^{\top}\right)\right|_{% \operatorname{range}\left(\mathbf{I}-\widetilde{\mathbf{A}}_{\mathbf{M}}^{\top% }\right)}\right)^{-1}\mathbf{D}_{\mathbf{M}}^{-1/2}\boldsymbol{b}=\mathbf{D}_{% \mathbf{M}}^{-1/2}\left(\left.\widetilde{\mathbf{M}}\right|_{\operatorname{% range}(\widetilde{\mathbf{M}})}\right)^{-1}\mathbf{D}_{\mathbf{M}}^{-1/2}% \boldsymbol{b}.

Since $\mathbf{M}$ is symmetric, $\widetilde{\mathbf{M}}$ is also symmetric, so $\operatorname{range}(\widetilde{\mathbf{M}})=\ker(\widetilde{\mathbf{M}})^{\perp}$ . By the property of the pseudoinverse, we have $\boldsymbol{x}^{\ast}=\mathbf{D}_{\mathbf{M}}^{-1/2}\left(\left.\widetilde{% \mathbf{M}}\right|_{\ker(\widetilde{\mathbf{M}})^{\perp}}\right)^{-1}\mathbf{D% }_{\mathbf{M}}^{-1/2}\boldsymbol{b}=\mathbf{D}_{\mathbf{M}}^{-1/2}\widetilde{% \mathbf{M}}^{+}\mathbf{D}_{\mathbf{M}}^{-1/2}\boldsymbol{b}$ , finishing the proof. $\hfill\blacktriangleleft$

Recall that if $\mathbf{M}$ is RDD, then $1-\left\|\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right\|_{% \infty}=\min_{j\in[n]}\left\{\frac{d_{\mathbf{M}}(j)-\sum_{k\neq j}|\mathbf{M}% (j,k)|}{d_{\mathbf{M}}(j)}\right\}\geq 0$ , and this quantity measures how strongly the diagonal entries dominate the off-diagonal entries in each row. However, this quantity can equal zero, making it unsuitable as a useful notion of “gap.” In contrast, our Theorem 2 shows that the maximum $p$ -norm gap $\gamma_{\max}(\mathbf{M})$ is always strictly positive when $\mathbf{M}$ is RDD/CDD. As an example, $\gamma_{\infty}(\mathbf{M})=1-\left\|\left.\frac{1}{2}\left(\mathbf{I}+\mathbf% {D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)\right|_{% \operatorname{range}\left(\mathbf{I}-\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{% \mathbf{M}}^{\top}\right)}\right\|_{\infty}$ , which refines the quantity $1-\left\|\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right\|_{\infty}$ by replacing $\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}$ with $\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}% }^{\top}\right)$ and restricting the operator to $\operatorname{range}\left(\mathbf{I}-\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{% \mathbf{M}}^{\top}\right)$ .

Although the $p$ -norm gaps only involve operator norms for the restricted linear maps, Theorem 3 shows that they are sufficient to bound the truncation error between $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}$ and $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}$ . The proofs of Theorems 2 and 3 are given in the full version of this paper [28].

4 Random-Walk Sampling

This section presents a Monte Carlo algorithm for estimating $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}$ via random-walk sampling. All our algorithms in this paper aim to estimate $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}$ by approximating $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}$ . By Theorem 3 and our setting of $L$ , it suffices to ensure that the estimate $\hat{x}$ satisfies that $\left|\hat{x}-\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}\right|$ is at most half the desired accuracy guarantee with probability at least $3/4$ , and we will omit this matter in the following proofs.

We first focus on RDD systems and transfer the results to CDD systems by transposing the expression of $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}$ at the end of this section. When $\mathbf{M}$ is RDD, $\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\left|\mathbf{A}_{% \mathbf{M}}^{\top}\right|\right)$ is row substochastic and we can estimate

\displaystyle\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}=\frac{1}{2}% \boldsymbol{t}^{\top}\sum_{\ell=0}^{L-1}\left(\frac{1}{2}\left(\mathbf{I}+% \mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)\right)^{\ell% }\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}

by generating random-walk sampling from $|\boldsymbol{t}|/\|\boldsymbol{t}\|_{1}$ . In each sample, we first sample a walk length $\ell$ from $[0,L-1]$ uniformly at random and sample a source coordinate $v$ from the distribution $|\boldsymbol{t}|/\|\boldsymbol{t}\|_{1}$ . Then we simulate a lazy random walk for $\ell$ steps starting from $v$ according to the transition matrix $\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\left|\mathbf{A}_{% \mathbf{M}}^{\top}\right|\right)$ . Specifically, at each step from $v$ , with probability $\frac{1}{2}$ the walk stays put at $v$ , with probability $\frac{|\mathbf{A}_{\mathbf{M}}(u,v)|}{2d_{\mathbf{M}}(v)}$ the walk moves to each $u\in[n]$ , and with the remaining probability $\frac{1}{2}-\sum_{u\in[n]}\frac{|\mathbf{A}_{\mathbf{M}}(u,v)|}{2d_{\mathbf{M}% }(v)}$ the walk terminates, where $d_{\mathbf{M}}(v):=\mathbf{M}(v,v)$ . Note that these probabilities lie in $[0,1]$ since $\mathbf{M}$ is RDD. Additionally, we keep track of the product of the signs of the initial entry in $\boldsymbol{t}$ and the entries in $\mathbf{A}_{\mathbf{M}}$ along the walk (where stay-put steps have sign $1$ ), which is denoted as $\sigma$ . After $\ell$ steps, if the walk has not terminated and is at coordinate $v$ , we take the value $\sigma\cdot\frac{1}{2}\|\boldsymbol{t}\|_{1}\cdot\frac{\boldsymbol{b}(v)}{d_{% \mathbf{M}}(v)}\cdot L$ as the estimate of this sample. We repeat this process for $n_{\mathrm{s}}$ independent samples and return the average as the final estimate $\hat{x}$ . We give a pseudocode for this approach in the full version of this paper [28].

We emphasize that our sampling scheme is different from the framework in [4], in that we first sample the walk length $\ell$ and then perform $\ell$ steps of the random walk, while they perform $L$ steps in each sample and take the quantities obtained in each step into account. As it turns out, our scheme is easier to analyze and will save a factor of $\log L$ in the number of samples.

We first establish the unbiasedness of the sampling scheme, whose proof is given in the full version of this paper [28].

Lemma 14.

Each sample described above gives an unbiased estimate of $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}$ .

We can now prove Theorem 4 by applying the Hoeffding bound.

Proof of Theorem 4.

We use random-walk sampling as described above. The algorithm returns $\hat{x}$ as the average of $n_{\mathrm{s}}$ independent samples, where each sampled value has absolute value at most $\frac{1}{2}\|\boldsymbol{t}\|_{1}\left\|\mathbf{D}_{\mathbf{M}}^{-1}% \boldsymbol{b}\right\|_{\infty}L$ . Thus, by Lemma 14 and the Hoeffding bound, $\Pr\left\{\left|\hat{x}-\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}\right|% \geq\frac{1}{2}\varepsilon\left\|\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}% \right\|_{\infty}\right\}$ is upper bounded by

\displaystyle 2\exp\left(-\frac{2n_{\mathrm{s}}\left(\frac{1}{2}\varepsilon% \left\|\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}\right\|_{\infty}\right)^{2}}% {\left(\|\boldsymbol{t}\|_{1}\left\|\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}% \right\|_{\infty}L\right)^{2}}\right)=2\exp\left(-\frac{n_{\mathrm{s}}\cdot% \varepsilon^{2}}{2\|\boldsymbol{t}\|_{1}^{2}L^{2}}\right).

To guarantee that this probability is at most $1/4$ , we set $n_{\mathrm{s}}:=\Theta\left(\|\boldsymbol{t}\|_{1}^{2}L^{2}/\varepsilon^{2}\right)$ .

As each sample simulates at most $L$ steps of random walk, and each step takes $O\big(f_{\mathrm{row}}(\mathbf{M})\big)$ time, the time complexity is $O\big(f_{\mathrm{row}}(\mathbf{M})L\cdot n_{\mathrm{s}}\big)=O\left(f_{\mathrm% {row}}(\mathbf{M})\|\boldsymbol{t}\|_{1}^{2}L^{3}/\varepsilon^{2}\right)$ , as desired. $\hfill\blacktriangleleft$

The proof of Theorem 5 relies on a variance analysis when $\mathbf{A}_{\mathbf{M}}$ , $\boldsymbol{b}$ , and $\boldsymbol{t}$ are nonnegative. Additionally, Theorem 6 assumes that $\left\|\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}\right\|_{\infty}$ is known and provides a relative error guarantee. Its proof relies on the Stopping Rule Theorem given in [15] for adaptively setting the number of samples in Monte Carlo estimation. The proofs of Theorems 5 and 6 are given in the full version of this paper [28].

On the other hand, if $\mathbf{M}$ is CDD, then $\mathbf{M}^{\top}$ is RDD and $\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\left|\mathbf{A}_{% \mathbf{M}}\right|\right)$ is row substochastic. As we can transpose the expression of $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}$ to obtain

\displaystyle\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}=\frac{1}{2}% \boldsymbol{b}^{\top}\mathbf{D}_{\mathbf{M}}^{-1}\sum_{\ell=0}^{L-1}\left(% \frac{1}{2}\left(\mathbf{I}+\mathbf{A}_{\mathbf{M}}\mathbf{D}_{\mathbf{M}}^{-1% }\right)\right)^{\ell}\boldsymbol{t}=\frac{1}{2}\boldsymbol{b}^{\top}\sum_{% \ell=0}^{L-1}\left(\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}% \mathbf{A}_{\mathbf{M}}\right)\right)^{\ell}\mathbf{D}_{\mathbf{M}}^{-1}% \boldsymbol{t},

our algorithms and results for RDD systems (except Theorem 5) applies to CDD systems by interchanging $\boldsymbol{t}$ with $\boldsymbol{b}$ and replacing $\mathbf{A}_{\mathbf{M}}^{\top}$ with $\mathbf{A}_{\mathbf{M}}$ . This justifies our claim that we can derive symmetric results by replacing RDD/RDDZ by CDD/CDDZ, swapping $\boldsymbol{b}$ and $\boldsymbol{t}$ (except in $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}$ ), and replacing $f_{\mathrm{row}}(\mathbf{M})$ by $f_{\mathrm{col}}(\mathbf{M})$ in the theorem statements. This argument also straightforwardly applies to our subsequent algorithms and results based on local push and the bidirectional method.

5 The Local Push Method

In this section, we adapt the local push methods to estimate $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}$ . We first describe our Push algorithm as a primitive that can be applied to both RDD and CDD systems. After that, we establish different properties of Push for RDD and CDD systems. This leads to our proof for Theorem 7.

For both RDD and CDD systems $\mathbf{M}\boldsymbol{x}=\boldsymbol{b}$ , we describe our Push algorithm as a primitive that can be used for approximating the vector $2\boldsymbol{x}^{\ast}_{L}=\sum_{\ell=0}^{L-1}\left(\frac{1}{2}\left(\mathbf{I% }+\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)\right)^{% \ell}\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}$ . The pseudocode is given in Algorithm 1.

In the Push algorithm, the initialization step sets the reserve and residue vectors to $\boldsymbol{0}$ , except that $\boldsymbol{r}^{(0)}$ is set to be $\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}$ , which requires $O\big(\|\boldsymbol{b}\|_{0}\big)$ time if we assume that we can scan through the nonzero entries of $\boldsymbol{b}$ in $O\big(\|\boldsymbol{b}\|_{0}\big)$ time. Next, the main loop iterates over levels $\ell$ from $0$ to $L-2$ . At each level $\ell$ , the algorithm performs a local push operation on each coordinate $v$ whose residue $\boldsymbol{r}^{(\ell)}(v)$ exceeds the threshold $r_{\max}$ in absolute value. The push operation on $v$ at level $\ell$ sets the reserve $\boldsymbol{p}^{(\ell)}(v)$ to $\boldsymbol{r}^{(\ell)}(v)$ , increments $\boldsymbol{r}^{(\ell+1)}$ by $\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}% }^{\top}\right)\left(\boldsymbol{r}^{(\ell)}(v)\boldsymbol{e}_{v}\right)$ , and sets $\boldsymbol{r}^{(\ell)}(v)$ to $0$ . In effect, the second step in the push operation increases $\boldsymbol{r}^{(\ell+1)}(v)$ by $\frac{1}{2}\boldsymbol{r}^{(\ell)}(v)$ and increases $\boldsymbol{r}^{(\ell+1)}(u)$ by $\frac{\mathbf{A}(v,u)}{2d_{\mathbf{M}}(u)}\cdot\boldsymbol{r}^{(\ell)}(v)$ for each $u\in[n]$ with $\mathbf{A}(v,u)\neq 0$ , which can be done in $\big\|\mathbf{M}(\cdot,v)\big\|_{0}$ time given oracle row/column access to $\mathbf{M}$ .

Algorithm 1

\textup{{Push}}(\mathbf{M},\boldsymbol{b},L,r_{\max})

.

The following lemma gives the key invariant property of the Push algorithm. Its proof is based on induction and can be found in the full version of this paper [28].

Lemma 15.

The push operations preserve the following invariant:

\displaystyle\sum_{\ell=0}^{L-1}\left(\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{% \mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)\right)^{\ell}\mathbf{D}_% {\mathbf{M}}^{-1}\boldsymbol{b}=\sum_{\ell=0}^{L-1}\boldsymbol{p}^{(\ell)}+% \sum_{\ell=0}^{L-1}\sum_{\ell^{\prime}=0}^{L-\ell-1}\left(\frac{1}{2}\left(% \mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)% \right)^{\ell}\boldsymbol{r}^{(\ell^{\prime})}.

In light of this invariant, we use $\frac{1}{2}\boldsymbol{t}^{\top}\left(\sum_{\ell=0}^{L-1}\boldsymbol{p}^{(\ell% )}+\boldsymbol{r}^{(L-1)}\right)$ as an estimate of $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}$ . Note that this quantity can be computed during the push process. The next lemma shows that if $\mathbf{M}$ is RDD, then the absolute error between this quantity and $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}$ can be bounded.

Lemma 16.

If $\mathbf{M}$ is RDD, then $\left|\frac{1}{2}\boldsymbol{t}^{\top}\left(\sum_{\ell=0}^{L-1}\boldsymbol{p}^% {(\ell)}+\boldsymbol{r}^{(L-1)}\right)-\boldsymbol{t}^{\top}\boldsymbol{x}^{% \ast}_{L}\right|\leq\frac{1}{2}\|\boldsymbol{t}\|_{1}L^{2}\cdot r_{\max}$ .

Proof.

By Lemma 15 and Equation (5), we have

	$\displaystyle\phantom{{}={}}\left\|\frac{1}{2}\boldsymbol{t}^{\top}\left(\sum_{% \ell=0}^{L-1}\boldsymbol{p}^{(\ell)}+\boldsymbol{r}^{(L-1)}\right)-\boldsymbol% {t}^{\top}\boldsymbol{x}^{\ast}_{L}\right\|=\frac{1}{2}\left\|\sum_{\ell^{\prime% }=0}^{L-2}\sum_{\ell=0}^{L-\ell^{\prime}-1}\boldsymbol{t}^{\top}\left(\frac{1}% {2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}% \right)\right)^{\ell}\boldsymbol{r}^{(\ell^{\prime})}\right\|$
	$\displaystyle\leq\frac{1}{2}\sum_{\ell^{\prime}=0}^{L-2}\sum_{\ell=0}^{L-\ell^% {\prime}-1}\\|\boldsymbol{t}\\|_{1}\left\\|\left(\frac{1}{2}\left(\mathbf{I}+% \mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)\right)^{\ell% }\boldsymbol{r}^{(\ell^{\prime})}\right\\|_{\infty}\leq\frac{1}{2}\\|\boldsymbol% {t}\\|_{1}L^{2}\cdot r_{\max},$

where we used $\left\|\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{% \mathbf{M}}^{\top}\right)\right\|_{\infty}\leq\frac{1}{2}\left(1+\left\|% \mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right\|_{\infty}% \right)\leq 1$ for RDD $\mathbf{M}$ and $\left\|\boldsymbol{r}^{(\ell^{\prime})}\right\|_{\infty}\leq r_{\max}$ for any $\ell^{\prime}\in[0,L-2]$ as guaranteed by the process of Push. $\hfill\blacktriangleleft$

We will also use the following inequality version of the invariant to bound the running time of the Push algorithm, whose proof is similar to that of the invariant equation and can be found in the full version of this paper [28].

Lemma 17.

The push operations preserve the following inequality:
$\displaystyle\sum_{\ell=0}^{L-1}\left(\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{% \mathbf{M}}^{-1}\left|\mathbf{A}_{\mathbf{M}}^{\top}\right|\right)\right)^{% \ell}\mathbf{D}_{\mathbf{M}}^{-1}|\boldsymbol{b}|\geq\sum_{\ell=0}^{L-1}\left|% \boldsymbol{p}^{(\ell)}\right|+\sum_{\ell=0}^{L-1}\sum_{\ell^{\prime}=0}^{L-% \ell-1}\left(\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\left|% \mathbf{A}_{\mathbf{M}}^{\top}\right|\right)\right)^{\ell}\left|\boldsymbol{r}% ^{(\ell^{\prime})}\right|.$

The next lemma bounds the complexity of the Push algorithm by a convoluted expression. We will shortly see how to simplify this expression for some special systems and settings.

Lemma 18.

Suppose that we can scan through the nonzero entries of $\boldsymbol{b}$ in $O\big(\|\boldsymbol{b}\|_{0}\big)$ time. Then the complexity of the Push algorithm is bounded by

\displaystyle O\left(\|\boldsymbol{b}\|_{0}+\frac{1}{r_{\max}}\sum_{v\in[n]}% \big\|\mathbf{M}(\cdot,v)\big\|_{0}\cdot\boldsymbol{e}_{v}^{\top}\sum_{\ell=0}% ^{L-1}\left(\frac{1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\left|% \mathbf{A}_{\mathbf{M}}^{\top}\right|\right)\right)^{\ell}\mathbf{D}_{\mathbf{% M}}^{-1}|\boldsymbol{b}|\right).

Proof.

Observe that each time a push operation is performed on a coordinate $v\in[n]$ , the value of $\boldsymbol{e}_{v}^{\top}\sum_{\ell=0}^{L-1}\left|\boldsymbol{p}^{(\ell)}\right|$ is increased by at least $r_{\max}$ . However, by Lemma 17, $\boldsymbol{e}_{v}^{\top}\sum_{\ell=0}^{L-1}\left|\boldsymbol{p}^{(\ell)}\right|$ is always upper bounded by $\boldsymbol{e}_{v}^{\top}\sum_{\ell=0}^{L-1}\left(\frac{1}{2}\left(\mathbf{I}+% \mathbf{D}_{\mathbf{M}}^{-1}\left|\mathbf{A}_{\mathbf{M}}^{\top}\right|\right)% \right)^{\ell}\mathbf{D}_{\mathbf{M}}^{-1}|\boldsymbol{b}|$ . Therefore, the total number of push operations performed on $v$ is at most $\frac{1}{r_{\max}}\cdot\boldsymbol{e}_{v}^{\top}\sum_{\ell=0}^{L-1}\left(\frac% {1}{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\left|\mathbf{A}_{\mathbf{M% }}^{\top}\right|\right)\right)^{\ell}\mathbf{D}_{\mathbf{M}}^{-1}|\boldsymbol{% b}|$ . Since each push operation on $v$ takes $O\left(\big\|\mathbf{M}(\cdot,v)\big\|_{0}\right)$ time, the lemma follows by summing the cost of the push operations over all $v\in[n]$ and adding the $O\big(\|\boldsymbol{b}\|_{0}\big)$ time for initialization. $\hfill\blacktriangleleft$

If $\mathbf{M}$ is CDD and the nonzero entries of $\mathbf{M}$ have absolute values of $\Omega(1)$ , the next lemma shows that the complexity of Push can be simplified. Its proof is given in the full version of this paper [28].

Lemma 19.

Suppose that $\mathbf{M}$ is CDD, the nonzero entries of $\mathbf{M}$ have absolute values of $\Omega(1)$ , and we can scan through the nonzero entries of $\boldsymbol{b}$ in $O\big(\|\boldsymbol{b}\|_{0}\big)$ time. Then the complexity of the Push algorithm is $O\big(\|\boldsymbol{b}\|_{0}+\|\boldsymbol{b}\|_{1}L/r_{\max}\big)$ .

Having established the properties of the Push algorithm for RDD and CDD systems, we can readily combine them to prove Theorem 7 for RCDD systems.

Proof of Theorem 7.

We run Push with $r_{\max}:=\varepsilon/L^{2}$ and use $\frac{1}{2}\boldsymbol{t}^{\top}\left(\sum_{\ell=0}^{L-1}\boldsymbol{p}^{(\ell% )}+\boldsymbol{r}^{(L-1)}\right)$ as the result. Since $\mathbf{M}$ is RDD, by Lemma 16, the absolute error between the estimate and $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}$ is upper bounded by $\frac{1}{2}\|\boldsymbol{t}\|_{1}L^{2}\cdot r_{\max}\leq\frac{1}{2}\varepsilon% \|\boldsymbol{t}\|_{1}$ . Since $\mathbf{M}$ is CDD and its nonzero entries have absolute values of $\Omega(1)$ , by Lemma 19, the time complexity of the Push algorithm is $O\big(\|\boldsymbol{b}\|_{0}+\|\boldsymbol{b}\|_{1}L/r_{\max}\big)=O\big(\|% \boldsymbol{b}\|_{0}+\|\boldsymbol{b}\|_{1}L^{3}/\varepsilon\big)$ . This finishes the proof. $\hfill\blacktriangleleft$

6 The Bidirectional Method

This section combines the techniques of random-walk sampling and local push to develop bidirectional algorithms for estimating $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}$ , which leads to Theorem 8.

The framework of the bidirectional method for RDD systems is presented in Algorithm 2. First, we invoke the Push algorithm to obtain the reserve and residue vectors. Recall that the invariant equation of Push (Lemma 15) implies that

	$\displaystyle\phantom{{}={}}\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}-% \frac{1}{2}\boldsymbol{t}^{\top}\left(\sum_{\ell=0}^{L-1}\boldsymbol{p}^{(\ell% )}+\boldsymbol{r}^{(L-1)}\right)=\frac{1}{2}\boldsymbol{t}^{\top}\sum_{\ell^{% \prime}=0}^{L-2}\sum_{\ell=0}^{L-\ell-1}\left(\frac{1}{2}\left(\mathbf{I}+% \mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)\right)^{\ell% }\boldsymbol{r}^{(\ell^{\prime})}$
	$\displaystyle=\frac{1}{2}\boldsymbol{t}^{\top}\sum_{\ell=0}^{L-1}\left(\frac{1% }{2}\left(\mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top% }\right)\right)^{\ell}\left(\sum_{\ell^{\prime}=0}^{\min(L-\ell-1,L-2)}% \boldsymbol{r}^{(\ell^{\prime})}\right).$

So, instead of directly using $\frac{1}{2}\boldsymbol{t}^{\top}\left(\sum_{\ell=0}^{L-1}\boldsymbol{p}^{(\ell% )}+\boldsymbol{r}^{(L-1)}\right)$ as an estimate of $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}$ , we estimate the right-hand side of the above equation using random-walk samplings from $|\boldsymbol{t}|/\|\boldsymbol{t}\|_{1}$ to reduce approximation error. We employ the same sampling scheme as described in Section 4 to obtain a walk length $\ell$ and the coordinate $v$ reached by the random walk after $\ell$ steps, but each sampled value now involves the summation $\sum_{\ell^{\prime}=0}^{\min(L-\ell-1,L-2)}\boldsymbol{r}^{(\ell^{\prime})}(v)$ . We take the average of the estimates across $n_{\mathrm{s}}$ independent samples and add it to $\frac{1}{2}\boldsymbol{t}^{\top}\left(\sum_{\ell=0}^{L-1}\boldsymbol{p}^{(\ell% )}+\boldsymbol{r}^{(L-1)}\right)$ to obtain the final estimate $\hat{x}$ .

We note that, compared to the sampling scheme in [14], our approach eliminates the need for using additional data structures to maintain the prefix sums of residues, since we can directly compute $\sum_{\ell^{\prime}=0}^{\min(L-\ell-1,L-2)}\boldsymbol{r}^{(\ell^{\prime})}(v)$ in $O(L)$ time per sample without increasing the asymptotic complexity.

Algorithm 2 bidirectional method for RDD systems.

The next lemma establishes the unbiasedness of the bidirectional estimator.

Lemma 20.

The sum of $\frac{1}{2}\boldsymbol{t}^{\top}\left(\sum_{\ell=0}^{L-1}\boldsymbol{p}^{(\ell% )}+\boldsymbol{r}^{(L-1)}\right)$ and each sampled value in the bidirectional method described above gives an unbiased estimate of $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}$ .

Proof.

Following the proof of Lemma 14, we can show that the expectation of each sampled value is $\frac{1}{2}\boldsymbol{t}^{\top}\sum_{\ell=0}^{L-1}\left(\frac{1}{2}\left(% \mathbf{I}+\mathbf{D}_{\mathbf{M}}^{-1}\mathbf{A}_{\mathbf{M}}^{\top}\right)% \right)^{\ell}\left(\sum_{\ell^{\prime}=0}^{\min(L-\ell-1,L-2)}\boldsymbol{r}^% {(\ell^{\prime})}\right)$ . Combining this with the invariant equation in Lemma 15 completes the proof. $\hfill\blacktriangleleft$

Next, we prove Theorem 8 by proving the two stated complexity bounds separately in the following two lemmas. Their proofs are partly inspired by [14] and [52], respectively. The proof of Lemma 22 uses variance analysis and is given in the full version of this paper [28].

Lemma 21.

Suppose the same assumptions as in Theorem 8. Then there exists a randomized algorithm that computes a $\hat{x}$ such that $\Pr\left\{\left|\hat{x}-\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}\right|\leq% \varepsilon\right\}\geq\frac{3}{4}$ in time $O\big(\|\boldsymbol{b}\|_{0}\big)$ plus

\displaystyle O\left(f_{\mathrm{row}}(\mathbf{M})^{1/3}\|\boldsymbol{t}\|_{1}^% {2/3}\|\boldsymbol{b}\|_{1}^{2/3}L^{7/3}\varepsilon^{-2/3}\right).

Proof.

We use the bidirectional method as in Algorithm 2. Note that each sampled value equals $\sigma\cdot\frac{1}{2}\|\boldsymbol{t}\|_{1}L\sum_{\ell^{\prime}=0}^{\min(L-% \ell-1,L-2)}\boldsymbol{r}^{(\ell^{\prime})}(v)$ for some $\sigma\in\{0,\pm 1\}$ and $v\in[n]$ and the Push algorithm ensures that $\left\|\boldsymbol{r}^{(\ell^{\prime})}\right\|_{\infty}\leq r_{\max}$ for each $\ell^{\prime}\in[0,L-2]$ . Thus, the absolute value of each sampled value is at most $\frac{1}{2}\|\boldsymbol{t}\|_{1}L^{2}\cdot r_{\max}$ . Using the Hoeffding bound, it follows that

\displaystyle\phantom{{}={}}\Pr\left\{\left|\hat{x}-\boldsymbol{t}^{\top}% \boldsymbol{x}^{\ast}_{L}\right|\geq\frac{1}{2}\varepsilon\right\}\leq 2\exp% \left(-\frac{2n_{\mathrm{s}}(\frac{1}{2}\varepsilon)^{2}}{(\|\boldsymbol{t}\|_% {1}L^{2}\cdot r_{\max})^{2}}\right)=2\exp\left(-\frac{n_{\mathrm{s}}\cdot% \varepsilon^{2}}{2\|\boldsymbol{t}\|_{1}^{2}L^{4}\cdot r_{\max}^{2}}\right).

To guarantee that this probability is at most $1/4$ , we set $n_{\mathrm{s}}:=\Theta\left(\|\boldsymbol{t}\|_{1}^{2}L^{4}\cdot r_{\max}^{2}/% \varepsilon^{2}\right)$ , where $r_{\max}$ will be determined shortly.

By Lemma 19, the push cost is $O\left(\|\boldsymbol{b}\|_{1}L/r_{\max}\right)$ . We set $r_{\max}:=\frac{\varepsilon^{2/3}\|\boldsymbol{b}\|_{1}^{1/3}}{f_{\mathrm{row}% }(\mathbf{M})^{1/3}\|\boldsymbol{t}\|_{1}^{2/3}L^{4/3}}$ , where $\|\boldsymbol{b}\|_{1}$ can be computed in $O(\|\boldsymbol{b}\|_{0})$ time given our assumptions. Consequently, the cost for random-walk sampling and push both becomes $O\left(f_{\mathrm{row}}(\mathbf{M})^{1/3}\|\boldsymbol{t}\|_{1}^{2/3}\|% \boldsymbol{b}\|_{1}^{2/3}L^{7/3}\varepsilon^{-2/3}\right)$ , completing the proof. $\hfill\blacktriangleleft$

Lemma 22.

Suppose the same assumptions as in Theorem 8. Then there exists a randomized algorithm that computes a $\hat{x}$ such that $\Pr\left\{\left|\hat{x}-\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}\right|\leq% \varepsilon\right\}\geq\frac{3}{4}$ in time $O\big(\|\boldsymbol{b}\|_{0}\big)$ plus

\displaystyle O\left(f_{\mathrm{row}}(\mathbf{M})^{1/2}\|\boldsymbol{t}\|_{1}% \|\boldsymbol{b}\|_{1}^{1/2}\left\|\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}% \right\|_{\infty}^{1/2}L^{5/2}\varepsilon^{-1}\right).

Proof of Theorem 8.

The theorem follows from Lemmas 21 and 22. $\hfill\blacktriangleleft$

7 Connections with PageRank Computation

This section discusses the connections between our framework and PageRank computation and presents proofs for Theorems 9 and 11.

As mentioned in Section 1, the PPR equations (1) and (2) and PageRank contribution equations (3) and (4) can be formulated as RDD/CDD systems. For example, by Equations (1) and (3), for a node $t\in V$ , setting $\mathbf{M}=\mathbf{I}-(1-\alpha)\mathbf{A}_{G}^{\top}\mathbf{D}_{G}^{-1},% \boldsymbol{b}=\frac{\alpha}{n}\boldsymbol{1},\boldsymbol{t}=\boldsymbol{e}_{t}$ or $\mathbf{M}=\mathbf{I}-(1-\alpha)\mathbf{D}_{G}^{-1}\mathbf{A}_{G},\boldsymbol{% b}=\alpha\boldsymbol{e}_{t},\boldsymbol{t}=\frac{1}{n}\boldsymbol{1}$ in our formulation both yield $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}=\boldsymbol{\pi}_{G,\alpha}(t)$ ; by Equation (2), for nodes $s,t\in V$ , setting $\mathbf{M}=\mathbf{D}_{G}-(1-\alpha)\mathbf{A}_{G}^{\top},\boldsymbol{b}=% \alpha\boldsymbol{e}_{s},\boldsymbol{t}=\boldsymbol{e}_{t}$ yields $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}=\boldsymbol{\pi}_{G,\alpha}(s,t)/d^% {+}_{G}(t)$ . It is worth noting that on Eulerian graphs, Equations (2) and (4) are RCDD systems, and on undirected graphs, they are SDD systems.

By the definition of the $p$ -norm gaps, we have

	$\displaystyle\phantom{{}={}}\gamma_{1}\left(\mathbf{I}-(1-\alpha)\mathbf{A}_{G% }^{\top}\mathbf{D}_{G}^{-1}\right)=1-\left\\|\left.\frac{1}{2}\left(\mathbf{I}+% (1-\alpha)\mathbf{A}_{G}^{\top}\mathbf{D}_{G}^{-1}\right)\right\|_{% \operatorname{range}\left(\mathbf{I}-(1-\alpha)\mathbf{A}_{G}^{\top}\mathbf{D}% _{G}^{-1}\right)}\right\\|_{1}$
	$\displaystyle=1-\left\\|\frac{1}{2}\left(\mathbf{I}+(1-\alpha)\mathbf{A}_{G}^{% \top}\mathbf{D}_{G}^{-1}\right)\right\\|_{1}=1-\frac{1}{2}\big(1+(1-\alpha)\big% )=\frac{1}{2}\alpha,$

where we used $\operatorname{range}\left(\mathbf{I}-(1-\alpha)\mathbf{A}_{G}^{\top}\mathbf{D}% _{G}^{-1}\right)=\mathbb{R}^{n}$ since $\mathbf{I}-(1-\alpha)\mathbf{A}_{G}^{\top}\mathbf{D}_{G}^{-1}$ is invertible. Similarly, we have $\gamma_{1}\left(\mathbf{D}_{G}-(1-\alpha)\mathbf{A}_{G}^{\top}\right)=\gamma_{% \infty}\left(\mathbf{I}-(1-\alpha)\mathbf{D}_{G}^{-1}\mathbf{A}_{G}\right)=% \gamma_{\infty}\left(\mathbf{D}_{G}-(1-\alpha)\mathbf{A}_{G}\right)=\frac{1}{2}\alpha$ . Thus, $\frac{1}{2}\alpha$ serves as a lower bound on the maximum $p$ -norm gap of all these matrices involved in the PPR and PageRank contribution equations.

7.1 Results for PageRank Computation when $\mathbf{D}_{G}-(1-\alpha)\mathbf{A}_{G}^{\top}$ is RCDD

Our framework provides new insights and results for PageRank computation, in particular when the involved system is RCDD. Theorem 9 stated in the introduction is one such example, which shows that previous results for single-node PageRank computation on undirected graphs can be improved and generalized to Eulerian graphs.

To prove Theorem 9, we investigate the case when the matrix $\mathbf{D}_{G}-(1-\alpha)\mathbf{A}_{G}^{\top}$ in Equation (2) is RCDD. This matrix is CDD, and it is also RDD if $d^{+}_{G}(v)\geq(1-\alpha)d^{-}_{G}(v)$ holds for all $v\in V$ . In particular, this condition holds when $G$ is Eulerian or $\alpha$ is large enough. Now, by applying Theorem 6, we directly obtain the following result.

Theorem 23.

For any unweighted graph $G$ and decay factor $\alpha$ , suppose that $d^{+}_{G}(v)\geq(1-\alpha)d^{-}_{G}(v)$ holds for all $v\in V$ . Then there exists a randomized algorithm that, given $t\in V$ , $\delta^{+}_{G}$ , and accuracy parameter $\varepsilon$ , computes an estimate of $\boldsymbol{\pi}_{G,\alpha}(t)$ within relative error $\varepsilon$ with success probability at least $3/4$ in time

\displaystyle\widetilde{O}\left(\frac{1}{\alpha\varepsilon^{2}}\cdot\frac{d^{+% }_{G}(t)}{\delta^{+}_{G}}\cdot\frac{1}{n\boldsymbol{\pi}_{G,\alpha}(t)}\right),

where $\widetilde{O}$ hides $\operatorname{polylog}\left(\frac{n}{\alpha\varepsilon}\right)$ factors.

Proof.

Consider applying Theorem 6 to Equation (2) with $\boldsymbol{s}=\frac{1}{n}\boldsymbol{1}$ and $\boldsymbol{t}=\boldsymbol{e}_{t}$ . Note that the corresponding $\left\|\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}\right\|_{\infty}$ equals $\frac{\alpha}{n\delta^{+}_{G}}$ , $L=\widetilde{O}(1/\alpha)$ , $f_{\mathrm{row}}(\mathbf{M})=O(1)$ , $\|\boldsymbol{t}\|_{1}=1$ , and the obtained $(1\pm\varepsilon)$ -multiplicative approximation of $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}=\boldsymbol{\pi}_{G,\alpha}(t)/d^{+% }_{G}(t)$ directly yields a $(1\pm\varepsilon)$ -multiplicative approximation of $\boldsymbol{\pi}_{G,\alpha}(t)$ . Therefore, the time complexity is

\displaystyle\widetilde{O}\left(\frac{\alpha/(n\delta^{+}_{G})}{\alpha^{2}% \varepsilon^{2}\cdot\boldsymbol{\pi}_{G,\alpha}(t)/d^{+}_{G}(t)}\right)=% \widetilde{O}\left(\frac{1}{\alpha\varepsilon^{2}}\cdot\frac{d^{+}_{G}(t)}{% \delta^{+}_{G}}\cdot\frac{1}{n\boldsymbol{\pi}_{G,\alpha}(t)}\right),

as desired. $\hfill\blacktriangleleft$

To prove Theorem 9, we establish some lower bounds on PageRank values in the next lemma, which may be of independent interest. This lemma is partly inspired by [8, Lemma 5.13], [46, 45], and [47, Theorem 1.1], and we give its proof in the full version of this paper [28].

Lemma 24.

For any weighted directed graph $G$ and $t\in V$ , we have

\displaystyle\boldsymbol{\pi}_{G,\alpha}(t)\geq\max\left(\frac{\alpha}{n},% \frac{\alpha(1-\alpha)d^{-}_{G}(t)}{n\Delta^{+}_{G}},\frac{\alpha(1-\alpha)d^{% -}_{G}(t)^{2}}{n\big\|\mathbf{A}_{G}(\cdot,t)\big\|_{\infty}\|\mathbf{A}_{G}\|% _{1,1}},\frac{\alpha(1-\alpha)d^{-}_{G}(t)^{2}}{n\sqrt{n}\big\|\mathbf{A}_{G}(% \cdot,t)\big\|_{2}\|\mathbf{A}_{G}\|_{\mathrm{F}}}\right),

where $\|\mathbf{A}_{G}\|_{1,1}:=\sum_{u,v\in[n]}\big|\mathbf{A}_{G}(u,v)\big|$ and $\|\mathbf{A}_{G}\|_{\mathrm{F}}:=\sqrt{\sum_{u,v\in V}\mathbf{A}_{G}(u,v)^{2}}$ denote the entrywise $1$ -norm and the Frobenius norm, respectively. If $G$ is Eulerian, we further have $\boldsymbol{\pi}_{G,\alpha}(t)\geq\frac{d_{G}(t)}{n\Delta_{G}}$ ; if $G$ is unweighted Eulerian, we further have $\boldsymbol{\pi}_{G,\alpha}(t)\geq\frac{\sqrt{1-\alpha}\cdot d_{G}(t)}{n\sqrt{% m}}$ .

Proof of Theorem 9.

Theorem 23 gives the complexity bound $\widetilde{O}\left(\frac{1}{\alpha\varepsilon^{2}}\cdot\frac{d_{G}(t)}{\delta_% {G}}\cdot\frac{1}{n\boldsymbol{\pi}_{G,\alpha}(t)}\right)$ . By Lemma 24, on unweighted Eulerian graphs, we have

	$\displaystyle\boldsymbol{\pi}_{G,\alpha}(t)$	$\displaystyle\geq\max\left(\frac{\alpha}{n},\frac{\alpha(1-\alpha)d^{-}_{G}(t)% ^{2}}{n\big\\|\mathbf{A}_{G}(\cdot,t)\big\\|_{\infty}\\|\mathbf{A}_{G}\\|_{1,1}},% \frac{d_{G}(t)}{n\Delta_{G}},\frac{\sqrt{1-\alpha}\cdot d_{G}(t)}{n\sqrt{m}}\right)$
		$\displaystyle=\max\left(\frac{\alpha}{n},\frac{\alpha(1-\alpha)d_{G}(t)^{2}}{% nm},\frac{d_{G}(t)}{n\Delta_{G}},\frac{\sqrt{1-\alpha}\cdot d_{G}(t)}{n\sqrt{m% }}\right).$

By plugging these lower bounds on $\boldsymbol{\pi}_{G,\alpha}(t)$ into the complexity bound, we obtain the desired results up to $\operatorname{polylog}\left(\frac{n}{\alpha\varepsilon}\right)$ factors (where we omit the terms of $1/(1-\alpha)$ since we often consider the case when $\alpha\to 0$ ). These $\operatorname{polylog}\left(\frac{n}{\alpha\varepsilon}\right)$ factors can be removed by using non-truncated random walks for sampling (cf. [45]), leading to the stated complexity bounds. $\hfill\blacktriangleleft$

7.2 A Lower Bound on the Accuracy Parameter for SDD Solvers

This subsection proves Theorem 11. To this end, we establish the following reduction from single-node PageRank computation on undirected graphs to solving SDD systems.

Lemma 25.

Suppose that there exists a randomized algorithm that computes an estimate $\hat{x}_{t}$ such that $\Pr\left\{\big|\hat{x}_{t}-\boldsymbol{x}^{\ast}(t)\big|\leq\varepsilon\|% \boldsymbol{x}^{\ast}\|_{\infty}\right\}\geq\frac{3}{4}$ for any SDD system $\mathbf{S}\boldsymbol{x}=\boldsymbol{b}$ in $O\left(\gamma^{-\nu}\varepsilon^{-\tau}\right)$ time. Then there exists a randomized algorithm that, given $\delta_{G}$ , estimates $\boldsymbol{\pi}_{G,\alpha}(t)$ on unweighted undirected graphs $G$ within constant relative error and with success probability at least $3/4$ in time $O\left(\big(d_{G}(t)/\delta_{G}\big)^{\tau}/\alpha^{\nu+\tau}\right)$ .

To prove this lemma, we use the following upper bound on $\boldsymbol{\pi}_{G,\alpha}(t)$ on Eulerian graphs, whose proof is given in the full version of this paper [28].

Lemma 26.

On any Eulerian graph $G$ and $v\in V$ , we have $\boldsymbol{\pi}_{G,\alpha}(v)\leq\frac{d_{G}(v)}{n\delta_{G}}$ .

Proof of Lemma 25.

Consider the PageRank equation (2) with $\boldsymbol{s}=1/n\cdot\boldsymbol{1}$ . When $G$ is undirected, the matrix $\mathbf{D}_{G}-(1-\alpha)\mathbf{A}_{G}^{\top}$ is SDD. By setting $\gamma:=\alpha/2$ and $\varepsilon:=\Theta\big(\alpha\delta_{G}/d_{G}(t)\big)$ , the supposed algorithm can compute an estimate $\hat{x}_{t}$ such that $\left|\hat{x}_{t}-\frac{\boldsymbol{\pi}_{G,\alpha}(t)}{d_{G}(t)}\right|\leq% \varepsilon\cdot\max_{v\in V}\left\{\frac{\boldsymbol{\pi}_{G,\alpha}(v)}{d_{G% }(v)}\right\}$ w.p. at least $3/4$ in time $O\left(\gamma^{-\nu}\varepsilon^{-\tau}\right)=O\left(\big(d_{G}(t)/\delta_{G}% \big)^{\tau}/\alpha^{\nu+\tau}\right)$ . Using Lemma 26 and $\boldsymbol{\pi}_{G,\alpha}(t)\geq\alpha/n$ , we have $\max_{v\in V}\left\{\frac{\boldsymbol{\pi}_{G,\alpha}(v)}{d_{G}(v)}\right\}% \leq\frac{1}{n\delta_{G}}\leq\frac{\boldsymbol{\pi}_{G,\alpha}(t)}{\alpha% \delta_{G}}$ . Thus, with probability at least $3/4$ , $\big|d_{G}(t)\cdot\hat{x}_{t}-\boldsymbol{\pi}_{G,\alpha}(t)\big|\leq% \varepsilon\cdot d_{G}(t)\cdot\frac{\boldsymbol{\pi}_{G,\alpha}(t)}{\alpha% \delta_{G}}=\Theta\big(\boldsymbol{\pi}_{G,\alpha}(t)\big)$ , so $d_{G}(t)\cdot\hat{x}_{t}$ is an estimate of $\boldsymbol{\pi}_{G,\alpha}(t)$ within constant relative error, completing the proof. $\hfill\blacktriangleleft$

Proof of Theorem 11.

[45] establishes a complexity lower bound of $\Omega\big(d_{G}(t)/\delta_{G}\big)$ for estimating $\boldsymbol{\pi}_{G,\alpha}(t)$ within constant relative error with constant success probability on unweighted undirected graphs, where $\alpha$ is constant and the bound holds for any possible combination of $\delta_{G}$ and $d_{G}(t)$ . This lower bound applies to the number of queries to the graph structure. Therefore, combining this lower bound with Lemma 25 and noting that the reduction uses $\varepsilon:=\Theta\big(\alpha\delta_{G}/d_{G}(t)\big)=\Omega(1/n)$ yield the desired lower bound of $\Omega(1/\varepsilon)$ for $\varepsilon=\Omega(1/n)$ . $\hfill\blacktriangleleft$

8 Connections with Effective Resistance Computation

This section justifies the relationship between our framework and effective resistance computation on graphs in Lemma 27 and proves Corollary 10.

Recall that in the context of computing effective resistances, we assume that $G$ is undirected and connected. In our framework, we set $\mathbf{M}=\mathbf{L}_{G}$ and $\boldsymbol{b}=\boldsymbol{t}=\boldsymbol{e}_{s}-\boldsymbol{e}_{t}$ . By Theorem 2, $\gamma_{\max}(\mathbf{L}_{G})=\gamma(\mathbf{L}_{G})$ , so a lower bound $\gamma$ on the spectral gap $\gamma(\mathbf{L}_{G})$ serves as a lower bound on the maximum $p$ -norm gap $\gamma_{\max}(\mathbf{L}_{G})$ . The following lemma states that with this setting, the quantity $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}$ that our algorithms approximate equals the effective resistance $R_{G}(s,t)$ .

Lemma 27.

When $\mathbf{M}=\mathbf{L}_{G}$ and $\boldsymbol{b}=\boldsymbol{t}=\boldsymbol{e}_{s}-\boldsymbol{e}_{t}$ , we have $\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}=R_{G}(s,t)$ .

Lemma 27 can be proved using the results in [7], and we provide a different self-contained proof in the full version of this paper [28].

Now we can directly apply Theorems 4 and 8 to prove Corollary 10. The only remaining detail in the proof is to derive a better setting of $L$ for the case $\boldsymbol{b}=\boldsymbol{t}=\boldsymbol{e}_{s}-\boldsymbol{e}_{t}$ .

Proof of Corollary 10.

Following the proof of Theorem 3, we have $\left|\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}-\boldsymbol{t}^{\top}% \boldsymbol{x}^{\ast}\right|\leq\frac{1}{2\gamma}\cdot e^{-\gamma L}\cdot\left% \|\mathbf{D}_{\mathbf{M}}^{-1/2}\boldsymbol{t}\right\|_{2}\left\|\mathbf{D}_{% \mathbf{M}}^{-1/2}\boldsymbol{b}\right\|_{2}=\frac{1}{2\gamma}\cdot e^{-\gamma L% }\left(\frac{1}{d_{G}(s)}+\frac{1}{d_{G}(t)}\right)$ when $\mathbf{M}=\mathbf{L}_{G}$ and $\boldsymbol{b}=\boldsymbol{t}=\boldsymbol{e}_{s}-\boldsymbol{e}_{t}$ . Thus, setting $L:=\Theta\left(\frac{1}{\gamma}\log\left(\frac{1}{\gamma\varepsilon}\left(% \frac{1}{d_{G}(s)}+\frac{1}{d_{G}(t)}\right)\right)\right)$ ensures that $\left|\boldsymbol{t}^{\top}\boldsymbol{x}^{\ast}_{L}-\boldsymbol{t}^{\top}% \boldsymbol{x}^{\ast}\right|\leq\frac{1}{2}\varepsilon$ . The corollary then follows by applying Theorems 4 and 8 with this setting of $L$ and noting that $\left\|\mathbf{D}_{\mathbf{M}}^{-1}\boldsymbol{b}\right\|_{\infty}=1/\min\big(% d_{G}(s),d_{G}(t)\big)$ and $f_{\mathrm{row}}(\mathbf{M})=O(1)$ in this case. $\hfill\blacktriangleleft$

References

[1] Reid Andersen, Christian Borgs, Jennifer T. Chayes, John E. Hopcroft, Vahab S. Mirrokni, and Shang-Hua Teng. Local computation of PageRank contributions. Internet Mathematics, 5(1):23–45, 2008. doi:10.1080/15427951.2008.10129302.
[2] Reid Andersen, Fan R. K. Chung, and Kevin J. Lang. Using PageRank to locally partition a graph. Internet Mathematics, 4(1):35–64, 2007. doi:10.1080/15427951.2007.10129139.
[3] Alexandr Andoni, Robert Krauthgamer, and Yosef Pogrow. On solving linear systems in sublinear time. CoRR, abs/1809.02995, 2018. doi:10.48550/arXiv.1809.02995.
[4] Alexandr Andoni, Robert Krauthgamer, and Yosef Pogrow. On solving linear systems in sublinear time. In Proceedings of the 10th Innovations in Theoretical Computer Science Conference, volume 124, pages 3:1–3:19, 2019. doi:10.4230/LIPIcs.ITCS.2019.3.
[5] Siddhartha Banerjee and Peter Lofgren. Fast bidirectional probability estimation in Markov models. In Advances in Neural Information Processing Systems 28, pages 1423–1431, 2015. URL: https://proceedings.neurips.cc/paper/2015/hash/ede7e2b6d13a41ddf9f4bdef84fdc737-Abstract.html.
[6] Christian Bertram, Mads Vestergaard Jensen, Mikkel Thorup, Hanzhi Wang, and Shuyi Yan. Estimating random-walk probabilities in directed graphs. CoRR, abs/2504.16481, 2025. doi:10.48550/arXiv.2504.16481.
[7] Enrico Bozzo. The Moore-Penrose inverse of the normalized graph Laplacian. Linear Algebra and its Applications, 439(10):3038–3043, 2013. doi:10.1016/j.laa.2013.08.039.
[8] Marco Bressan, Enoch Peserico, and Luca Pretto. Sublinear algorithms for local graph-centrality estimation. SIAM Journal on Computing, 52(4):968–1008, 2023. doi:10.1137/19M1266976.
[9] Sergey Brin and Lawrence Page. The anatomy of a large-scale hypertextual web search engine. Computer Networks, 30(1-7):107–117, 1998. doi:10.1016/S0169-7552(98)00110-X.
[10] Dongrun Cai, Xue Chen, and Pan Peng. Effective resistances in non-expander graphs. In Proceedings of the 31st Annual European Symposium on Algorithms, volume 274, pages 29:1–29:18, 2023. doi:10.4230/LIPIcs.ESA.2023.29.
[11] Michael B. Cohen, Jonathan A. Kelner, Rasmus Kyng, John Peebles, Richard Peng, Anup B. Rao, and Aaron Sidford. Solving directed Laplacian systems in nearly-linear time through sparse LU factorizations. In Proceedings of the 59th IEEE Symposium on Foundations of Computer Science, pages 898–909, 2018. doi:10.1109/FOCS.2018.00089.
[12] Michael B. Cohen, Jonathan A. Kelner, John Peebles, Richard Peng, Anup B. Rao, Aaron Sidford, and Adrian Vladu. Almost-linear-time algorithms for Markov chains and new spectral primitives for directed graphs. In Proceedings of the 49th Annual ACM Symposium on Theory of Computing, pages 410–419, 2017. doi:10.1145/3055399.3055463.
[13] Michael B. Cohen, Jonathan A. Kelner, John Peebles, Richard Peng, Aaron Sidford, and Adrian Vladu. Faster algorithms for computing the stationary distribution, simulating random walks, and more. In Proceedings of the 57th IEEE Symposium on Foundations of Computer Science, pages 583–592, 2016. doi:10.1109/FOCS.2016.69.
[14] Guanyu Cui, Hanzhi Wang, and Zhewei Wei. Mixing time matters: Accelerating effective resistance estimation via bidirectional method. In Proceedings of the 31st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 177–188, 2025. doi:10.1145/3690624.3709298.
[15] Paul Dagum, Richard M. Karp, Michael Luby, and Sheldon M. Ross. An optimal algorithm for Monte Carlo estimation. SIAM Journal on Computing, 29(5):1484–1496, 2000. doi:10.1137/S0097539797315306.
[16] Dean Doron, François Le Gall, and Amnon Ta-Shma. Probabilistic logarithmic-space algorithms for Laplacian solvers. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, volume 81, pages 41:1–41:20, 2017. doi:10.4230/LIPIcs.APPROX-RANDOM.2017.41.
[17] Peter G. Doyle and J. Laurie Snell. Random walks and electric networks, volume 22. The Mathematical Association of America, 1984.
[18] Rajat Vadiraj Dwaraknath, Ishani Karmarkar, and Aaron Sidford. Towards optimal effective resistance estimation. In Advances in Neural Information Processing Systems 36, 2023. URL: http://papers.nips.cc/paper_files/paper/2023/hash/b8e2046160a568145af6d42eeef199f4-Abstract-Conference.html.
[19] Dániel Fogaras, Balázs Rácz, Károly Csalogány, and Tamás Sarlós. Towards scaling fully Personalized PageRank: Algorithms, lower bounds, and experiments. Internet Mathematics, 2(3):333–358, 2005. doi:10.1080/15427951.2005.10129104.
[20] George E. Forsythe and Richard A. Leibler. Matrix inversion by a Monte Carlo method. Mathematics of Computation, 4(31):127–129, 1950. URL: https://www.ams.org/journals/mcom/1950-04-031/S0025-5718-1950-0038138-X/home.html.
[21] David F. Gleich. Pagerank beyond the web. SIAM Review, 57(3):321–363, 2015. doi:10.1137/140976649.
[22] Oded Goldreich and Dana Ron. Property testing in bounded degree graphs. Algorithmica, 32(2):302–343, 2002. doi:10.1007/S00453-001-0078-7.
[23] Oded Goldreich and Dana Ron. On testing expansion in bounded-degree graphs. In Studies in Complexity and Cryptography, volume 6650, pages 68–75. Springer, 2011. doi:10.1007/978-3-642-22670-0_9.
[24] Aram W. Harrow, Avinatan Hassidim, and Seth Lloyd. Quantum algorithm for linear systems of equations. Physical Review Letters, 103(15):150502, 2009. doi:10.1103/PhysRevLett.103.150502.
[25] Arun Jambulapati, Sushant Sachdeva, Aaron Sidford, Kevin Tian, and Yibin Zhao. Eulerian graph sparsification by effective resistance decomposition. In Proceedings of the 2025 ACM-SIAM Symposium on Discrete Algorithms, pages 1607–1650, 2025. doi:10.1137/1.9781611978322.50.
[26] Arun Jambulapati and Aaron Sidford. Ultrasparse ultrasparsifiers and faster Laplacian system solvers. In Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms, pages 540–559, 2021. doi:10.1137/1.9781611976465.33.
[27] Satyen Kale, Yuval Peres, and C. Seshadhri. Noise tolerance of expanders and sublinear expansion reconstruction. SIAM Journal on Computing, 42(1):305–323, 2013. doi:10.1137/110837863.
[28] Tsz Chiu Kwok, Zhewei Wei, and Mingji Yang. On solving asymmetric diagonally dominant linear systems in sublinear time. CoRR, abs/2509.13891, 2025. doi:10.48550/arXiv.2509.13891.
[29] Amy Nicole Langville and Carl Dean Meyer. Survey: Deeper inside PageRank. Internet Mathematics, 1(3):335–380, 2003. doi:10.1080/15427951.2004.10129091.
[30] Lawrence Li and Sushant Sachdeva. A new approach to estimating effective resistances and counting spanning trees in expander graphs. In Proceedings of the 2023 ACM-SIAM Symposium on Discrete Algorithms, pages 2728–2745, 2023. doi:10.1137/1.9781611977554.ch102.
[31] Peter Lofgren, Siddhartha Banerjee, and Ashish Goel. Bidirectional PageRank estimation: from average-case to worst-case. In Proceedings of the 12th International Workshop on Algorithms and Models for the Web-Graph, volume 9479, pages 164–176, 2015. doi:10.1007/978-3-319-26784-5_13.
[32] Peter Lofgren, Siddhartha Banerjee, and Ashish Goel. Personalized PageRank estimation and search: a bidirectional approach. In Proceedings of the 9th ACM International Conference on Web Search and Data Mining, pages 163–172, 2016. doi:10.1145/2835776.2835823.
[33] Peter Lofgren, Siddhartha Banerjee, Ashish Goel, and Seshadhri Comandur. FAST-PPR: Scaling Personalized PageRank estimation for large graphs. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1436–1445, 2014. doi:10.1145/2623330.2623745.
[34] László Lovász. Random walks on graphs: a survey. Combinatorics, Paul Erdös is Eighty, 2(1-46):4, 1993.
[35] Pan Peng, Daniel Lopatta, Yuichi Yoshida, and Gramoz Goranci. Local algorithms for estimating effective resistance. In Proceedings of the 27th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1329–1338, 2021. doi:10.1145/3447548.3467361.
[36] Ronitt Rubinfeld, Gil Tamir, Shai Vardi, and Ning Xie. Fast local computation algorithms. In Proceedings of Innovations in Computer Science, pages 223–238, 2011. URL: http://conference.iiis.tsinghua.edu.cn/ICS2011/content/papers/36.html.
[37] Nitin Shyamkumar, Siddhartha Banerjee, and Peter Lofgren. Sublinear estimation of a single element in sparse linear systems. In Proceedings of the 54th Annual Allerton Conference on Communication, Control, and Computing, pages 856–860, 2016. doi:10.1109/ALLERTON.2016.7852323.
[38] Daniel A. Spielman and Nikhil Srivastava. Graph sparsification by effective resistances. SIAM Journal on Computing, 40(6):1913–1926, 2011. doi:10.1137/080734029.
[39] Daniel A. Spielman and Shang-Hua Teng. Nearly-linear time algorithms for graph partitioning, graph sparsification, and solving linear systems. In Proceedings of the 36th Annual ACM Symposium on Theory of Computing, pages 81–90, 2004. doi:10.1145/1007352.1007372.
[40] Daniel A. Spielman and Shang-Hua Teng. Nearly linear time algorithms for preconditioning and solving symmetric, diagonally dominant linear systems. SIAM Journal on Matrix Analysis and Applications, 35(3):835–885, 2014. doi:10.1137/090771430.
[41] Amnon Ta-Shma. Inverting well conditioned matrices in quantum logspace. In Proceedings of the 45th Annual ACM Symposium on Theory of Computing, pages 881–890, 2013. doi:10.1145/2488608.2488720.
[42] Shang-Hua Teng. The Laplacian Paradigm: Emerging algorithms for massive graphs. In Proceedings of the 7th Annual Conference on Theory and Applications of Models of Computation, volume 6108, pages 2–14, 2010. doi:10.1007/978-3-642-13562-0_2.
[43] Mikkel Thorup, Hanzhi Wang, Zhewei Wei, and Mingji Yang. PageRank centrality in directed graphs with bounded in-degree. In Proceedings of the 2026 ACM-SIAM Symposium on Discrete Algorithms, 2026. To appear. arXiv preprint at https://arxiv.org/abs/2508.01257. doi:10.48550/arXiv.2508.01257.
[44] Nisheeth K. Vishnoi. ${L}\mathbf{x}=\mathbf{b}$ . Foundations and Trends in Theoretical Computer Science, 8(1-2):1–141, 2013. doi:10.1561/0400000054.
[45] Hanzhi Wang. Revisiting local PageRank estimation on undirected graphs: Simple and optimal. In Proceedings of the 30th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 3036–3044, 2024. doi:10.1145/3637528.3671820.
[46] Hanzhi Wang and Zhewei Wei. Estimating single-node PageRank in $\tilde{O}\left(\min\big\{d_{t},\sqrt{m}\big\}\right)$ time. Proceedings of the VLDB Endowment, 16(11):2949–2961, 2023. doi:10.14778/3611479.3611500.
[47] Hanzhi Wang, Zhewei Wei, Ji-Rong Wen, and Mingji Yang. Revisiting local computation of PageRank: Simple and optimal. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, pages 911–922, 2024. doi:10.1145/3618260.3649661.
[48] Wolfgang R. Wasow. A note on the inversion of matrices by random walks. Mathematical Tables and Other Aids to Computation, 6(38):78–81, 1952. doi:10.2307/2002546.
[49] Zhewei Wei, Ji-Rong Wen, and Mingji Yang. Approximating single-source Personalized PageRank with absolute error guarantees. In Proceedings of the 27th International Conference on Database Theory, volume 290, pages 9:1–9:19, 2024. doi:10.4230/LIPIcs.ICDT.2024.9.
[50] Mingji Yang, Hanzhi Wang, Zhewei Wei, Sibo Wang, and Ji-Rong Wen. Efficient algorithms for Personalized PageRank computation: a survey. IEEE Transactions on Knowledge and Data Engineering, 36(9):4582–4602, 2024. doi:10.1109/TKDE.2024.3376000.
[51] Renchi Yang and Jing Tang. Efficient estimation of pairwise effective resistance. Proceedings of the ACM International Conference on Management of Data, 1(1):16:1–16:27, 2023. doi:10.1145/3588696.
[52] Yichun Yang, Rong-Hua Li, Meihao Liao, and Guoren Wang. Improved algorithms for effective resistance computation on graphs. In Proceedings of the 38th Conference on Learning Theory, volume 291, pages 5892–5920, 2025. URL: https://proceedings.mlr.press/v291/yichun25a.html.

[bib.bib1] [1] Reid Andersen, Christian Borgs, Jennifer T. Chayes, John E. Hopcroft, Vahab S. Mirrokni, and Shang-Hua Teng. Local computation of PageRank contributions. Internet Mathematics, 5(1):23–45, 2008. doi:10.1080/15427951.2008.10129302.

[bib.bib2] [2] Reid Andersen, Fan R. K. Chung, and Kevin J. Lang. Using PageRank to locally partition a graph. Internet Mathematics, 4(1):35–64, 2007. doi:10.1080/15427951.2007.10129139.

[bib.bib3] [3] Alexandr Andoni, Robert Krauthgamer, and Yosef Pogrow. On solving linear systems in sublinear time. CoRR, abs/1809.02995, 2018. doi:10.48550/arXiv.1809.02995.

[bib.bib4] [4] Alexandr Andoni, Robert Krauthgamer, and Yosef Pogrow. On solving linear systems in sublinear time. In Proceedings of the 10th Innovations in Theoretical Computer Science Conference, volume 124, pages 3:1–3:19, 2019. doi:10.4230/LIPIcs.ITCS.2019.3.

[bib.bib5] [5] Siddhartha Banerjee and Peter Lofgren. Fast bidirectional probability estimation in Markov models. In Advances in Neural Information Processing Systems 28, pages 1423–1431, 2015. URL: https://proceedings.neurips.cc/paper/2015/hash/ede7e2b6d13a41ddf9f4bdef84fdc737-Abstract.html.

[bib.bib6] [6] Christian Bertram, Mads Vestergaard Jensen, Mikkel Thorup, Hanzhi Wang, and Shuyi Yan. Estimating random-walk probabilities in directed graphs. CoRR, abs/2504.16481, 2025. doi:10.48550/arXiv.2504.16481.

[bib.bib7] [7] Enrico Bozzo. The Moore-Penrose inverse of the normalized graph Laplacian. Linear Algebra and its Applications, 439(10):3038–3043, 2013. doi:10.1016/j.laa.2013.08.039.

[bib.bib8] [8] Marco Bressan, Enoch Peserico, and Luca Pretto. Sublinear algorithms for local graph-centrality estimation. SIAM Journal on Computing, 52(4):968–1008, 2023. doi:10.1137/19M1266976.

[bib.bib9] [9] Sergey Brin and Lawrence Page. The anatomy of a large-scale hypertextual web search engine. Computer Networks, 30(1-7):107–117, 1998. doi:10.1016/S0169-7552(98)00110-X.

[bib.bib10] [10] Dongrun Cai, Xue Chen, and Pan Peng. Effective resistances in non-expander graphs. In Proceedings of the 31st Annual European Symposium on Algorithms, volume 274, pages 29:1–29:18, 2023. doi:10.4230/LIPIcs.ESA.2023.29.

[bib.bib11] [11] Michael B. Cohen, Jonathan A. Kelner, Rasmus Kyng, John Peebles, Richard Peng, Anup B. Rao, and Aaron Sidford. Solving directed Laplacian systems in nearly-linear time through sparse LU factorizations. In Proceedings of the 59th IEEE Symposium on Foundations of Computer Science, pages 898–909, 2018. doi:10.1109/FOCS.2018.00089.

[bib.bib12] [12] Michael B. Cohen, Jonathan A. Kelner, John Peebles, Richard Peng, Anup B. Rao, Aaron Sidford, and Adrian Vladu. Almost-linear-time algorithms for Markov chains and new spectral primitives for directed graphs. In Proceedings of the 49th Annual ACM Symposium on Theory of Computing, pages 410–419, 2017. doi:10.1145/3055399.3055463.

[bib.bib13] [13] Michael B. Cohen, Jonathan A. Kelner, John Peebles, Richard Peng, Aaron Sidford, and Adrian Vladu. Faster algorithms for computing the stationary distribution, simulating random walks, and more. In Proceedings of the 57th IEEE Symposium on Foundations of Computer Science, pages 583–592, 2016. doi:10.1109/FOCS.2016.69.

[bib.bib14] [14] Guanyu Cui, Hanzhi Wang, and Zhewei Wei. Mixing time matters: Accelerating effective resistance estimation via bidirectional method. In Proceedings of the 31st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 177–188, 2025. doi:10.1145/3690624.3709298.

[bib.bib15] [15] Paul Dagum, Richard M. Karp, Michael Luby, and Sheldon M. Ross. An optimal algorithm for Monte Carlo estimation. SIAM Journal on Computing, 29(5):1484–1496, 2000. doi:10.1137/S0097539797315306.

[bib.bib16] [16] Dean Doron, François Le Gall, and Amnon Ta-Shma. Probabilistic logarithmic-space algorithms for Laplacian solvers. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, volume 81, pages 41:1–41:20, 2017. doi:10.4230/LIPIcs.APPROX-RANDOM.2017.41.

[bib.bib17] [17] Peter G. Doyle and J. Laurie Snell. Random walks and electric networks, volume 22. The Mathematical Association of America, 1984.

[bib.bib18] [18] Rajat Vadiraj Dwaraknath, Ishani Karmarkar, and Aaron Sidford. Towards optimal effective resistance estimation. In Advances in Neural Information Processing Systems 36, 2023. URL: http://papers.nips.cc/paper_files/paper/2023/hash/b8e2046160a568145af6d42eeef199f4-Abstract-Conference.html.

[bib.bib19] [19] Dániel Fogaras, Balázs Rácz, Károly Csalogány, and Tamás Sarlós. Towards scaling fully Personalized PageRank: Algorithms, lower bounds, and experiments. Internet Mathematics, 2(3):333–358, 2005. doi:10.1080/15427951.2005.10129104.

[bib.bib20] [20] George E. Forsythe and Richard A. Leibler. Matrix inversion by a Monte Carlo method. Mathematics of Computation, 4(31):127–129, 1950. URL: https://www.ams.org/journals/mcom/1950-04-031/S0025-5718-1950-0038138-X/home.html.

[bib.bib21] [21] David F. Gleich. Pagerank beyond the web. SIAM Review, 57(3):321–363, 2015. doi:10.1137/140976649.

[bib.bib22] [22] Oded Goldreich and Dana Ron. Property testing in bounded degree graphs. Algorithmica, 32(2):302–343, 2002. doi:10.1007/S00453-001-0078-7.

[bib.bib23] [23] Oded Goldreich and Dana Ron. On testing expansion in bounded-degree graphs. In Studies in Complexity and Cryptography, volume 6650, pages 68–75. Springer, 2011. doi:10.1007/978-3-642-22670-0_9.

[bib.bib24] [24] Aram W. Harrow, Avinatan Hassidim, and Seth Lloyd. Quantum algorithm for linear systems of equations. Physical Review Letters, 103(15):150502, 2009. doi:10.1103/PhysRevLett.103.150502.

[bib.bib25] [25] Arun Jambulapati, Sushant Sachdeva, Aaron Sidford, Kevin Tian, and Yibin Zhao. Eulerian graph sparsification by effective resistance decomposition. In Proceedings of the 2025 ACM-SIAM Symposium on Discrete Algorithms, pages 1607–1650, 2025. doi:10.1137/1.9781611978322.50.

[bib.bib26] [26] Arun Jambulapati and Aaron Sidford. Ultrasparse ultrasparsifiers and faster Laplacian system solvers. In Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms, pages 540–559, 2021. doi:10.1137/1.9781611976465.33.

[bib.bib27] [27] Satyen Kale, Yuval Peres, and C. Seshadhri. Noise tolerance of expanders and sublinear expansion reconstruction. SIAM Journal on Computing, 42(1):305–323, 2013. doi:10.1137/110837863.

[bib.bib28] [28] Tsz Chiu Kwok, Zhewei Wei, and Mingji Yang. On solving asymmetric diagonally dominant linear systems in sublinear time. CoRR, abs/2509.13891, 2025. doi:10.48550/arXiv.2509.13891.

[bib.bib29] [29] Amy Nicole Langville and Carl Dean Meyer. Survey: Deeper inside PageRank. Internet Mathematics, 1(3):335–380, 2003. doi:10.1080/15427951.2004.10129091.

[bib.bib30] [30] Lawrence Li and Sushant Sachdeva. A new approach to estimating effective resistances and counting spanning trees in expander graphs. In Proceedings of the 2023 ACM-SIAM Symposium on Discrete Algorithms, pages 2728–2745, 2023. doi:10.1137/1.9781611977554.ch102.

[bib.bib31] [31] Peter Lofgren, Siddhartha Banerjee, and Ashish Goel. Bidirectional PageRank estimation: from average-case to worst-case. In Proceedings of the 12th International Workshop on Algorithms and Models for the Web-Graph, volume 9479, pages 164–176, 2015. doi:10.1007/978-3-319-26784-5_13.

[bib.bib32] [32] Peter Lofgren, Siddhartha Banerjee, and Ashish Goel. Personalized PageRank estimation and search: a bidirectional approach. In Proceedings of the 9th ACM International Conference on Web Search and Data Mining, pages 163–172, 2016. doi:10.1145/2835776.2835823.

[bib.bib33] [33] Peter Lofgren, Siddhartha Banerjee, Ashish Goel, and Seshadhri Comandur. FAST-PPR: Scaling Personalized PageRank estimation for large graphs. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1436–1445, 2014. doi:10.1145/2623330.2623745.

[bib.bib34] [34] László Lovász. Random walks on graphs: a survey. Combinatorics, Paul Erdös is Eighty, 2(1-46):4, 1993.

[bib.bib35] [35] Pan Peng, Daniel Lopatta, Yuichi Yoshida, and Gramoz Goranci. Local algorithms for estimating effective resistance. In Proceedings of the 27th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1329–1338, 2021. doi:10.1145/3447548.3467361.

[bib.bib36] [36] Ronitt Rubinfeld, Gil Tamir, Shai Vardi, and Ning Xie. Fast local computation algorithms. In Proceedings of Innovations in Computer Science, pages 223–238, 2011. URL: http://conference.iiis.tsinghua.edu.cn/ICS2011/content/papers/36.html.

[bib.bib37] [37] Nitin Shyamkumar, Siddhartha Banerjee, and Peter Lofgren. Sublinear estimation of a single element in sparse linear systems. In Proceedings of the 54th Annual Allerton Conference on Communication, Control, and Computing, pages 856–860, 2016. doi:10.1109/ALLERTON.2016.7852323.

[bib.bib38] [38] Daniel A. Spielman and Nikhil Srivastava. Graph sparsification by effective resistances. SIAM Journal on Computing, 40(6):1913–1926, 2011. doi:10.1137/080734029.

[bib.bib39] [39] Daniel A. Spielman and Shang-Hua Teng. Nearly-linear time algorithms for graph partitioning, graph sparsification, and solving linear systems. In Proceedings of the 36th Annual ACM Symposium on Theory of Computing, pages 81–90, 2004. doi:10.1145/1007352.1007372.

[bib.bib40] [40] Daniel A. Spielman and Shang-Hua Teng. Nearly linear time algorithms for preconditioning and solving symmetric, diagonally dominant linear systems. SIAM Journal on Matrix Analysis and Applications, 35(3):835–885, 2014. doi:10.1137/090771430.

[bib.bib41] [41] Amnon Ta-Shma. Inverting well conditioned matrices in quantum logspace. In Proceedings of the 45th Annual ACM Symposium on Theory of Computing, pages 881–890, 2013. doi:10.1145/2488608.2488720.

[bib.bib42] [42] Shang-Hua Teng. The Laplacian Paradigm: Emerging algorithms for massive graphs. In Proceedings of the 7th Annual Conference on Theory and Applications of Models of Computation, volume 6108, pages 2–14, 2010. doi:10.1007/978-3-642-13562-0_2.

[bib.bib43] [43] Mikkel Thorup, Hanzhi Wang, Zhewei Wei, and Mingji Yang. PageRank centrality in directed graphs with bounded in-degree. In Proceedings of the 2026 ACM-SIAM Symposium on Discrete Algorithms, 2026. To appear. arXiv preprint at https://arxiv.org/abs/2508.01257. doi:10.48550/arXiv.2508.01257.

[bib.bib44] [44] Nisheeth K. Vishnoi. ${L}\mathbf{x}=\mathbf{b}$ . Foundations and Trends in Theoretical Computer Science, 8(1-2):1–141, 2013. doi:10.1561/0400000054.

[bib.bib45] [45] Hanzhi Wang. Revisiting local PageRank estimation on undirected graphs: Simple and optimal. In Proceedings of the 30th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 3036–3044, 2024. doi:10.1145/3637528.3671820.

[bib.bib46] [46] Hanzhi Wang and Zhewei Wei. Estimating single-node PageRank in $\tilde{O}\left(\min\big\{d_{t},\sqrt{m}\big\}\right)$ time. Proceedings of the VLDB Endowment, 16(11):2949–2961, 2023. doi:10.14778/3611479.3611500.

[bib.bib47] [47] Hanzhi Wang, Zhewei Wei, Ji-Rong Wen, and Mingji Yang. Revisiting local computation of PageRank: Simple and optimal. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, pages 911–922, 2024. doi:10.1145/3618260.3649661.

[bib.bib48] [48] Wolfgang R. Wasow. A note on the inversion of matrices by random walks. Mathematical Tables and Other Aids to Computation, 6(38):78–81, 1952. doi:10.2307/2002546.

[bib.bib49] [49] Zhewei Wei, Ji-Rong Wen, and Mingji Yang. Approximating single-source Personalized PageRank with absolute error guarantees. In Proceedings of the 27th International Conference on Database Theory, volume 290, pages 9:1–9:19, 2024. doi:10.4230/LIPIcs.ICDT.2024.9.

[bib.bib50] [50] Mingji Yang, Hanzhi Wang, Zhewei Wei, Sibo Wang, and Ji-Rong Wen. Efficient algorithms for Personalized PageRank computation: a survey. IEEE Transactions on Knowledge and Data Engineering, 36(9):4582–4602, 2024. doi:10.1109/TKDE.2024.3376000.

[bib.bib51] [51] Renchi Yang and Jing Tang. Efficient estimation of pairwise effective resistance. Proceedings of the ACM International Conference on Management of Data, 1(1):16:1–16:27, 2023. doi:10.1145/3588696.

[bib.bib52] [52] Yichun Yang, Rong-Hua Li, Meihao Liao, and Guoren Wang. Improved algorithms for effective resistance computation on graphs. In Proceedings of the 38th Conference on Learning Theory, volume 291, pages 5892–5920, 2025. URL: https://proceedings.mlr.press/v291/yichun25a.html.

On Solving Asymmetric Diagonally Dominant Linear Systems in Sublinear Time

Abstract

Keywords and phrases:

Copyright and License:

2012 ACM Subject Classification:

Related Version:

Acknowledgements:

Funding:

DOI:

Event:

Editor:

Series and Publisher:

1 Introduction

1.1 Basic Notations

Restriction and Pseudoinverse.

Spectral Gap.

Graphs.

Eulerian Graphs and Laplacian.

1.2 Problem Formulation

1.3 Previous Work for SDD Systems

1.4 Formulation of 𝒙∗ and the 𝒑-Norm Gaps

Theorem 1.

Theorem 2.

Theorem 3.

Relationship with the Formulation in [4].

1.5 Main Algorithmic Results

Theorem 4.

Theorem 5.

Theorem 6.

Theorem 7.

Theorem 8.

▶ Remark.

1.6 Connections to PageRank and Effective Resistance Computation

Theorem 9.

Corollary 10.

Theorem 11.

1.7 Understanding ForwardPush and BackwardPush on Graphs

1.8 Technical Overview

1.9 Paper Organization

2 Other Related Work

3 Formulation of 𝒙∗ and the 𝒑-Norm Gaps

Lemma 12.

Lemma 13.

Proof of Theorem 1.

4 Random-Walk Sampling

Lemma 14.

Proof of Theorem 4.

5 The Local Push Method

Lemma 15.

Lemma 16.

Proof.

Lemma 17.

Lemma 18.

Proof.

Lemma 19.

Proof of Theorem 7.

6 The Bidirectional Method

Lemma 20.

Proof.

Lemma 21.

Proof.

Lemma 22.

Proof of Theorem 8.

7 Connections with PageRank Computation

7.1 Results for PageRank Computation when 𝐃𝑮−(𝟏−𝜶)⁢𝐀𝑮⊤ is RCDD

Theorem 23.

Proof.

Lemma 24.

Proof of Theorem 9.

7.2 A Lower Bound on the Accuracy Parameter for SDD Solvers

Lemma 25.

Lemma 26.

Proof of Lemma 25.

Proof of Theorem 11.

8 Connections with Effective Resistance Computation

Lemma 27.

Proof of Corollary 10.

References

1.4 Formulation of $\boldsymbol{x}^{\ast}$ and the $𝒑$ -Norm Gaps

$\blacktriangleright$ Remark.

3 Formulation of $\boldsymbol{x}^{\ast}$ and the $𝒑$ -Norm Gaps

7.1 Results for PageRank Computation when $\mathbf{D}_{G}-(1-\alpha)\mathbf{A}_{G}^{\top}$ is RCDD