Abstract 1 Introduction 2 Preliminaries 3 Stationary and Mixing Time Robustness 4 Hitting Time Robustness 5 Cover Time Robustness References

Markov Chain Robustness

David Zuckerman ORCID University of Texas at Austin, TX, USA
Abstract

When a Markov chain models nature or social interactions, it is likely not followed exactly, but only approximately. We therefore introduce several notions of robustness for a Markov chain P. Our standard adversary can dynamically change transition probabilities of P by 1±ε, and our strong adversary can completely control each transition independently with probability ε, as in a model by Azar, Broder, Karlin, Linial, and Philips [4]. These adversaries are equivalent up to constant factors if the degrees are constant. Our adversarial chains need not converge.

We define and prove various robustness properties of a reversible chain P, i.e., a random walk on a connected undirected graph G. Let d be the maximum degree, Δ the diameter, π the stationary distribution, and tmix the mixing time.

  1. 1.

    We define a natural analogue π+(S) that upper bounds limiting frequencies in a set S in the adversarial chain. We show that if ε=O(1/dtup), where tup is a variant of the mixing time, then π+(S)=O(π(S)1α) for any α>0.

  2. 2.

    We define the mixing time robustness as the largest ε such that the approximate mixing time increases by only a constant factor, and prove that it is Ω(1/dtmix).

  3. 3.

    We define the hitting time robustness as the largest ε such that the maximum hitting time increases by only a constant factor, and show that it is Ω(1/tmix). For trees, we show it is Ω(1/Δ).

  4. 4.

    We define the cover time robustness as the largest ε such that the cover time increases by only a constant factor. We show that in most graphs it’s at least the hitting time robustness.

  5. 5.

    We characterize the mixing, hitting, and cover time robustnesses for constant-degree regular expander graphs up to constant factors. They are Θ(1), Θ(1/logn), and Θ(1/logn), respectively.

Keywords and phrases:
Markov chain, random walk, mixing time, hitting time, cover time, robustness, expander graph
Copyright and License:
[Uncaptioned image] © David Zuckerman; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Random walks and Markov chains
Acknowledgements:
We thank Kuikui Liu and Salil Vadhan for helpful discussions, the anonymous referees for very useful suggestions, and Zhiyang Xun and Dean Doron for helpful comments.
Funding:
Supported in part by NSF Grant CCF-2312573 and a Simons Investigator Award (#409864).
Editor:
Shubhangi Saraf

1 Introduction

Markov chains are used to model many scenarios, and there is a huge theory about them. In many situations, such as a Markov chain modeling nature or social behavior, it is likely that the Markov chain is not followed exactly, but only approximately. We therefore study how behavior in Markov chains is affected by small perturbations.

For example, Cannon, Daymude, Randall, and Richa [6] showed that a collection of computational elements can compress – gather tightly as in a sphere – by asynchronously each executing a simple program. (See also [17].) They analyzed this by modeling it as a Markov chain. Such compression and other self-organizing behavior also occurs in nature. In nature, one may expect that the algorithm is not followed exactly but only approximately. Would the same compressive behavior arise in this case? For a Markov chain modeling social behavior, it’s even more compelling that the chain wouldn’t be followed exactly but could have adversarial influences.

Another situation where errors arise is in simulating huge physical systems. Such a system may exactly follow a Markov chain; however, researchers simulate such a system by coarsening the exact chain into a discrete approximation. This coarsening necessarily leads to error. How badly does this error affect the simulation? In this case, Kania, Aristoff, and Zuckerman [15] recently developed a randomized algorithm to remove the error, but analyzing the error remains interesting.

There has been some work exploring the power of an adversary on certain Markov chains. Chin, Moitra, Mossel, and Sandon [7] studied the power of an adversary in Glauber dynamics. Glauber dynamics is a type of random walk on ΣV, where V is a set of nodes in an underlying graph and Σ is a finite alphabet. They proposed an adversary that controls any changes to a subset AV with |A|ε|V|. They studied how such an adversary can affect global features of the system, such as the approximate mixing time.

Doron, Moshkovitz, Oh, and Zuckerman [9, 10] studied the power of an extremely strong adversary for random walks on lossless expanders. They showed that even in the presence of such a strong adversary, a random walk will approximately mix in some sense. However, their methods don’t appear to extend beyond lossless expanders.

We study a model that can be applied to any random walk. Related questions have been studied before, and we adopt some of those frameworks, but we study different questions.

To study errors in Markov chain behavior, Hartfiel [11, 12] proposed a natural model of Markov set-chains defined as follows. Assume the states are [n]={1,2,,n}, and let n denote the set of n×n stochastic matrices. The Markov set-chain is a stochastic process {Xt} defined by a set 𝒫n.111Hartfiel’s definition has 𝒫 being compact, but let’s ignore that for now. At each time step t, a transition matrix Pt𝒫 is chosen arbitrarily. Then Pt determines the transitions from Xt to Xt+1: Pr[Xt+1=jXt=i]=Pt(i,j). Thus a Markov chain is a special case of a Markov set-chain where the set 𝒫 consists of one matrix, the transition matrix of the Markov chain. In general, the transition matrices in 𝒫 may have different stationary distributions.

Azar, Broder, Karlin, Linial, and Phillips [4] studied a model similar in spirit. More recently, researchers have studied a model very similar to Markov set-chains under the name of dynamic graphs or evolving graphs, e.g., [3, 22].

1.1 The adversary

There appear to be three reasonable choices for the power of an adversary choosing Pt.222Note that we give the adversary more power by allowing Pt to depend on t; otherwise we could get three more choices for the adversary’s power.

  1. 1.

    Oblivious: Pt is chosen independently of {Xs};

  2. 2.

    Markovian: Pt may depend on Xt but not Xs for s<t;

  3. 3.

    Non-Markovian: Pt may depend on Xs for all st.

Note that an oblivious adversary corresponds to a non-homogeneous (or time-dependent) Markov chain. Oblivious and Markovian adversaries are equivalent if 𝒫 is closed under “row replacement”: replacing a row in any P𝒫 by the corresponding row in another P𝒫. The equivalence follows because the adversary can choose the ith row assuming the Markov chain is in state i. The Markovian and non-Markovian adversaries are equivalent if an adversary is trying to affect the hitting time, but not equivalent for cover time. Hartfiel, [4], and recent work on dynamic graphs implicitly use an oblivious adversary (and don’t discuss the others).

We focus on the most powerful, non-Markovian adversary, although we make some observations about the other adversaries.

Can we understand a Markov set chains (MSC) in terms of a related Markov chain? Although reversibility is not required for all of our theorems, we usually assume the related chain is reversible, i.e., a random walk on an undirected graph. We further assume the graph is connected; otherwise we could restrict attention to the appropriate connected component. The most important probability distribution related to a Markov chain is the stationary distribution π. The most important quantities of interest in Markov chains are the mixing time tmix, the time it takes to approach π; the max hitting time thit, the maximum expected time to go from one vertex to another; and the cover time tcov, the expected time to visit all nodes from a worst-case starting node.

First, we discuss the stationary distribution. Suppose all chains in an MSC have the same stationary distribution π. Will this MSC converge to π? It depends on the adversary. For an oblivious adversary, the MSC will converge to π – the usual proof works. For a Markovian adversary, the MSC need not converge, even if all underlying chains are reversible; see Section 3.1.

There are two natural special cases of MSCs that we consider. First, an interval chain is an interval of matrices 𝒫=[P,P+]n, where [P,P+] denotes the set of matrices P such that for all states i,j, we have P(i,j)P(i,j)P+(i,j). For interval chains, oblivious and Markovian adversaries are equivalent, since 𝒫 is closed under row replacement.

For our standard adversary, we consider interval chains of the following form. For P a Markov chain333We identify a Markov chain with its transition matrix. and ε>0, we consider the interval chain [P=(1ε)P,P+=(1+ε)P]. When convenient, we consider the essentially equivalent interval chain [P=eεP,P+=eεP].

Second, [4] introduced a model of biased random walks where a controller biases the random walk. That is, given a Markov chain P and a parameter ε>0, a controller chooses an arbitrary stochastic matrix B with supp(B)supp(P) and the biased random walk follows (1ε)P+εB. That is, with probability ε the controller biases the walk, but has to respect the graph structure. Haslegrave, Sauerwald, and Sylvester [13] studied this in the time dependent setting. The focus of these papers is on a friendly controller who tries to improve the situation from the simple random walk. Our focus is on an adversarial controller who seeks to make matters worse. This will be our strong adversary.

In other words, in both cases, we have that (1ε)P is left alone. A strong adversary may distribute the remaining ε probability arbitrarily on its support. A standard adversary is limited to change each probability by a bounded relative amount. However, a strong adversary with parameter ε is at most as powerful as a (dmaxε)-standard adversary, where dmax denotes the maximum degree. Since we are thinking of dmax as small or constant, we don’t dwell on the difference between the adversaries.

One might suggest an even stronger adversary that can take an arbitrary ε probability and distribute it arbitrarily. In other words, it can choose Pt such that every row of P and Pt differs by at most ε in total variation distance. This adversary generalizes the adversary of [7]. However, if ε1/deg(v) for some node v, then such an adversary can force the MSC to avoid hitting v. This renders some of our questions impossible, so we don’t consider this adversary.

Main question.

For how large an ε does the adversarial chain have similar behavior as the original chain?

In order to make sense of this question, we need to define suitable notions for MSCs. We study several notions: the robustness of the stationary distribution, and the robustness of three times: the mixing time, the maximum hitting time, and the cover time. We will use strong adversaries to define stationary and mixing time robustness, and standard adversaries for hitting time and cover time robustness. This is because our proofs work for strong adversaries for the first two robustnesses, but strong adversaries are too powerful for the last two robustnesses, as explained below.

1.2 Stationary robustness

We begin with the robustness of the stationary distribution π. Although MSCs may not converge to a distribution (see Section 3.1), we can still study the maximum and minimum limiting probabilities of sets. We can define this as follows. For SΩ, let Nt(S) denote the number of visits to S up to time t.

Definition 1.1.

For ε>0, the upper stationary probability

π+(S)=πε+(S)=sup(lim suptNt(S)/t),

where the outer sup is over the choice of a strong ε-adversary.

We could define a lower limit π analogously, but we focus on π+.

For our first theorem, we use a variant of the mixing time that we call tup (see Section 2.3). Like mixing time, it satisfies tup=O(log(1/πmin)γ), where γ is the spectral gap. We prove the following.

Theorem 1.2.

Let G=(V,E) be a graph with all degrees at most d. Let G=(V,E) be a graph with all degrees at most d. For any ε-strong adversary, any starting distribution, and any SV with π(S)1α, we have

π+(S)<(1+α)exp(2ε2dtupln(1/π(S)))π(S).

In particular, for ε=O(1/dtup), we have π+(S)=π(S)exp(O(log(1/π(S)))=O(π(S)1o(1)).

Here the o(1) term goes to 0 as π(S)0, so in particular we have π+(S)=O(π(S)1α) for any α>0.

The dependence of ε on tup is essentially best possible, as the path graph shows.

To interpret this theorem, consider the case when the state space V is exponentially large. Then any event with negligible stationary probability in the original chain has negligible stationary probability in this MSC with a nontrivial adversary. For example, [6] show that compression in their model happens with negligible probability; we deduce that this conclusion holds even if their chain is not followed exactly.

We could define stationary robustness as the largest ε such that the stationary probability analogues increase from p to at most O(pβ) for some β<1. The choice of β or quantification over β could a priori change the definition. However, Theorem 1.2 implies a robust stationarity of Ω(dtup), and at least for the path the choice of β doesn’t matter.

We compare Theorem 1.2 to the work of [4]. They only consider a time-independent adversary, although it wouldn’t matter for this theorem. The main difference between their work and ours is that they show a lower bound on the adversarial power, whereas we show an upper bound. Specifically, they showed that for any bounded-degree graph and any node v, some strong ε-adversary can achieve π+(v)π(v)1Ω(ε), and that this is tight for an expander. Haslegrave, Sauerwald, and Sylvester [13] generalized this to more settings. Our Theorem 1.2 upper bounds π+(S) for every graph and every strong ε-adversary.

1.3 Mixing time robustness

We now turn to the robustness of the fundamental times associated with a random walk: mixing time, max hitting time, and cover time. We begin with mixing time, which requires some set up. Let || denote variation distance. We will generalize the following for a standard random walk. For a node i, let Pri denote the probability for a random walk starting at i. Let

δ(t)=maxi|Pri[Xt]π()|
Definition 1.3.

Fix a small constant α, say α=1/4. The mixing time tmix=tmix(α)=min{t:δ(t)α}.

To generalize this to MSCs, we define

δ+(t)=sup(maxi|Pri[Xt]π()|).

Here π is the stationary distribution of the original Markov chain, and the sup is over strong ε-adversaries. Thus δ(t)δ+(t). We can now define the robust mixing time, which will be a workable definition despite the MSC possibly not converging.

Definition 1.4.

Fix a small constant α, say α=1/4. Define the robust mixing time as

tmix+(ε)=min{t:δ+(t)α},

where ε is the adversary parameter.

Note that tmixtmix+(ε), and that tmix+(ε) may be infinite. This leads to the following definition.

Definition 1.5.

The mixing time robustness is

ρmix(C)=sup{ε:tmix+(ε)Ctmix}.

When C is omitted, we allow C to be an arbitrarily large constant.444This only makes sense when we have a family of graphs, but this applies to big Oh notation generally.

It is not hard to show that the mixing time robustness is Ω(1/(tmixd)), where d is the maximum degree; see Proposition 2.12. We achieve the square root of this quantity for constant-degree graphs.

Theorem 1.6.

In a graph with all degrees at most d, the mixing time robustness ρmix=Ω(1/tmixd).

This is tight for the path and the cycle. However, for expander graphs, this gives Ω(1/logn). We prove a constant lower bound for constant-degree expanders.

Theorem 1.7.

A d-regular γ-spectral expander has mixing time robustness Ω(γ/d).

1.4 Hitting time robustness

We now turn to some expected stopping times. Let 𝔼u denote the expectation for a Markov (set) chain starting at uV. Let Hv denote the hitting time of v: the time to first visit v after time 0. The standard hitting times in an unbiased random walk are h(u,v)=𝔼uHv, and the max hitting time is thit=maxuvh(u,v).

Before defining the robust version, we briefly discuss the adversary. Note that if we have a strong ε-adversary, the robustness for hitting and covering is less than 1/dmin, where dmin denotes the minimum degree. That’s because a strong 1/dmin-adversary can always avoid a particular node. Due to this and our proofs working better with standard adversaries, we define the robust versions for standard adversaries.

Definition 1.8.

Fix an ε>0. The robust hitting time h+(u,v)=sup(𝔼uHv), where the sup is over strategies for standard ε-adversaries. The robust max hitting time is thit+=maxu,vh+(u,v).

Generalizing an observation by [4], the quantities h+ are the same for oblivious, Markovian, and non-Markovian adversaries. This is because an adversary’s best strategy at a given node does not depend on the previous history. This basically follows from Markov decision theory.

We focus on thit+ so we have one quantity rather than n2 quantities. We can now define:

Definition 1.9.

The hitting time robustness is

ρhit(C)=sup{ε:thit+(ε)Cthit}.

When C is omitted, we allow C to be an arbitrarily large constant.

It is not hard to show that the hitting time robustness is Ω(1/thit); see Proposition 2.12. We improve this bound dramatically.

Theorem 1.10.

The hitting time robustness ρhit=Ω(1/tmix).

We can improve this bound for trees. Let Δ denote the diameter.

Theorem 1.11.

The hitting time robustness for any tree is Ω(1/Δ).

Note that Δ=O(tmix), and is often significantly less. For example, for the line and cycle on n nodes, Δ=Θ(n) but tmix=Θ(n2). Even more dramatically, for the balanced binary tree tmix=Θ(nlogn) but Δ=Θ(logn).

For constant degree expanders, Theorem 1.10 gives a bound of Ω(1/logn). We show that this is tight for expanders by giving a matching upper bound, which is a negative result.

Theorem 1.12.

The hitting time robustness of any constant degree expander is ρhit=Θ(1/logn).

1.5 Cover time robustness

We now turn to the cover time. Let the random variable T denote the first time the random walk has visited all nodes. We define the cover time tcov=maxu𝔼uT.

Definition 1.13.

Fix an ε>0. The robust cover time tcov+=sup(𝔼uT), where the sup is over strategies for standard ε-adversaries.

Note that Markovian and non-Markovian adversaries do not appear to be equivalent, since the prior history is very relevant for the cover time. We now define:

Definition 1.14.

The cover time robustness is

ρcov(C)=sup{ε:tcov+(ε)Ctcov}.

When C is omitted, we allow C to be an arbitrarily large constant.

It is not hard to show that the cover time robustness is Ω(1/tcov); see Proposition 2.12. We improve this in many cases. We always have tcov=O(thitlogn); we can improve the cover time robustness whenever this is tight up to a constant factor.

Theorem 1.15.

In any graph with tcov=Θ(thitlogn), we have ρcov=Ω(ρhit).

There are many graphs where tcov=Θ(thitlogn), including expanders (and hence most graphs) [5], the complete graph (coupon collecting), two and higher dimensional grids and tori [2, 25], and the balanced binary tree [24].

1.6 Examples

It’s instructive to see how our bounds apply to natural examples.

1.6.1 Path/cycle

The path is the graph on [n] with n1 edges {i,i+1}; the cycle contains the additional edge {1,n}. In the non-adversarial setting, the main quantities are similar for the two graphs. The mixing time, max hitting time, and cover time are all Θ(n2).

To understand the adversarial setting, we focus on the path. The worst a strong adversary can do is set all Pt(i,i+1)=(1+ε)/2 and all Pt(i,i1)=(1ε)/2. In this case, the hitting times will increase by a factor of about ((1+ε)/(1ε))n. Therefore, the hitting time robustness is Θ(1/n)=Θ(1/tmix). All other robustnesses have the same order of magnitude. Thus, Theorem 1.6 is tight for mixing time robustness, but Theorem 1.10 is not tight for hitting time robustness. See Section 4.4 for details.

1.6.2 Expanders

Here we consider d-regular expander graphs, where we think of d=O(1). In the non-adversarial setting, the mixing time tmix=Θ(logn), the max hitting time thit=Θ(n) [21], and the cover time tcov=Θ(nlogn) [5, 21]. We actually prove these last two theorems in what we believe is a simpler way.

Our theorems imply tight bounds on the robustness notions for constant-degree regular expanders. Specifically, Theorem 1.7 show that the mixing time robustness is a constant, while Theorem 1.12 and Theorem 1.15 show that the hitting and cover time robustnesses are Θ(1/logn).

We remark that previous work shows that lossless expanders have robustness properties beyond what we consider here. A d-regular graph is a lossless expander if sets that are not too large expand by a factor of (1η)d where η<1/2. Doron, Moshkovitz, Oh, and Zuckerman [9, 10] studied random walks where a non-Markovian adversary chooses steps according to a Chor-Goldreich (CG) source, and even generalizations of this. In a CG-source, in a d-regular graph, an adversary can choose any distribution for the next step such that no edge is chosen with probability more than dδ for some parameter δ>0. They show that even this biased walk mixes in the sense that the final node has entropy lognO(1). However, their methods do not seem to extend to standard expander graphs.

1.7 Weak randomness

As another perspective, there has been a lot of research investigating the use of weak randomness in computing. In those situations, it is natural to allow computation on the weak randomness, for example to purify it into high-quality randomness. This work disallows such preprocessing, which is particularly natural when the random process models nature or social interactions.

1.8 Techniques

We prove Theorem 1.2 and Theorem 1.6 about stationarity and mixing time robustness by generalizing a potential function argument by Haslegrave, Sauerwald, and Sylvester [13]. Their potential function uses squares to prove a lower bound. We use a potential function based on smaller powers to prove an upper bound. In particular, we need an inequality going in the opposite direction. Specifically, for x=(x1,,xd)[0,)d define the -power mean M(x)=((ixi)/d)1/, M=maxixi, and the ε-max-avg operator MAε=εM+(1ε)M1. We use Holder’s inequality and other ideas to bound MAε(x) in terms of M(x) for some just slightly larger than 1. We also simplify the [13] method somewhat, avoiding the use of trajectory trees.

We prove Theorem 1.7 about mixing time robustness of expanders by adapting the proof that the second largest eigenvalue determines convergence.

We prove Theorem 1.10 about hitting time robustness by showing that in a standard random walk, there’s the “right” probability of hitting a node within twice the separation time tsep=O(tmix). Specifically, for hv=𝔼π[Hv], we show that for all u,vV, we have Pru[Hv2tsep]=Ω(tsep/hv). We then show that this probability doesn’t decline much even in the robust setting. Dividing the random walk into epochs of length 2tsep, we conclude that the hitting time doesn’t increase much. We further use this technique to give a simpler proof of the max hitting time for expanders, which will be in the final version of this paper. Given its utility for both of these theorems, we believe that this technique could be useful elsewhere.

We prove Theorem 1.11 by first observing that we can take the adversary to be oblivious, so the corresponding adversarial walk has a stationary distribution. We then compare the stationary distributions, and apply the “essential edge lemma” from Aldous-Fill about hitting times across cut edges.

We prove Theorem 1.12, the negative result giving a tight bound for expanders, by giving a target distribution and modifying the Metropolis algorithm. Much of this argument is similar to a lower bound on π+(S) by [4].

Theorem 1.15 about cover time robustness follows from Matthews’ technique for bounding the cover time in terms of the hitting time.

1.9 Related work

We have not found any previous work giving good bounds on our notions of robustness, except for lossless expanders. Doron, Moshkovitz, Oh, and Zuckerman [9, 10] analyzed extremely strong non-Markovian adversaries for lossless expanders, but their methods don’t apply more generally. They also didn’t analyze hitting or cover times.

In the model introduced by [4] and in follow-up works, the focus is on a controller seeking to minimize hitting and cover times, and they show lower bounds on increasing the probability of being in sets. For example, Haslegrave, Sauerwald, and Sylvester [13] improved a result of [4] and proved that an ε-controller can increase the probability of being in a set from p to approximately p1ε. They used this to minimize hitting and cover times, for example showing how to reduce the cover time of an expander to O(nloglogn). This is different than our work where we view the controller as an adversary, so hitting times and cover times increase rather than decrease. Moreover, the quantification is different: they show that a suitable adversary exists, whereas we show that all adversaries are not too bad.

Chin, Moitra, Mossel, and Sandon [7] studied the power of an adversary in Glauber dynamics. Glauber dynamics is a type of random walk on ΣV, where V is a set of nodes in an underlying graph and Σ is a finite alphabet. They proposed an adversary that controls any changes to a subset AV with |A|ε|V|. They studied how such an adversary can affect global features of the system, such as the approximate mixing time. Their methods seem tailored to Glauber dynamics and not generalizable.

There is also research analyzing how the stationary distribution of a Markov chain is affected by perturbations. For example, Liu ([18], Theorem 2.2) showed that if P is perturbed to P~, which has stationary distribution π~, then

|ππ~|thit2|PP~|,

where |M| for a matrix M is the maximum 1-norm of any row. Thus, the best bound on robustness that this could give is Ω(1/thit2), which is much weaker than what we obtain. Moreover, our robustness is for Markov set chains, which are much more general than Markov chains.

Hartfiel [12] studies when Markov set-chains converge, and movement between transient states and absorbing states and the like. He doesn’t address the types of questions we ask.

Avin, Koucky, and Lotker [3] study random walks on evolving graphs. Here an adversary is allowed to completely change the graph, i.e., transition matrix, at every time step. However, the adversary doesn’t know the location of the random walker; in other words, the adversary decides the graphs in advance. They show that the cover time and mixing time could be exponentially larger than the cover times and mixing times of the individual graphs, but are only polynomially larger if the random walk is made lazy. This model is quite different from ours.

Hunter [14] studies a related model, what in our terminology is a strong oblivious adversary. He defines η to be the “random target time” – the time for a random walk to hit a random node chosen according to π – but confusingly calls this the mixing time. Really, η is more closely related to thit; see [16] for more on the random target time. He then shows that the adversarial walk has a stationary distribution at most ηε-far from π in variation distance. This is much weaker than our results.

There are other adversarial variants of Markov chains that don’t appear relevant for us. For example, the PageRank algorithm [20] and variants (e.g., [23]) allow perturbations where the adversary need not respect the graph structure, which leads to very different results. For another example, [8] define a model that they call “adversarial Markov chains” on an infinite metric space. In their model, transitions are followed exactly except when the chain is in some prespecified bounded subset, in which case an adversary can make arbitrary bounded jumps. They study whether such an adversarial chain remains bounded in probability.

1.10 Conjectures and Open Problems

We conjecture the following.

Conjecture 1.16.

The hitting time robustness is Ω(1/Δ), where Δ denotes the diameter.

Since Δ=O(tmix), this would improve Theorem 1.10. Theorem 1.11 establishes this conjecture for trees, which is tight for the path and cycle.

Conjecture 1.17.

For all graphs, the cover time robustness is the hitting time robustness, up to a constant factor.

Conjecture 1.18.

We can replace tup in Theorem 1.2 with tmix.

What do we need to know about a graph to determine its robustnesses? Are the different robustnesses efficiently computable, perhaps up to constant factors?

Another class of interesting open problems is to determine the various robustnesses for natural graphs, such as different dimensional grids, hypercubes, and balanced trees.

2 Preliminaries

2.1 Random walk basics

We will assume throughout that any undirected graph G=(V,E) is connected. We denote n=|V|, m=|E|, Γ(v) the set of neighbors of node vV, and dv=|Γ(v)| the degree of v. A random walk on G is a sequence of random variables (X0,X1,) where each XtV and Xt+1 is uniform on Γ(Xt). We use P to denote the transition matrix of the random walk. Letting pt denote the probability distribution of Xt, viewed as a row vector, so we have pt+1=ptP. If G is not bipartite, then pt converges to the stationary distribution π given by π(v)=dv/(2m). Even if G is bipartite, (pt+pt+1)/2 converges to π.

Let 𝔼u and Pru denote the expectation and probability, respectively, for a random walk starting at uV. Let Hv denote the hitting time of v: the time to first visit v after time 0. The standard hitting times in an unbiased random walk are h(u,v)=𝔼uHv, and the max hitting time is thit=maxuvh(u,v).

The following lemma is well known.

Lemma 2.1.

The expected return time h(v,v)=1/π(v).

2.2 Expander graphs

Expanders are graphs where every subset of nodes has many neighbors. We will use the standard spectral characterization, which is equivalent for constant expansion. For simplicity, assume that G is regular. Denote the eigenvalues of P by 1=λ1>λ2λn1. The last equality holds only when G is bipartite. Let us assume that G is not bipartite, e.g., by adding self loops. Let

λ=max(λ2,λn)=maxi1|λi|

denote the second largest eigenvalue in absolute value. Let γ=def1λ denote the spectral gap. We call G a γ-spectral expander.

2.3 Mixing time variants

Recall the definition of mixing time, Definition 1.3. The choice of α only affects the mixing time up to a constant factor. Specifically:

Lemma 2.2.

In any graph, tmix(α)log21/αtmix(1/4).

For this and much more on the mixing time see, for example, the book by Levin, Peres, and Wilmer [16].

Up to a logarithmic factor, the reciprocal of the spectral gap γ characterizes the mixing time. Specifically, it is known (e.g., [16] Theorems 12.3 and 12.4) that

tmix(α) (1/γ)log(1/(απmin))
tmix(α) (1/γ1)log(1/(2α))

We will use a variant of the mixing time called the separation time tsep.

Definition 2.3.

The separation distance is:

s(t)=maxvV(1pt(v)/π(v)).

In other words, it’s the minimum β such that all nodes v have pt(v)(1β)π(v).

We can now define separation time in the natural way.

Definition 2.4.

Fix a small constant α, say α=1/4. Define the separation time

tsep=tsep(α)=min{t:s(t)α}.
Lemma 2.5.

In any graph, tsep=Θ(tmix).

For the proof, see Aldous-Fill [1]. Briefly, it’s immediate that δ(t)s(t). Aldous-Fill (Lemma 4.7 in [1]) show that s(2t)1(1δ(t))2.

We further define a variant of the separation distance briefly discussed in Aldous-Fill in Equation 4.14. They left it unnamed, so we name it.

Definition 2.6.

The upper separation distance is:

sup(t)=maxvV(pt(v)/π(v)1).

In other words, it’s the minimum β such that all nodes v have pt(v)(1+β)π(v).

We can now define upper separation time analogously.

Definition 2.7.

Fix a small constant α, say α=1/4. Define the upper separation time

tup=tup(α)=min{t:sup(t)α}.

While we have δ(t)sup(t), we can’t bound sup(2t) in terms of δ(t); see Aldous-Fill’s Example 4.9. Nevertheless, the upper bounds of tmix in terms of the spectral gap γ also hold for tsep and tup, just by examining the proof.

Lemma 2.8.

We have tsep(α),tup(α)log(απmin)γ.

 Remark 2.9.

Since standard random walks on bipartite graphs don’t converge, the above definitions aren’t interesting for bipartite graphs. The usual way of circumventing this is to study lazy random walks on bipartite graphs. Alternatively, we can get meaningful statements for bipartite graphs by replacing pt with (pt+pt+1)/2 in the above definitions, understanding that even though the random walks don’t converge, the average of pairs of steps converge.

2.4 Easy bounds on robustness

We will use the following notation.

Definition 2.10.

Let pt(v,S) denote the probability that an unbiased walk, starting at node v, is in the set S at time t. For a given adversary A, let qt(v,S) denote the probability that an A-biased walk, starting at node v, is in the set S at time t. When the start node v is understood, we often omit it and write pt(S) and qt(S).

We have the following simple lemma.

Lemma 2.11.

For any standard ε-adversary, any time t, and any set S, we have qt(S)etεpt(S).

Note that this does not hold for a strong ε-adversary, since such an adversary could move ε probability to a set S with tiny pt(S).

It’s not hard to show that each robustness is at least the reciprocal of the corresponding quantity.

Proposition 2.12.

For any graph G, the following bounds hold:

  1. 1.

    The mixing time robustness, if defined by a standard adversary, is Ω(1/tmix). Since we defined mixing time robustness ρmix using a strong adversary, this implies ρmix=Ω(1/(dtmix)) for graphs with maximum degree d.

  2. 2.

    The hitting time robustness ρhit=Ω(1/thit).

  3. 3.

    The cover time robustness ρcov=Ω(1/tcov).

Proof.

First we show the mixing time robustness. Using Lemma 2.2, choose t=O(tmix) such that |ptπ|α/2. Set ε=α/(2t)=Ω(1/tmix). Then a standard ε-adversary can’t make the biased walk probabilities qt much smaller than the unbiased probabilities pt. Specifically, for any set S, we have

qt(S)(1ε)tpt(S)(1tε)pt(S)=(1α/2)pt(S).

Therefore, |qtpt|α/2, and the triangle inequality gives |qtπ|α.

We prove the hitting time and cover time robustness results together. Let R be a stopping time such as a hitting time or cover time. Let t=4maxu𝔼u[R]. Then by Markov’s inequality, for any node v, we have Prv[Rt]1/4, so Prv[Rt]3/4. Set ε=1/(4t). Fix a strong ε-adversary, and let R+ denote R with respect to this adversary. Then for any node v, we have

Prv[R+t](1tε)3/41/2.

But now dividing time into epochs of length t, the chance of R+ in each epoch is at least 1/2, so by Wald’s identity we have 𝔼u[R+]2t8maxu𝔼u[R].

3 Stationary and Mixing Time Robustness

In this section, we first show that for a Markovian adversary, a Markov set-chain will not necessarily converge to π, even if all underlying chains are reversible and have π as their stationary distribution. We only consider a Markovian adversary in Section 3.1; in the rest of this section we consider standard and strong adversaries. Next we show how to save a square root from the easy mixing time robustness of Ω(1/tmix), proving Theorem 1.6. We add one more ingredient to prove Theorem 1.2. Finally, we establish that constant-degree expanders have constant mixing time robustness, namely Ω(1/γ).

3.1 MSCs non-convergence

First we show that for a Markovian adversary, the MSC will not necessarily converge to π, even if all underlying chains are reversible. To see this, we define two weighted undirected graphs based on the path graph. Denote Odd={{i,i+1}i is odd}, and Even={{i,i+1}i is even}. Graph Godd assigns edges in Odd weight 2, and edges in Even weight 1, and adds weighted self loops at the two endpoints to make all nodes have weighted degree 3. Graph Geven is the same but swaps even and odd: it assigns edges in Even weight 2, and edges in Odd weight 1. Let Podd denote a random walk on Godd, and similarly for Peven. Since Godd and Geven are both regular, both have uniform stationary distributions.

However, consider a Markovian MSC that chooses Podd if Xt is odd and Peven otherwise. If Xt is not an endpoint, then Pr[Xt+1=Xt+1]=2/3. We thus have a biased random walk which converges to a very biased distribution very far from uniform.

3.2 Bounding the increase in probability

We prove that if a set S has probability p, then a strong ε-adversary cannot increase the probability to more than O(p1o(1)) as long as tdε2=O(1). This quadratic dependence on ε improves on the easy linear dependence given by Lemma 2.11. We use a potential function argument that generalizes one by Haslegrave, Sauerwald, and Sylvester [13]. They used a potential function based on squares to prove a lower bound. We use a potential function based on smaller powers to prove an upper bound. In particular, we need an inequality going in the opposite direction as [13].

For x=(x1,,xd)[0,)d define the -power mean M(x)=((ixi)/d)1/, M=maxixi, and the ε-max-avg operator MAε=εM+(1ε)M1. We now develop the inequality we need.

Lemma 3.1.

For k,r1 and krε1, we have (1+rε)k+r(1ε)kr+1+2k2r2ε2.

Proof.
(1+rε)k+r(1ε)k =1+krε+(k2)(rε)2+(k3)(rε)3+
+rkrε+(k2)rε2(k3)rε3+
r+1+2((k2)(rε)2+(k3)(rε)3+)
r+1+2(k2)(rε)2(1+(krε/2)+(krε/2)2+)
r+1+4(k2)(rε)2r+1+2k2r2ε2.

Lemma 3.2.

For x[0,)d and k,>1 satisfying 1/k+1/=1 and k(d1)ε1, we have MAε(x)e2k(d1)ε2M(x).

Proof.

Let x=(x1,,xd), and assume without loss of generality that x1x2xd0. Using Holder’s inequality and Lemma 3.1 with r=d1,

MAε(x) =(ε+1εd)x1+1εdx2++1εdxd
1d((1+(d1)ε)k+(d1)(1ε)k)1/k(x1++xd)1/
1d(d+2k2(d1)2ε2)1/k(x1++xd)1/
(1+2k2(d1)ε2)1/kM(x)
e2k(d1)ε2M(x).

Recall Definition 2.10 for the definitions of pt and qt.

Theorem 3.3.

Let G=(V,E) be a graph with all degrees at most d, and fix a starting node. Let k,>1 satisfy 1/k+1/=1 and kdε1. For any ε-strong adversary, any start node, and any SV, we have qt(S)e2tkdε2pt(S)1/.

Proof.

We follow the structure of [13] but both simplify it somewhat, avoiding the use of the trajectory tree, and generalize it. Fix the transition matrix P, a start node u, target set S, and stopping time t. Define the vectors

pt = (pt(v))vV
st = (qt(v,S))vV.

When intermixed with matrices, we will view pt as a row vector and st as a column vector, for reasons that will become apparent. Note that pt+1=ptP. Define the potential function

Φi=pti,si=vVpti(v)qi(v,S).

Since q0(v,S)=1 if vS and 0 otherwise, we have that

Φ0=vVpt(v)q0(v,S)=pt(S).

Since p0(v)=1 if v=u and 0 otherwise, we have that

Φt=vVp0(v)qt(v,S)=qt(u,S)=qt(S).

If the random walk is at node v with i+1 steps remaining, the adversary’s strategy to maximize qi+1(v,S) is to use the ε probability it controls to move to a neighbor w maximizing qi(w,S). Letting dv denote the degree of v, this strategy gives

qi+1(v,S)=MAε((qi(w,S))wΓ(v))e2kdvε2M((qi(w,S))wΓ(v)),

using Lemma 3.2. Raising both sides to the th power gives

qi+1(v,S)e2kdvε21dvwΓ(v)qi(w,S).

Using the fact that all degrees are at most d, we can deduce an inequality with vectors and matrices:

si+1e2kdε2Psi.

Now we can bound the potential function by

Φi+1=pti1,si+1e2kdε2pti1Psi=e2kdε2pti,si=e2kdε2Φi.

We conclude that Φtexp(2tkdε2)Φ0, and

qt(S)=Φt1/e2tkdε2Φ01/=e2tkdε2pt(S)1/.

Corollary 3.4.

The conclusion of Theorem 3.3 holds for any starting distribution.

Proof.

This follows from the concavity of the function x.

Corollary 3.5.

For any ε-strong adversary, any starting distribution, and any SV, we have

qt(S)exp(2ε2tdln(1/p))p,

where p=pt(S). In particular, for ε=O(1/td), we have qt(S)=O(p1o(1)).

Proof.

We choose k to optimize the upper bound for qt(S). Let c=2tdε2, p=pt(S), and r=ln(1/p). Since 1/=11/k, the upper bound is

eckp11/k=eck+r/kp.

By the arithmetic-geometric mean inequality, this is minimized when ck=r/k and achieves the bound e2crp=exp(2ε2tdr)p.

We now restate and prove Theorem 1.6, which is tight for the path and the cycle. See 1.6

Proof.

While we can deduce this from Corollary 3.5, it’s more straightforward to apply Theorem 3.3 directly. We will choose suitable constants k and that satisfy the assumptions of Theorem 3.3 and use the conclusion. Letting c=defexp(2tkdε2), we can bound the variation distance

|qtpt|supp[0,1](cp1/p).

The derivative of cp1/p is (c/)p1/11=(c/)p1/k1. There are two cases. If c, then the derivative is nonnegative on [0,1], so the sup is achieved at p=1 and the sup is c1. If c<, then the sup is achieved when cp1/k=. This value of p evaluates to:

cp1/p=p(cp1/k1)=p(1)2p/k2/k,

using that =1/(11/k)1+2/k for k2.

We conclude that |qtpt|max(c1,2/k). By choosing k=4/α and ε=α/(8tkd), and using that ex1+2x for x1/2, we get that this maximum is at most α/2. By Lemma 2.2, after t=O(tmix) steps we have |ptπ|α/2. Therefore, by the triangle inequality, |qtπ|α. Thus the robustness is bounded below by

ε=α/(8tkd)=Ω(1/tmixd).

We now use Corollary 3.5 to upper bound π+(S) and prove Theorem 1.2.

Theorem 1.2. [Restated, see original statement.]

Let G=(V,E) be a graph with all degrees at most d. Let G=(V,E) be a graph with all degrees at most d. For any ε-strong adversary, any starting distribution, and any SV with π(S)1α, we have

π+(S)<(1+α)exp(2ε2dtupln(1/π(S)))π(S).

In particular, for ε=O(1/dtup), we have π+(S)=π(S)exp(O(log(1/π(S)))=O(π(S)1o(1)).

Proof.

As usual, let pt() and qt() denote unbiased and adversarially-biased probabilities, respectively. Let SV be arbitrary, and fix a starting node v for the random walk. First assume that G is bipartite. Note that to show π+(S)B, it suffices to show that for all large enough t we have qt(S)B. Let c=2ε2dtup and f(x)=xecln(1/x). To bound qt(S) for ttup, set t=ttup, and condition on Xt. Recall that by the definition of tup, we have ptup(S)(1+α)π(S). Using f(rx)rf(x) for rx<1 and r>1, and applying Corollary 3.5, we have

qt(v,S)=𝔼Xt[qtup(Xt,S)]𝔼Xt[f(ptup(Xt,S))](1+α)f(π(S)),

as required for the nonbipartite case.

The bipartite case follows similarly by comparing (qt+qt+1)/2 with (pt+pt+1)/2.

3.3 Mixing time robustness for expanders

We now establish robustness in terms of spectral gap. See Section 2.2 for the definition of a spectral expander. We first restate our theorem

Theorem 1.7. [Restated, see original statement.]

A d-regular γ-spectral expander has mixing time robustness Ω(γ/d).

Proof.

Our strong adversary chooses a stochastic matrix Bt with supp(Bt)supp(P), and the transition matrix Pt=(1ε)P+εBt. Let qt denote the vector of probabilities at time t. We can write qt=π+rt, where rtπ can be viewed as an error term. Therefore,

qt+1 = (1ε)Pqt+εBtqt
= (1ε)π+(1ε)Prt+εBtπ+εBtrt

Subtracting π gives

rt+1=(1ε)Prt+εBtrt+ε(Btππ)

Using that for stochastic B with row sums at most d, we have Bv2dv2, (BI)v2dv2, and Pw2λw2 for wπ yields:

rt+12((1ε)λ+εd)rt2+εd/n (1)

We will write (1) as rt+12brt2+δ to simplify our calculation. Our calculation will show that we can choose ε small enough to make rt converge quickly. For ε01 to be chosen later, set

ε=(1λ)ε02(dλ).

Then b=def(1ε)λ+εd=λ+ε(dλ)λ+(1λ)/2=(1+λ)/2<1.

Let δ=defεd/n. Note that for xbx+δ with b<1, and for x2δ/(1b), we have

xbx+(1b)x/2=((1+b)/2)x.

Plugging into (1), we deduce that for rt24εd/n/(1λ), we have

rt+121+λ2rt2.

Consequently, after t=O(log1/bn) steps, we have

rt1nrt24εd1λ=2ε0ddλ2ε0dd/4=8ε0.

Here we used that for d2, we have d1d/4.

Thus, by choosing ε0 a small enough constant, we can make the variation distance to π, which is rt1/2, an arbitrarily small constant.

4 Hitting Time Robustness

4.1 Inverse of mixing time

We now show that the hitting time robustness is Ω(1/tmix). First recall the separation time tsep and that tsep=Θ(tmix) (Lemma 2.5). We first prove that short unbiased walks hit nodes with at least the “right” probability. Let hv=𝔼π[Hv].

Lemma 4.1.

For all u,vV, we have Pru[Hv2tsep]tsep/(16hv).

Proof.

Let

p=defPrπ[Hvtsep]=wVπ(w)Prw[Hvtsep].

Since for any starting u, after tsep steps every node w has probability at least π(w)/2, we have

Pru[Hv2tsep]wVπ(w)2Prw[Hvtsep]=p2.

We now show that ptsep/(8hv), which will prove the lemma. Suppose that p<tsep/(8hv). We will show that Prπ[Hv2hv]<1/2, which contradicts Markov’s inequality, proving the lemma.

To show this, let k=2hv/tsep. We prove by induction that for all ik, we have

Prπ[Hvitsep]<2ip1/2.

This is true for i=1. Suppose it’s true for a given i. Then

Prπ[Hv(i+1)tsep]=Prπ[Hvitsep]+Prπ[Hv(i+1)tsepHv>itsep].

To analyze the conditional probability, we bound the conditional distribution of Xt given that Hv>t, where t=itsep. Specifically, for any node w,

Prπ[Xt=wHv>t]=Prπ[Xt=wHv>t]Prπ[Hv>t]Prπ[Xt=w]Prπ[Hv>t]=π(w)Prπ[Hv>t]

Thus, conditioning on Hv>t can only increase probabilities by at most 1/Prπ[Hv>t]2. Therefore,

Prπ[Hv(i+1)tsepHv>itsep]2p,

and we are done.

We are now ready to prove our theorem about hitting time robustness.

Theorem 4.2.

For all ε and all nodes u,v, we have hu,v+=O(hvexp(O(εtmix))).

Proof.

We prove the theorem with tsep in place of tmix; this suffices since tsep=O(tmix). By Lemma 4.1, even with an ε-adversary, we have

q=defPru[Hv+2tsep]exp(2tsepε)tsep/(16hv).

By Wald’s identity, we have hu,v+2tsep/q=O(hvexp(O(εtmix))).

Theorem 1.10, stating that the hitting time robustness ρhit=Ω(1/tmix), follows.

4.2 Hitting time robustness for trees

We now show that the hitting time robustness is Ω(1/Δ) for trees, confirming ˜1.16 in this case. We first restate the theorem.

Theorem 1.11. [Restated, see original statement.]

The hitting time robustness for any tree is Ω(1/Δ).

Fix a tree G with at least two edges (otherwise it’s trivial), so Δ2. Let ε=1/(2Δ), and fix a standard ε-adversary. For the hitting time, we can assume without loss of generality that the adversary is oblivious, and hence the adversarial random walk P has a stationary distribution π. We first show that π is not too different than the stationary distribution π of the uniform random walk P.

Lemma 4.3.

For any node v, we have π(v)/4π(v)4π(v).

Proof.

First consider two neighbors v and w. Let π(v,w)=π(v)P(v,w) denote the stationary probability of the directed edge (v,w). Since {v,w} is a cut edge, we must have π(v,w)=π(w,v). Therefore,

π(v)π(w)=P(w,v)P(v,w)(1ε)dv(1+ε)dw(12ε)dvdw.

For v,w that may not be neighbors, an inductive argument gives a bound in terms of the distance Δ(v,w):

π(v)π(w)(12ε)Δ(v,w)dvdwdv4dw,

because (11/x)x1/4 for x2. The lemma will follow because π(v)=dv/(2m) is exactly proportional to the degree. To elaborate, define the ratio r(v)=π(v)/π(v). Then

r(v)r(w)=π(v)π(w)π(w)π(v)=π(v)π(w)dwdv14.

Since some r(u)1 and some r(u)1, we have 1/4r(v)4 for all v, as claimed.

Proof of Theorem 1.11.

As in the above lemmas, we set ε=1/(2Δ) and fix a standard ε-adversary. We now show that h+(v,w)<8h(v,w), which will prove the theorem. We consider the oblivious ε-adversary that maximizes the hitting time from v to w. It suffices to show h+(v,w)8h(v,w) for neighbors v and w, since for arbitrary v and w in a tree the hitting time is the sum of the hitting times between neighbors.

We now use the idea behind the “essential edge lemma” from Aldous-Fill (Lemma 5.1 in [1]. For neighbors v and w, consider the graph H obtained from G by removing all nodes and edges that are strictly closer to w than to v. Thus, in H the node w has degree 1, and h(v,w) and h+(v,w) are the same in H as in G. Let π and π denote the corresponding stationary distributions in H, and let h(,) and h+(,)denote hitting times in H. Then since a random walk in H starting at w is forced to go to v, we have the following:

h(v,w) =h(w,w)1=1/π(w)11/(2π(w)) (since π(w)1/2)
h+(v,w) =h+(w,w)1=1/π(w)1<4/π(w)8h(v,w) (using Lemma 4.3)

The theorem follows.

4.3 Negative result

Definition 4.4.

Fix a simple undirected graph G=(V,E). Let Δ(v,w) denote the distance between v and w in G. Define the average distance Δ¯=𝔼v,w[Δ(v,w)].

Theorem 4.5.

For every regular graph G and ε[0,1], there is a strong oblivious ε-adversary with thit+=Ω(neεΔ¯).

Proof.

Much of this proof is similar to a lower bound on π+(S) by [4].

The idea is to construct a “lazy” adversary – allowing self loops – where some node has stationary probability at most eεΔ¯/n, and then apply Lemma 2.1. The lazy adversary won’t be a valid adversary because the original graph doesn’t have self loops, so we then convert it into a valid adversary.

Let G be d-regular. Fix a node u such that 𝔼w[Δ(u,w)]Δ¯. We first define a lazy adversary using a Metropolis algorithm. The target stationary distribution ρ will be proportional to the weight defined as wt(v)=(1ε)Δ(u,v). That is, ρv=wt(v)/z, where z=vwt(v). Since for an edge {v,w} we have |Δ(u,v)Δ(u,w)|1, the lazy adversary corresponds to the Metropolis Markov chain Q where

Qv,w={1/dif {v,w}E and Δ(u,w)Δ(v,w)(1ε)/dif {v,w}E and Δ(u,w)=Δ(v,w)11xΓ(v)Qv,xif w=v

Since Q corresponds to a Metropolis algorithm for ρ, the stationary distribution of Q is ρ. Observe that

z=v(1ε)Δ(u,v)=n𝔼v[(1ε)Δ(u,v)n(1ε)𝔼v[Δ(u,v)]n(1ε)Δ¯.

Let h() denote the hitting times for Q. Since some neighbor v of u has h(v,u)h(u,u)1, by Lemma 2.1 we have

thit1/ρu1=z1n(1ε)Δ¯1.

However, Q is not a valid adversary, since a valid adversary must only place weight on edges of G, but G has no self loops. To circumvent this issue, distribute all the self loop probabilities proportional to the other edges. That is, let Qv=xΓ(v)Qv,x, and define

Rv,w={(1+Qv,v)/(dQv)if {v,w}E and Δ(u,w)Δ(v,w)(1ε)/(dQv)if {v,w}E and Δ(u,w)=Δ(v,w)1

In other words R is a sped-up version of Q. A random trajectory of Q can be converted into a random trajectory of R by simply deleting consecutive repetitions. Thus, expected hitting times of R are at least minvQv1ε times those of Q. Since R is a strong oblivious ε-adversary, we have

thit+(1ε)thitn(1ε)1Δ¯1=Ω(neεΔ¯).

Corollary 4.6.

For every regular graph G and ε[0,1], there is a strong oblivious ε-adversary with thit+=Ω(ne(εlogdn)/2).

Proof.

Every d-regular graph has Δ¯(logdn)/2.

This enables us to conclude:

Theorem 1.12. [Restated, see original statement.]

The hitting time robustness of any constant degree expander is ρhit=Θ(1/logn).

Proof.

The unbiased max hit time for any expander is O(n) [21]. In order for thit+ to be O(n), we need ε=Ω(1/logn). Also, ε=Θ(logn) suffices because of Theorem 1.10 and the fact that constant degree expanders have tmix=Θ(logn).

4.4 Tight analysis of path/cycle

Notice that Theorem 1.11 is mainly useful when the unbiased thit=Θ(n). We now give a tight analysis of the robustnesses of the path and cycle, which have thit=Θ(n2). We focus on the hitting time robustness, as the others will be the same. We now show:

Theorem 4.7.

The hitting time robustness of the path and cycle are Θ(1/n).

Proof.

From Theorem 1.11, we have ρhit=Ω(1/n). It therefore suffices to show the upper bound and negative result that ρhit=O(1/n).

The idea is to use the essential edge lemma and its weighted version. It says that if {v,w} is a cut edge, then h(v,w)=2e(Gv)+1, where Gv is the connected component of v if {v,w} is removed. The weighted version simply counts the weights of the edges, and normalizes the edge {v,w} to have weight 1 (or divide by wt(v,w)).

For simplicity we analyze the path on n+1 nodes labeled 0 to n and edges ei={i,i+1}. Consider the hitting time h(n,0)=h(n,n1)+h(n1,n2)++h(1,0). Let’s see how often edge ei appears in 2e(Gv)+1, where the +1 gets charged to {v,w}. The contribution of edge ei to h(n,0) is 1+2i.

Now compare it when we bias the random walk by ε. We can do this by assigning wt(ei)=(1+ε)i. The contribution of edge ei to the biased hitting time h+(n,0) is

1+2((1+ε)+(1+ε)2++(1+ε)i1)2(1+ε)i/ε>2(i2)ε.

Thus the ratio of this to the contribution in the unbiased random walk is at least Ω(iε). Now, over half the contribution comes from in/2, and for such i we can lower bound this ratio by Ω(nε). Therefore, if ε=ω(1/n), then the ratio h+(n,0/h(n,0)=ω(1), as we wanted.

5 Cover Time Robustness

In the non-adversarial setting, Matthews [19] upper bounded the cover time in terms of the max hitting time:

Theorem 5.1 ([19]).

tcovthit(1+1/2++1/n)thit(1+lnn).

There are many examples where this is tight up to a constant factor: expanders (and hence most graphs) [5], the complete graph (coupon collecting), two and higher dimensional grids and tori [2, 25], and the balanced binary tree [24].

We observe that in any graph with tcov=Θ(thitlogn), we have ρcov=Θ(ρhit). This is because Matthews’ proof works the same in the adversarial setting.

Theorem 5.2.

For any adversary, tcov+thit+(1+lnn).

Proof.

Matthews [19] proof works in the adversarial setting, so

tcov+thit+(1+1/2++1/n)thit+(1+lnn).

Corollary 5.3.

In any graph with tcov=Θ(thitlogn), we have ρcov=Ω(ρhit).

We conjecture that more generally, in any graph, ρcov=Θ(ρhit).

References

  • [1] D. Aldous and J. A. Fill. Reversible Markov chains and random walks on graphs, 2002. Unfinished monograph, recompiled 2014, available at http://www.stat.berkeley.edu/˜aldous/RWG/book.html.
  • [2] D.J. Aldous. On the time taken by random walks on finite groups to visit every state. Z. Wahrscheinlichkeitstheorie verw Gebiete, 62:361–374, 1983.
  • [3] C. Avin, M. Koucky, and Z. Lotker. Cover time and mixing time of random walks on dynamic graphs. Random Structures and Algorithms, 52:576–596, 2018. doi:10.1002/RSA.20752.
  • [4] Y. Azar, A. Z. Broder, A. R. Karlin, N. Linial, and S. J. Phillips. Biased random walks. Combinatorica, 16:1–18, 1996. doi:10.1007/BF01300124.
  • [5] A.Z. Broder and A.R. Karlin. Bounds on the cover time. Journal of Theoretical Probability, 2:101–120, 1989.
  • [6] S. Cannon, J. J. Daymude, D. Randall, and A.W. Richa. A Markov chain algorithm for compression in self-organizing particle systems. In Proceedings of the 2016 Symposium on Principles of Distributed Computing, pages 279–288, 2016.
  • [7] B. Chin, A. Moitra, E. Mossel, and C. Sandon. The power of an adversary in Glauber dynamics. In The Thirty Seventh Annual Conference on Learning Theory, volume 247 of Proceedings of Machine Learning Research, pages 1102–1124, 2024. URL: https://proceedings.mlr.press/v247/chin24a.html.
  • [8] R.V. Craiu, L. Gray, K. Latuszynski, N. Madras, G.O. Roberts, and J.S. Rosenthal. Stability of adversarial Markov chains, with an application to adaptive MCMC algorithms. Annals of Applied Probability, 25:3592–3623, 2015.
  • [9] D. Doron, D. Moshkovitz, J. Oh, and D. Zuckerman. Almost Chor-Goldreich sources and adversarial random walks. In Proceedings of the 55th Annual ACM Symposium on Theory of Computing, pages 1–9, 2023.
  • [10] D. Doron, D. Moshkovitz, J. Oh, and D. Zuckerman. Online condensing of unpredictable sources via random walks. In Proceedings of the 40th Computational Complexity Conference, volume 339 of LIPIcs, pages 30:1–30:17, 2025. doi:10.4230/LIPIcs.CCC.2025.30.
  • [11] D.J. Hartfiel. On the limiting set of stochastic products xA1Ak. Proceedings of the American Mathematical Society, 81:201–206, 1981.
  • [12] D.J. Hartfiel. Markov Set-Chains, volume 1695 of Lecture Notes in Mathematics. Springer-Verlag, 1998.
  • [13] J. Haslegrave, T. Sauerwald, and J. Sylvester. Time dependent biased random walks. ACM Transactions on Algorithms, 18, 2022.
  • [14] J.J. Hunter. Mixing times with applications to perturbed markov chains. Linear Algebra and Its Applications, 417:108–123, 2006.
  • [15] S. Kania, D. Aristoff, and D.M. Zuckerman. Riteweight: Randomized iterative trajectory reweighting for steady-state distributions without discretization error. Technical report, Arxiv, 2024. arxiv:2401.05597.
  • [16] D.A. Levin and Y. Peres. Markov Chains and Mixing Times. American Mathematical Society, 2nd edition, 2017.
  • [17] S. Li, B. Dutta, S. Cannon, J. J. Daymude, R. Avinery, E. Aydin, A. W. Richa, D. I. Goldman, and D. Randall. Programming active cohesive granular matter with mechanically induced phase changes. Science Advances, 7:eabe8494, 2021.
  • [18] Y. Liu. Perturbation bounds for the stationary distributions of Markov chains. SIAM Journal on Matrix Analysis and Applications, 33:1057–1074, 2012. doi:10.1137/110838753.
  • [19] P. Matthews. Covering problems for Markov chains. Annals of Probability, 16:1215–1228, 1988.
  • [20] L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical Report Technical Report 1999-66, Stanford InfoLab, 1999.
  • [21] R. Rubinfeld. The cover time of a regular expander is O(n log n). Information Processing Letters, 35:49–51, 1990. doi:10.1016/0020-0190(90)90173-U.
  • [22] T. Sauerwald and L. Zanetti. Random walks on dynamic graphs: Mixing times, hitting times, and return probabilities. In Proceedings of the 46th International Colloquium on Automata, Languages, and Programming, volume 132 of LIPIcs, pages 93:1–93:15, 2019. doi:10.4230/LIPIcs.ICALP.2019.93.
  • [23] D. Vial and V.G. Subramanian. Restart perturbations for reversible markov chains: Trichotomy and pre-cutoff equivalence. Random Structures and Algorithms, 66, 2025.
  • [24] D. Zuckerman. Covering times of random walks on bounded degree trees and other graphs. Journal of Theoretical Probability, 2:147–157, 1989.
  • [25] D. Zuckerman. A technique for lower bounding the cover time. SIAM Journal on Discrete Mathematics, 5:254–259, 1992.