Abstract 1 Introduction 2 Preliminaries 3 Large Girth is Not Sufficient 4 π’Œ-Fold Unions are Sufficient 5 A Bicriterion Concentration Inequality References Appendix A Useful Concentration Inequalities Appendix B Missing Proofs and Examples

A Bicriterion Concentration Inequality and Prophet Inequalities for k-Fold Matroid Unions

Noga Alon ORCID Department of Mathematics, Princeton University, NJ, USA
Schools of Mathematics and Computer Science, Tel Aviv University, Tel Aviv, Israel
Nick Gravin ORCID Key Laboratory of Interdisciplinary Research of Computation and Economics, Shanghai University of Finance and Economics, China Tristan Pollner Department of Management Science and Engineering, Stanford University, CA, USA Aviad Rubinstein ORCID Department of Computer Science, Stanford University, CA, USA Hongao Wang ORCID Department of Computer Science, Purdue University, West Lafayette, IN, USA S. Matthew Weinberg ORCID Department of Computer Science, Princeton University, NJ, USA Qianfan Zhang ORCID Department of Computer Science, Princeton University, NJ, USA
Abstract

We investigate prophet inequalities with competitive ratios approaching 1, seeking to generalize k-uniform matroids. We first show that large girth does not suffice: for all k, there exists a matroid of girth β‰₯k and a prophet inequality instance on that matroid whose optimal competitive ratio is 12. Next, we show k-fold matroid unions do suffice: we provide a prophet inequality with competitive ratio 1βˆ’O⁒(log⁑kk) for any k-fold matroid union. Our prophet inequality follows from an online contention resolution scheme.

The key technical ingredient in our online contention resolution scheme is a novel bicriterion concentration inequality for arbitrary monotone 1-Lipschitz functions over independent items which may be of independent interest. Applied to our particular setting, our bicriterion concentration inequality yields β€œChernoff-strength” concentration for a 1-Lipschitz function that is not (approximately) self-bounding.

Keywords and phrases:
Prophet Inequalities, Online Contention Resolution Schemes, Concentration Inequalities
Funding:
Noga Alon: Research supported in part by NSF grant DMS-2154082.
Nick Gravin: Research is supported by National Key R & D Program of China (2023YFA1009500), NSFC grant 61932002, β€œthe Fundamental Research Funds for the Central Universities in China”.
Aviad Rubinstein: Research supported in part by NSF CCF-1954927, and a David and Lucile Packard Fellowship.
S. Matthew Weinberg: Research supported in part by NSF CCF-1955205. During Professor Weinberg’s development of this paper, he participated as an expert witness on behalf of the State of Texas in ongoing litigation against Google (the β€œGoogle Litigation”).
Qianfan Zhang: Research supported in part by NSF CCF-1955205.
Copyright and License:
[Uncaptioned image] © Noga Alon, Nick Gravin, Tristan Pollner, Aviad Rubinstein, Hongao Wang, S. Matthew Weinberg, and Qianfan Zhang; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation β†’ Online algorithms
Related Version:
Full Version: https://arxiv.org/abs/2411.11741
Acknowledgements:
The authors are grateful to the anonymous reviewers for helpful feedback on the initial submission of this work.
Editor:
Raghu Meka

1 Introduction

Prophet inequalities are fundamental problems in optimal stopping theory, whose study dates back to seminal work of Krengel and SuchestonΒ [17], and that have wide applications across Economics and Computer Science (e.g., [8, 11]). A prophet inequality instance contains a ground set E of elements, a family β„±βŠ†2E of feasible sets, and a collection of distributions {π’Ÿe}e∈E. For one element at a time, a random variable ve is drawn from distribution π’Ÿe independently and revealed to a gambler, who immediately and irrevocably decides whether to accept or reject e. The gambler must at all times maintain the set of accepted elements Aβˆˆβ„±, and gets payoff βˆ‘e∈Ave at the end of the game. A prophet inequality is c-competitive if it guarantees 𝐄[βˆ‘e∈Ave]β‰₯c⋅𝐄[maxSβˆˆβ„±β’βˆ‘e∈Sve].111The expectation is taken with respect to the random variables {ve}e∈E, which in turn makes A a random variable.

Krengel and Sucheston’s seminal result establishes a 12-competitive prophet inequality for any instance where β„± is a 1-uniform matroid (i.e.Β at most one element is feasible to accept), and moreover establish that no better guarantee is possible.222That is, there exist prophet inequality instances over 1-uniform matroids for which better than a 12-competitive ratio is impossible. The hard instance is quite simple: v1βˆΌπ’Ÿ1 is a point mass at 1, and v2βˆΌπ’Ÿ2 is equal to 1Ξ΅ with probability Ξ΅ and 0 otherwise. A gambler who sees v1 first cannot achieve expect reward exceeding 1, but a prophet who always takes the maximum can achieve expected reward of 2βˆ’Ξ΅. For k-uniform matroids, however, a significantly improved guarantee of 1βˆ’O⁒(1k) is possibleΒ [1, 14, 10]. This motivates the following question: for a given Ξ΅>0, what conditions on β„± suffice for a (1βˆ’Ξ΅)-competitive prophet inequality?

Main Result I: Large Girth does not Suffice

A natural starting point to address this question is to first understand what makes k-uniform matroids β€œspecial” in the sense that the canonical hard instance cannot be embedded. One conjecture might be because k-uniform matroids have large girth: there are no infeasible sets of size ≀k. So, a natural first question to ask is whether β„± having large girth suffices in order to conclude that any instance over β„± admits a c-competitive prophet inequality. Our first main result establishes that large girth does not suffice.

Theorem 1.

For all kβ‰₯1 and Ξ΅>0, there exists a prophet inequality instance (E,β„±,{π’Ÿ}e∈E) such that: (a) (E,β„±) is a graphic matroid with girth k, and (b) (E,β„±,{π’Ÿ}e∈E) does not admit a (12+Ξ΅)-competitive prophet inequality.

Our construction leverages dense graphs of high girth (and a particular construction ofΒ [18]) in order to effectively embed multiple copies of the canonical hard 1-uniform instance. See SectionΒ 3 for further details.

Main Result II: π’Œ-fold Matroid Unions Suffice

TheoremΒ 1 motivates richer generalizations of k-uniform matroids. We next consider k-fold matroid unions, observing that k-uniform matroids are the union of k 1-uniform matroids. Given a matroid β„³=(E,β„±) over ground set E with feasible sets β„±, the k-fold union of β„³ is a new matroid β„³k with ground set E and feasible sets β„±k:={F1βˆͺF2βˆͺβ‹―βˆͺFk:F1,F2,…,Fkβˆˆβ„±}. That is, a set is feasible in β„³k if it can be partitioned into k sets that are each feasible in β„³.

Theorem 2.

For every prophet inequality instance (E,β„±k,{π’Ÿe}e∈E) where (E,β„±k) is the k-fold union of a matroid (E,β„±), there exists a (1βˆ’O⁒(log⁑kk))-competitive prophet inequality.

Our proof of TheoremΒ 2 follows from a novel Online Contention Resolution Scheme (OCRS). An OCRS is parameterized by a ground set E, a feasibility family β„±, and a vector of probabilities π’™βˆˆConvexHull⁑({𝟏F:Fβˆˆβ„±})βŠ†[0,1]E (that is, 𝒙 can be written as a convex combination of indicator vectors of feasible sets). One at a time, elements of E are revealed and active with probability xe independently. If an element is active, it can be accepted or rejected (if inactive, it must be rejected), and the accepted elements must at all times be in β„±. An OCRS is c-selectable if every element e is accepted with probability at least cβ‹…xe. In this language, TheoremΒ 2 follows from a novel (1βˆ’O⁒(log⁑kk))-selectable OCRS for k-fold matroid unions.

To prove our OCRS, we follow a similar framework asΒ [12], and design a recursive decomposition of β„± over which to greedily accept active elements. There are two key challenges to applying their framework, which we overview in greater detail in SubsectionΒ 4.2. We give a representative example below.

Applied to the 1-uniform matroid, theΒ [12] algorithm simply proposes β€œaccept any active element independently with probability b.” Then, linearity of expectation suffices to observe that there are at most b elements in expectation that are both active and accepted,333There are at most 1 elements in expectation that are active, and each active element is accepted with probability b. and Markov’s inequality suffices to guarantee that with probability at least 1βˆ’b, no elements are accepted at all. This suffices to guarantee that for all e: (a) with probability at least 1βˆ’b it is feasible to accept e when revealed, and (b) independently, we will accept e with probability b conditioned on e being active and feasible. This implies a b⁒(1βˆ’b)-selectable algorithm, which is optimized at b=12.

Applied to the k-uniform matroid, a natural algorithm would again be β€œaccept any active element independently with probability b.” Then, linearity of expectation still suffices to observe that there are at most b⁒k elements in expectation that are both active and accepted, but Markov’s inequality only guarantees that with probability at least 1βˆ’b, at most k elements are accepted. This would lead to the same 14-selectable OCRS, which is not the desired 1βˆ’O⁒(log⁑kk). Of course, the obvious fix is to use a significantly stronger concentration inequality than Markov’s. E.g., a Chernoff bound suffices to guarantee that with probability at least 1βˆ’1k at most kβˆ’1 elements are accepted, when b=1βˆ’O⁒(log⁑kk). This leads to the desired 1βˆ’O⁒(log⁑kk) selectable OCRS for k-uniform matroids. However, Chernoff bounds are insufficient for the general class of k-fold matroid unions – the probability that a particular element is feasible to accept is a highly combinatorial function that depends on the underlying matroid structure. Thus our TheoremΒ 2 has two components: first, a decomposition that reduces the OCRS problem to a concentration inequality and second, a novel concentration inequality, which is our third main result.

Main Result III: A Bicriterion Concentration Inequality

Putting aside prophet inequalities for a moment, concentration inequalities are a core aspect of applied probability with widespread application across many areas of Computer Science. One representative setting is the following: Let f:{0,1}E→ℝ be some function, and let 𝑿=⟨Xe⟩e∈E be a vector of independent Bernoulli random variables, where Xe∼Ber⁒(pe). A canonical question asks: what is the probability that f⁒(𝑿) exceeds 𝐄[f⁒(𝑿)]+t?

On one extreme, McDiarmid’s inequality holds whenever f is 1-Lipschitz. On the other, Chernoff bounds are significantly stronger, if f is linear (and 1-Lipschitz). In between, β€œChernoff-strength” concentration holds whenever f is fractionally-subadditive or (approximately) self-boundingΒ [5, 6, 22, 7, 27], but this provably does not extend even to the case when f is subadditiveΒ [27].

Our third main result provides a bicriterion concentration inequality for any monotone 1-Lipschitz function. Specifically, if 𝑿 is a vector of Bernoulli random variables with probability vector 𝒑, let 𝑿(s) denote a vector of Bernoulli random variables with probability vector eβˆ’s⁒𝒑. That is, each probability pi has been decreased by a factor of eβˆ’s. Our new concentration inequality establishes:

Theorem 3.

Let f:{0,1}E→ℝ be a monotone 1-Lipschitz function. For any s∈(0,1], t>0:

𝐏𝐫[f⁒(𝑿(s))β‰₯𝐄[f⁒(𝑿)]+t]≀eβˆ’s⁒t.

A helpful comparison point is McDiarmid’s inequality, which instead proves the following: 𝐏𝐫[f⁒(𝑿)β‰₯𝐄[f⁒(𝑿)]+t]≀eβˆ’2⁒t2/|E|. The distinctions are: (a) our concentration inequality is bicriterion – we analyze f⁒(𝑿(s)) instead of f⁒(𝑿), and (b) our concentration has an exponent of βˆ’s⁒t instead of βˆ’2⁒t2/|E|. In particular, McDiarmid’s inequality depends on the dimension |E| and cannot possibly kick in for tβ‰ͺ|E|, whereas our concentration inequality can kick in for any t>1/s. A representative example to have in mind might be s=log⁑(1/Ξ΅)/𝐄[f⁒(𝑿)] and t=log⁑(1/Ξ΅)⁒𝐄[f⁒(𝑿)]. This results in a tail probability of Ξ΅ for exceeding 𝐄[f⁒(𝑿)] by log⁑(1/Ξ΅) multiples of 𝐄[f⁒(𝑿)], which is β€œChernoff-strength”. But, this concentration holds only for f⁒(𝑿(s)), rather than f⁒(𝑿). This suffices for our application.

To prove TheoremΒ 3, we utilize the entropy method for self-bounding functionsΒ [5, 6, 22, 7] in an unconventional way. We give a more detailed technical overview in SubsectionΒ 5.1.

1.1 Related Work

There are three strands of related work: prophet inequalities, concentration inequalities, and attempts to generalize k-uniform guarantees.

Prophet Inequalities

Prophet inequalities have a long history in Mathematics, Computer Science, and Operations Research. Representative results include Krengel and Sucheston’s initial 12-approximationΒ [17], Samuel-Cahn’s elegant thresholding strategyΒ [25], Chawla et al.’s connection to Bayesian mechanism designΒ [8], Kleinberg and Weinberg’s extension to matroidsΒ [16], and Dutting et al.’s connection to Price of AnarchyΒ [11].

Of particular relevance to our work are prophet inequalities for k-uniform matroids. The first 1βˆ’O⁒(log⁑kk) approximation was developed byΒ [13], and the first asymptotically tight 1βˆ’O⁒(1k) approximation was developed byΒ [1]. Subsequent works achieve the same 1βˆ’O⁒(1k) approximation with sample accessΒ [4], the optimal OCRSΒ [14], or a simpler OCRSΒ [10]. The most technically related paper to our work isΒ [12], whose OCRS framework we leverage. It remains an open question whether the prophet inequality for k-fold matroid unions can be improved to 1βˆ’O⁒(1k).

Concentration Inequalities

The most related concentration inequalities fit the same framework but consider different f. McDiarmid’s inequalityΒ [21] holds for all 1-Lipschitz f, Schechtman’s inequality holds for f that are subadditiveΒ [26], Bucheron et al. derive an inequality for f that are self-bounding functionsΒ [5], and VondrΓ‘k derives an inequality for f that are fractionally subadditiveΒ [27]. These inequalities are commonly used across Theoretical Computer Science, and especially within combinatorial prophet inequalities and Bayesian mechanism designΒ [24, 23].

Generalizing π’Œ-uniform matroids

Recent work ofΒ [9] considers (offline) contention resolution and correlation gap inequalities. Here too, guarantees for k-uniform matroids are significantly stronger than what is achievable for arbitrary matroids. Their work similarly extends guarantees achievable for k-uniform matroids to k-fold matroid unions. In comparison to our work: (a) the general motivation is the same – both works seek to extend stronger guarantees for k-uniform matroids to more general settings, (b) the problems studied and technical aspects are orthogonal,444While in principle, contention resolution and online contention resolution may appear similar, the relevant techniques are fundamentally different with little overlap. Similarly, while correlation gap inequalities are sometimes a useful tool in prophet inequalities, in this case there is no overlap. (c) our work also proposes a bicriterion concentration inequality.

Another generalization of k-uniform matroids are packing constraints, where each element has a d-dimensional size in [0,1]d and one can accept a subset of elements if their size vectors sum to at most k in every coordinate. Packing constraints have been studied in various online settings, including secretary model [15], prophet model [2], and mixed model [3].

2 Preliminaries

Prophet Inequalities

In the prophet inequality problem, we are given a ground set of elements E, a downward-closed family of feasible sets β„±βŠ†2E, and a distribution π’Ÿe associated with each element e∈E. Elements arrive in an adversarial order.555There are various adversarial models. The weakest is the fixed-order adversary, which sets the arrival order offline, based solely on the distributions. The strongest is the almighty adversary, which sets the arrival order online, with full knowledge of all realizations of randomness and the algorithm’s past decisions. Our negative result in SectionΒ 3 applies to fixed-order adversary, while our positive result in SectionΒ 4 holds against almighty adversary. As each element e arrives, its value ve, independently drawn from π’Ÿe, is revealed. At this point, an irrevocable decision must be made whether to include e in its output A, while keeping Aβˆˆβ„±.

For c∈[0,1], we say an online algorithm implies a c-competitive prophet inequality for β„±, if for any distributions {π’Ÿe}e∈E,

𝐄[βˆ‘e∈Ave]β‰₯c⋅𝐄[maxSβˆˆβ„±β’βˆ‘e∈Sve]

where the expectation is taken with respect to random variables {ve}e∈E and the internal randomness of the algorithm.

Online Contention Resolution Schemes

Given a ground set of elements E and a downward-closed family of feasible sets β„±βŠ†2E, we define the polytope of β„± as the convex hull of all characteristic vectors of feasible sets, i.e., 𝒫ℱ=ConvexHull⁑({𝟏F:Fβˆˆβ„±})βŠ†[0,1]E.

An online contention resolution scheme (OCRS) takes a vector π’™βˆˆπ’«β„± as input. Let R⁒(𝒙)βŠ†E be a random set where each element e∈E is in R⁒(𝒙) independently with probability xe. The OCRS sees membership in R⁒(𝒙) of elements in E, arriving in an adversarial order; when each e∈E arrives, if e∈R⁒(𝒙) (i.e., e is β€œactive”), the scheme must decide irrevocably whether to include e in its output A, while keeping Aβˆˆβ„±.

For c∈[0,1], an OCRS is called c-selectable for β„±, if for any π’™βˆˆπ’«β„±,

𝐏𝐫[e∈A∣e∈R⁒(𝒙)]β‰₯cβˆ€e∈E

where Aβˆˆβ„± is the output of the OCRS, and the probability is measured with respect to R⁒(𝒙) and internal randomness of the OCRS. As shown in [12], a c-selectable OCRS directly implies a c-competitive prophet inequality.

Lemma 4 ([12]).

For a ground set E and a family of feasible sets β„±βŠ†2E, a c-selectable OCRS for β„± implies a c-competitive prophet inequality for β„±.

Matroids

A matroid β„³=(E,ℐ) is defined by a ground set of elements E and a non-empty downward-closed family of independent sets β„βŠ†2E with the exchange property, i.e., for every A,Bβˆˆβ„ where |A|>|B|, there exists an element e∈Aβˆ–B such that Bβˆͺ{e}βˆˆβ„. Given a matroid β„³=(E,ℐ), the following notations are used throughout the paper:

  • β– 

    The rank of a set SβŠ†E is the size of the largest independent set contained in S: rank⁑(S)=max⁑{|I|:IβŠ†S,Iβˆˆβ„}.

  • β– 

    The span of a set SβŠ†E is the set of elements that is not independent from S: span⁑(S)={e∈E:rank⁑(S)=rank⁑(Sβˆͺ{e})}.

  • β– 

    The restriction of β„³ to a set SβŠ†E is a matroid β„³|S=(S,ℐ|S)=(S,{Iβˆˆβ„:IβŠ†S}).

  • β– 

    The girth of β„³ is the size of the smallest dependent set: girth⁑(β„³)=min⁑{|S|:SβŠ†E,Sβˆ‰β„}.

Following are some special matroids that we will use later.

Example 5 (Uniform matroid).

A k-uniform matroid β„³=(E,ℐ) is a matroid in which the independent sets are exactly the sets that contains at most k elements for an integer kβ‰₯1, i.e, ℐ={IβŠ†E:|I|≀k}.

Example 6 (Graphical matroid).

A graphical matroid β„³=(E,ℐ) is a matroid in which the independent sets are the forests in a given undirected graph G=(V,E), i.e., ℐ={IβŠ†E:I⁒is acyclic in⁒G}.

We formally define k-fold matroid union as follows.

Definition 7 (k-fold matroid union).

Given a matroid β„³=(E,ℐ) and an integer kβ‰₯1, the k-fold union of β„³ is defined as β„³k=β„³βˆ¨β„³βˆ¨β‹―βˆ¨β„³βŸkΒ times=(E,ℐk) where

ℐk={I1βˆͺI2βˆͺβ‹―βˆͺIk:I1,I2,…,Ikβˆˆβ„}.

In other words, a set I is independent in β„³k if and only if I can be partitioned into at most k independent sets in β„³. Note that β„³k remains a matroid by the closure property of matroid union.

3 Large Girth is Not Sufficient

In this section, we prove that a large girth is not sufficient for matroids (specifically, graphical matroids) to have a prophet inequality with a competitive ratio better than 12.

See 1

To construct a hard instance, we start with a dense graph of large girth. We then transform the graph by splitting each edge (v1,v2) into two edges (v1,u) and (v2,u), where u is a newly introduced vertex. We obtain the final hard instance of the prophet inequality problem by embedding the hard instance of the single-item case into each of these edge pairs (v1,u),(v2,u).

The hardness of this instance arises from the following observation: without accepting both edges in a pair, the instance essentially reduces to |E| independent hard instances of the single-item case. On the other hand, one can accept at most |V|βˆ’1 extra pairs of edges (in addition to |E| single-item problems) at the same time without forming a cycle, which could not contribute a lot to the final solution because the graph is dense.

Proof of TheoremΒ 1.

We employ a construction of [18] which provides dense graphs of large girth. In particular, we will use that for any fixed k there exists some arbitrarily large n such that there is a graph Gn on n vertices with at least n⁒log⁑n edges and girth at least k.666In fact, [18] prove a significantly stronger result, but the weaker version stated above suffices for our purposes. Specifically, consider the graph Gn with vertices V⁒(Gn)={v1,v2,…,vn} and edges E⁒(Gn)={e1,e2,…,em}, where mβ‰₯n⁒log⁑n and each edge ei=(a⁒(i),b⁒(i))∈E⁒(Gn) connects vertices a⁒(i) and b⁒(i) in V⁒(Gn). We construct a new graph Hn with n+m vertices as follows:

  • β– 

    Begin with a set of n+m vertices, labeled V⁒(Hn):={w1,w2,…,wn}βŠ”{u1,u2,…,um}.

  • β– 

    For each edge ei in Gn connecting va⁒(i) and vb⁒(i), add in Hn an edge between ui and wa⁒(i) (call it fi) as well as an edge between ui and wb⁒(i) (call it fiβ€²).

Hence Hn has a total of 2⁒m edges. For 1≀i≀m, let the associated random variable Xfi of fi be a constant 1, and let the associated random variable Xfiβ€² of fiβ€² follow a distribution which takes a value of 1Ξ΅ with probability Ξ΅, and a value of 0 with probability 1βˆ’Ξ΅. We consider an instance of the prophet inequality problem where the online algorithm is presented edges in the order (f1,f1β€²,f2,f2β€²,…,fm,fmβ€²).

We first lower bound OPT⁑(Hn), the expected value the optimal offline algorithm gets on this instance. Note that an offline algorithm could simply look at each pair {fi,fiβ€²} and take whichever edge has higher realized weight; this cannot create a cycle because every edge selected will be incident to a vertex of degree 1. We hence have the bound

OPT⁑(Hn)β‰₯βˆ‘i=1m(Ξ΅β‹…1Ξ΅+(1βˆ’Ξ΅)β‹…1)=m⁒(2βˆ’Ξ΅).

Fix an online algorithm π’œ, and we now give an upper bound on its expected performance π’œβ’(Hn) on the instance. The lower bound relies on the following observation.

Claim 8.

There are at most nβˆ’1 values of i in {1,2,…,m} such that π’œ accepts both fi and fiβ€².

Proof.

Suppose there are at least n such values of i; call them i1, i2, …, in. As the original graph G has n vertices, and a forest on n vertices has at most nβˆ’1 edges, we clearly see that there is a cycle among {ei1,ei2,…,ein}. That however would imply there is a cycle in H; namely, follow the cycle that existed in G, but replace each edge ei with the edge fi followed by the edge fiβ€². ⊲

For each 1≀i≀m, we now consider cases for what π’œ gets in expectation from {fi,fiβ€²} right after fi arrives:

  • β– 

    If π’œ rejects fi, then it clearly gets in expectation at most 1 from {fi,fiβ€²} because 𝔼⁒[Xfiβ€²]=1.

  • β– 

    If π’œ accepts fi and rejects fiβ€², then it clearly gets weight at most 1 from {fi,fiβ€²}.

  • β– 

    If π’œ accepts fi and accepts fiβ€², then it clearly gets weight at most 1+1Ξ΅ from {fi,fiβ€²}.

Let C1 denote the set of all i∈[m] such that π’œ rejects fi, let C2 denote the set of all i∈[m] such that π’œ accepts fi and rejects fiβ€², and let C3 denote the set of all i∈[m] such that π’œ accepts fi and fiβ€². Note C1, C2, and C3 are random (disjoint) sets that may depend on the values realized by {Xfi,Xfiβ€²}i=1m and any randomness in π’œ. By the above cases, we can see that in expectation, π’œ gets score at most

βˆ‘i∈C11+βˆ‘i∈C21+βˆ‘i∈C3(1+1Ξ΅)=|C1|+|C2|+|C3|β‹…(1+1Ξ΅).

Although |C1|, |C2|, and |C3| are random variables, |C1|+|C2|≀m always, and by 8 we have |C3|≀nβˆ’1 always. Hence, in expectation (averaging over all possible realizations of C1, C2, and C3), we can bound the performance of π’œ on Hn by π’œβ’(Hn)≀m+n⁒(1+1Ξ΅). As n grows, we can compute

lim infnβ†’βˆžπ’œβ’(Hn)OPT⁑(Hn)≀limnβ†’βˆžm+n⁒(1+1Ξ΅)m⁒(2βˆ’Ξ΅)=12βˆ’Ξ΅.

Taking Ξ΅β†’0 demonstrates the claimed result. β—€

4 π’Œ-Fold Unions are Sufficient

Our main goal in the section is to construct a good OCRS for k-fold matroid unions (TheoremΒ 9). Combining with the reduction from prophet inequalities to OCRSs by [12] (4), this immediately implies the existence of good prophet inequality for all k-fold matroid unions (TheoremΒ 2).

Theorem 9.

There exists a (1βˆ’O⁒(log⁑kk))-selectable OCRS for any k-fold matroid unionΒ β„³k.

Our OCRS for k-fold matroid unions builds on the chain decomposition approach used in the matroid OCRS by [12], outlined in SubsectionΒ 4.1. We overview our approach and highlight main difficulties in SubsectionΒ 4.2. The construction is then formally given and analyzed in SubsectionΒ 4.3, where the bicriterion concentration inequality in SectionΒ 5 is used to bound its selectability.

4.1 Recap: OCRS for general matroids

We briefly describe the idea of the 14-selectable matroid OCRS byΒ [12]. Specifically, they show that for any parameter b∈(0,1), there exists a (1βˆ’b)-selectable OCRS for any matroid β„³=(E,ℐ) and π’™βˆˆb⋅𝒫ℳ. Note that one can β€œscale down” a vector 𝒙 from 𝒫ℳ to b⋅𝒫ℳ by only considering each element independently with probability b. Formally:

Fact 10.

For b,c∈(0,1) and any matroid β„³, a c-selectable OCRS for all 𝐱∈b⋅𝒫ℳ implies a b⁒c-selectable OCRS for all π±βˆˆπ’«β„³.

Therefore, it follows that a b⁒(1βˆ’b)-selectable ORCS exists for any matroid β„³ and π’™βˆˆπ’«β„³. By letting b=12, they obtain a 14-selectable matroid OCRS.

The greedy algorithm

Let us start with the simple greedy algorithm that always accepts the active element whenever possible. When β„³ is a 1-uniform matroid, the greedy algorithm is actually (1βˆ’b)-selectable for π’™βˆˆb⋅𝒫ℳ (i.e., βˆ‘e∈Exe≀b since β„³ is 1-uniform), since the selectability of an element e∈E can be easily lower bounded as

𝐏𝐫[eΒ is accepted∣eΒ is active] β‰₯𝐏𝐫[no other element is active∣eΒ is active]
β‰₯𝐏𝐫[no element is active].

The first inequality holds because when there is no active elements besides e, the greedy algorithm can always accept e even if it arrives at the end. The second inequality holds due to the independence between elements. Moreover, by Markov’s inequality,

𝐏𝐫[no element is active] =1βˆ’ππ«[|R⁒(𝒙)|β‰₯1]
β‰₯1βˆ’π„[|R⁒(𝒙)|]=1βˆ’βˆ‘e∈Exeβ‰₯1βˆ’b.

(Recall that R⁒(𝒙) is the set of active elements.)

The first half of argument applies when β„³ is a general matroid: for every element e∈E,

𝐏𝐫[eΒ is accepted∣eΒ is active]β‰₯𝐏𝐫[eβˆ‰span⁑(R⁒(𝒙))].

However, unlike in 1-uniform matroids, the probability that an element e∈E is spanned by active elements R⁒(𝒙) could be much smaller than 1βˆ’b, even for a scaled π’™βˆˆb⋅𝒫ℳ. In fact, the selectability of the greedy algorithm can be arbitrarily bad for a general matroid β„³ (see, e.g., [19]).

Protection

Consider AlgorithmΒ 1, a modified greedy algorithm with a protection set S⊊E that only handles elements in Eβˆ–S. Intuitively, the algorithm accepts every active element e∈Eβˆ–S whenever it does not conflict with any element in S. As a result, elements in S are β€œprioritized” over those in Eβˆ–S: regardless of which independent set from S is accepted, it remains an independent set when combined with the accepted elements in Eβˆ–S.

Algorithm 1 Modified greedy algorithm for β„³=(E,ℐ) with a protection set SβŠ†E.

For the modified greedy algorithm, we can similarly lower bound the selectability for e∈Eβˆ–S:

𝐏𝐫[eΒ is accepted∣eΒ is active]β‰₯𝐏𝐫[eβˆ‰span⁑(R⁒(𝒙)βˆͺS)].

The good news is that, such probabilities can be further lower bounded by 1βˆ’b for the S obtained using AlgorithmΒ 2, an iterative algorithm that updates S by adding an element e whenever 𝐏𝐫[e∈span⁑(R⁒(𝒙)βˆͺS)]>b.

Algorithm 2 Find a protection set S for β„³=(E,ℐ) and π’™βˆˆb⋅𝒫ℳ.

Note that AlgorithmΒ 2 always terminates since E is a finite set, and the modified greedy algorithm with this protection set S guarantees (1βˆ’b)-selectability for every element e∈Eβˆ–S. More importantly, the protection is non-trivial, i.e., S is a proper subset of E.

Lemma 11 ([12]).

For any matroid β„³=(E,ℐ) and 𝐱∈b⋅𝒫ℳ, Protect⁒(β„³,𝐱,b)⊊E.

Therefore, it remains to get a good OCRS for β„³|S and 𝒙|S, the restriction of the original matroid and vector to the protection set S.

Chain decomposition

The matroid OCRS inΒ [12] starts with an offline prepossessing that finds the following chain decomposition of the elements:

βˆ…=Nβ„“βŠŠNβ„“βˆ’1βŠŠβ‹―βŠŠN1⊊N0=E

where Ni+1=Protect⁒(β„³|Ni,𝒙|Ni,b) for every 0≀i<β„“. And the OCRS is then operates by invoking AlgorithmΒ 1 on matroid β„³|Ni with a protection set Ni+1 for each e∈Niβˆ–Ni+1.

It is easy to see that these algorithms together produces an independent set of β„³, and the selectability for each element e∈Niβˆ–Ni+1 is

𝐏𝐫[eΒ is accepted∣eΒ is active]β‰₯1βˆ’ππ«[e∈spanβ„³|Ni⁑(R⁒(𝒙|Ni)βˆͺNi+1)]β‰₯1βˆ’b

where the last inequality holds due to the way Ni+1 is obtained using AlgorithmΒ 2. By setting b=12, the resulting OCRS is 12-selectable given any matroid β„³ and π’™βˆˆ12⋅𝒫ℳ.

4.2 Overview of our construction

We now give a high-level overview of our construction and highlight main difficulties. Let us first examine the case when β„³ is a k-uniform matroid and see why the simple greedy algorithm works better for larger k.

Intuition from π’Œ-uniform matroids

When β„³ is a k-uniform matroid, it turns out that the simple greedy algorithm that always accepts the active element whenever possible yields an OCRS with a selectability of 1βˆ’O⁒(log⁑kk). To see this, consider the following tighter analysis of selectability for k-uniform matroids: for every element e∈E,

𝐏𝐫[eΒ is accepted∣eΒ is active]β‰₯𝐏𝐫[eβˆ‰span⁑(R⁒(𝒙))]=𝐏𝐫[|R⁒(𝒙)|<k].

Intuitively, |R⁒(𝒙)| represents the number of slots occupied by active elements, and we know |R⁒(𝒙)|<k indicates eβˆ‰span⁑(R⁒(𝒙)). We want the bad event |R⁒(𝒙)|β‰₯k to occur with a small probability.

Note that |R⁒(𝒙)| is a sum of Bernoulli random variables and it concentrates very well: if we consider a slightly scaled-down π’™βˆˆ(1βˆ’O⁒(log⁑kk))⋅𝒫ℳ, Chernoff bound (TheoremΒ 27) tells us that 𝐏𝐫[|R⁒(𝒙)|β‰₯k]≀1k. By 10, one can further derive an OCRS for k-uniform matroids with a selectability of (1βˆ’O⁒(log⁑kk))⁒(1βˆ’1k)=1βˆ’O⁒(log⁑kk).

To summarize, the greedy algorithm performs well on k-uniform matroids because of the existence of a fine-grained occupancy indicator |R⁒(x)| that concentrates well.

Main idea and challenges

For a k-fold matroid union β„³k=(E,ℐk), the simple greedy algorithm could perform very poor due to inherent non-uniformity of β„³k. In the matroid OCRS by [12], this is resolved using the idea of chain decomposition. For each level, an iterative procedure (AlgorithmΒ 2) is used to find a protection set S that includes all elements that are easily spanned by R⁒(𝒙)βˆͺS. This is done by directly looking at the probability 𝐏𝐫[e∈span⁑(R⁒(𝒙)βˆͺS)].

Our idea is to construct a different chain decomposition based on functions Ο‰e⁒(β‹…):2Eβ†’[0,k] that act as a β€œgeneralized occupancy indicator” for each element e, such that Ο‰e⁒(βˆ…)=0, Ο‰e⁒(S)=k if e is spanned by the set S, and we want Ο‰e⁒(β‹…) to be as smooth as possible (i.e., 1-Lipschitz). For each level of the chain decomposition, we will add e to the protection set S whenever the expected occupancy 𝐄[Ο‰e⁒(R⁒(𝒙)βˆͺS)] is large.

For k-uniform matroids, a simple occupancy indicator would be Ο‰e⁒(S)=min⁑(k,|S|) (since we require its value to be between 0 and k). However, extending the definition of an occupancy function to a general k-fold matroid union introduces several challenges:

  1. 1.

    (Compatibility with chain decomposition) The most crucial part of the chain decomposition in [12] is to show the protection set S is always a proper subset of E (11). Similarly, we will need to show that it is always possible to find a protection set S⊊E such that the expected occupancy 𝐄[Ο‰e⁒(R⁒(𝒙)βˆͺS)] for every e∈Eβˆ–S is smaller than k by a large enough margin.

  2. 2.

    (Chernoff-strength concentration) Based on the fact that 𝐄[Ο‰e⁒(R⁒(𝒙)βˆͺS)] is sufficiently smaller than k, we ultimately want to show that 𝐏𝐫[Ο‰e⁒(R⁒(𝒙)βˆͺS)=k] is very small, which would imply a good selectability for e. This is simple for k-uniform matroids by using Chernoff bound. However, it turns out Ο‰e⁒(β‹…) for general k-fold matroid unions does not admit a standard Chernoff-strength concentration inequality, and much more efforts are required to achieve a similar selectability guarantee.

4.3 An OCRS for π’Œ-fold matroid unions

In SubsubsectionΒ 4.3.1, we define our candidate occupancy functions and show some useful properties. Then, in SubsubsectionΒ 4.3.2, we show these functions are compatible with the chain decomposition approach and can be used to get an OCRS for k-fold matroid unions. Finally, in SubsubsectionΒ 4.3.3, we prove the selectability of this OCRS by showing these functions concentrates well enough using TheoremΒ 3. Some proofs in the section are deferred to AppendixΒ B for ease of reading.

4.3.1 The occupancy function

To define the occupancy function, we will instead work with the following extended k-fold unions which essentially introduces k parallel copies for each element. They are still matroids, and OCRS for them implies OCRS for k-fold matroid unions. Therefore, it suffices for us to give an OCRS for the extended k-fold union.

Definition 12 (Extended k-fold union).

Given a matroid β„³=(E,ℐ) and an integer kβ‰₯1, let β„³βˆ—=(Eβˆ—,β„βˆ—) be the matroid that contains k parallel copies (e,1),…,(e,k) of each element e∈E. Formally,

Eβˆ— =EΓ—[k]={(e,i):e∈E,i∈[k]},
β„βˆ— ={{(e1,i1),…,(et,it)}:{e1,…,et}βˆˆβ„,i1,…,it∈[k]}.

And we define the extended k-fold union β„³βˆ—k=(Eβˆ—,β„βˆ—k) of β„³ to be the k-fold union of β„³βˆ—.

Lemma 13.

The extended k-fold union β„³βˆ—k of a matroid β„³ is a matroid. Furthermore, a c-selectable OCRS for β„³βˆ—k implies a c-selectable OCRS for β„³k.

We are now ready to define the following occupancy function on β„³βˆ—k=(Eβˆ—=EΓ—[k],β„βˆ—k). Intuitively, the function indicates the number of β€œslots” for elements (e,β‹…)∈Eβˆ— that are occupied by the elements in S. We then show the occupancy function has good properties: it is monotone and 1-Lipschitz. More importantly, the value of Ο‰e⁒(S) can be used to deduce whether (e,β‹…)∈Eβˆ— is spanned by other elements in S.

Definition 14 (Occupancy function).

Given an extended k-fold union β„³βˆ—k=(Eβˆ—=EΓ—[k],β„βˆ—k), for every e∈E, define its occupancy function Ο‰e:2Eβˆ—β†’[0,k] as the function where for all SβŠ†Eβˆ—,777When it is clear from context, we will use rank⁑(β‹…)/span⁑(β‹…) to denote the rank/span of a set of elements in β„³βˆ—k for the ease of notation.

Ο‰e⁒(S)=kβˆ’rank⁑(Sβˆͺ({e}Γ—[k]))+rank⁑(S).
Lemma 15.

For any extended k-fold union β„³βˆ—k=(Eβˆ—,β„βˆ—k) and element (e,i)∈Eβˆ—, Ο‰e satisfies

  1. 1.

    (Monotone) Ο‰e⁒(S)≀ωe⁒(T) for every SβŠ†TβŠ†Eβˆ—;

  2. 2.

    (1-Lipschitz) Ο‰e⁒(Sβˆͺ{a})βˆ’Ο‰e⁒(S)≀1 for every SβŠ†Eβˆ— and a∈Eβˆ—.

Lemma 16.

For any extended k-fold union β„³βˆ—k=(Eβˆ—,β„βˆ—k), element (e,i)∈Eβˆ—, and set SβŠ†Eβˆ—, Ο‰e⁒(S)<k implies (e,i)βˆ‰span⁑(Sβˆ–{(e,i)}).

Example 17.

When β„³ is a 1-uniform matroid of size n, its extended k-fold union β„³βˆ—k is a k-uniform matroid of size k⁒n. For every e∈E and SβŠ†Eβˆ—, we have

rank⁑(Sβˆͺ({e}Γ—[k])) =min⁑(k,|Sβˆͺ({e}Γ—[k])|)=k,
rank⁑(S) =min⁑(k,|S|).

Therefore, Ο‰e⁒(S)=min⁑(k,|S|), i.e., the number of occupied slots by S.

Also, note that for any π±βˆ—βˆˆ(1βˆ’O⁒(log⁑kk))β‹…π’«β„³βˆ—k, the value Ο‰e⁒(R⁒(π±βˆ—)) concentrates very well as a capped sum over Bernoulli random variables. Therefore, the bad event Ο‰e⁒(R⁒(π±βˆ—))=k rarely happens and the simple greedy algorithm without protection works.

4.3.2 Chain decomposition based on occupancy functions

Similar to the matroid OCRS by [12], our OCRS for extended k-fold union β„³βˆ—k=(Eβˆ—,β„βˆ—k) and π’™βˆ—βˆˆbβ‹…π’«β„³βˆ—k starts with an offline prepossessing step that finds the following chain decomposition of elements in Eβˆ—,

βˆ…=Nβ„“βŠŠNβ„“βˆ’1βŠŠβ‹―βŠŠN1⊊N0=Eβˆ—

where Nj+1=KFoldProtect⁒(β„³βˆ—k|Nj,π’™βˆ—|Nj,b) for every 0≀j<β„“, as described in AlgorithmΒ 3. Unlike AlgorithmΒ 2, it relies on the occupancy functions which are only defined for extended k-fold unions.

Algorithm 3 Find a protection set S for extended k-fold union β„³βˆ—k=(Eβˆ—,β„βˆ—k) and π’™βˆ—βˆˆbβ‹…π’«β„³βˆ—k.

Before introducing our OCRS, we need to make sure the chain decomposition above is well-defined, i.e., AlgorithmΒ 3 will always returns a proper subset S of elements, and β„³βˆ—k|S remains an extended k-fold union. This is formally stated in 18, which resembles 11 inΒ [12].

Lemma 18.

For any b∈(0,1), any extended k-fold union β„³βˆ—k=(Eβˆ—,β„βˆ—k) and π±βˆ—βˆˆbβ‹…π’«β„³βˆ—k, S⊊Eβˆ— for S=KFoldProtect⁒(β„³βˆ—k,π±βˆ—,b). Moreover, β„³βˆ—k|S remains an extended k-fold union.

Having obtained such a chain decomposition for β„³βˆ—k and π’™βˆ—βˆˆbβ‹…π’«β„³βˆ—k, our OCRS is simply running the modified greedy algorithm, AlgorithmΒ 1, for each submatroid β„³βˆ—k|Nj with a protection set Nj+1 for all 0≀j<β„“ together. Note that although the chain decomposition is constructed with π’™βˆ—, an extra scaling factor of eβˆ’(1βˆ’b) will be applied before invoking AlgorithmΒ 1. This will be useful later when we apply the bicriterion concentration inequality.

Algorithm 4 OCRS for extended k-fold union β„³βˆ—k=(Eβˆ—,β„βˆ—k) and π’™βˆ—βˆˆbβ‹…π’«β„³βˆ—k.

The feasibility of such a scheme follows exactly from [12] as running AlgorithmΒ 1 on any chain decomposition always produces an independent set. We are left to show the OCRS guarantees a good selectability for any β„³βˆ—k and π’™βˆ—βˆˆbβ‹…π’«β„³βˆ—k for some parameter b. In fact, we will set b=1βˆ’log⁑kk and show the selectability is at least 1βˆ’O⁒(log⁑kk), proving TheoremΒ 9.

4.3.3 Analyzing the selectability

Without loss of generality, let us focus on the selectability of elements in the first layer Eβˆ—βˆ–N1, since a same proof would work for all submatroid β„³βˆ—k|Nj as they remains to be extended k-fold unions.

By 16, for every element (e,i)∈Eβˆ—βˆ–N1, its selectability can be lower bounded as

𝐏𝐫[(e,i)Β is accepted∣(e,i)Β is active] β‰₯𝐏𝐫[(e,i)βˆ‰span⁑((R⁒(eβˆ’(1βˆ’b)β’π’™βˆ—)βˆ–{(e,i)})βˆͺN1)]
β‰₯𝐏𝐫[Ο‰e⁒(R⁒(eβˆ’(1βˆ’b)β’π’™βˆ—)βˆͺN1)<k].

On the other hand, by the way chain decomposition is obtained using AlgorithmΒ 3, we know even without the extra scaling of eβˆ’(1βˆ’b), the expected value of Ο‰e⁒(R⁒(π’™βˆ—)βˆͺN1) is not too close to k:

𝐄[Ο‰e⁒(R⁒(π’™βˆ—)βˆͺN1)]≀b⁒k.

For the ease of notation, denote X=R⁒(π’™βˆ—) and Xβ€²=R⁒(eβˆ’(1βˆ’b)β’π’™βˆ—). Fixing an element (e,i)∈Eβˆ—, define the function f:2Eβˆ—β†’[0,k] where for every SβŠ†Eβˆ—,

f⁒(S)=Ο‰e⁒(SβˆͺN1).

Then, to lower bound selectability for (e,i), it is equivalent to upper bound 𝐏𝐫[f⁒(Xβ€²)=k] given that 𝐄[f⁒(X)]≀b⁒k. Specifically, to get a selectability of 1βˆ’O⁒(log⁑kk), we will set b=1βˆ’log⁑kk, and it suffices to show the following bicriterion concentration inequality:

𝐄[f⁒(X)]≀kβˆ’k⁒log⁑k⟹𝐏𝐫[f⁒(Xβ€²)β‰₯𝐄[f⁒(X)]+k⁒log⁑k]≀O⁒(1k). (βˆ—)

By 15, we know f is always monotone and 1-Lipschitz. Then, using TheoremΒ 3 (and recall that Xβ€²=R⁒(eβˆ’log⁑k/kβ’π’™βˆ—)), we have

𝐏𝐫[f⁒(Xβ€²)β‰₯𝐄[f⁒(X)]+k⁒log⁑k]≀exp⁑(βˆ’log⁑kkβ‹…k⁒log⁑k)=1k.

Therefore, for extended k-fold union β„³βˆ—k and π’™βˆ—βˆˆbβ‹…π’«β„³βˆ—k, running AlgorithmΒ 4 yields

𝐏𝐫[(e,i)Β is accepted∣(e,i)Β is active] β‰₯1βˆ’ππ«[f⁒(Xβ€²)β‰₯k]
β‰₯1βˆ’ππ«[f⁒(Xβ€²)β‰₯𝐄[f⁒(X)]+k⁒log⁑k]β‰₯1βˆ’1k.

Together with 10 and 13, we prove TheoremΒ 9 by showing the existence of an OCRS for all k-fold union β„³k and π’™βˆ—βˆˆπ’«β„³k with a selectability of

(1βˆ’1k)β‹…bβ‹…eβˆ’(1βˆ’b)=(1βˆ’1k)β‹…(1βˆ’log⁑kk)β‹…eβˆ’log⁑kk=1βˆ’O⁒(log⁑kk).
β–ΆΒ Remark 19.

It might seems bizarre and unnecessary to consider f⁒(Xβ€²) instead of f⁒(X). Indeed, since f is monotone non-decreasing, the following claim that only contains f⁒(X) would imply (βˆ— β€£ 4.3.3), and it looks more like a standard concentration inequality:

𝐄[f⁒(X)]≀kβˆ’k⁒log⁑k⟹𝐏𝐫[f⁒(X)β‰₯𝐄[f⁒(X)]+k⁒log⁑k]≀O⁒(1k). (βˆ—β£βˆ—)

We know (βˆ—β£βˆ— β€£ 19) is true when f is a sum over Bernoulli random variables by Chernoff bound, and it is tempting to use more powerful concentration inequalities to prove (βˆ—β£βˆ— β€£ 19) for general 1-Lipschitz f. Unfortunately, such a bound does not exist for general monotone and 1-Lipschitz set functions (see SectionΒ 5 for details), and it turns out to be impossible even for the specific f we use here, as 29 shown.

5 A Bicriterion Concentration Inequality

In this section, we assume the ground set E=[n] and consider a function f:{0,1}n→ℝ that satisfies the following properties:888Note that f can be equivalently viewed as a function over subsets of a ground set of size n, as we did in SectionΒ 4.

  1. 1.

    (Monotone) f⁒(𝒙)≀f⁒(π’š) for all 𝒙,π’šβˆˆ{0,1}n where π’™β‰€π’š (element-wise).

  2. 2.

    (𝟏-Lipschitz) |f⁒(𝒙)βˆ’f⁒(π’š)|≀βˆ₯π’™βˆ’π’šβˆ₯1 for all 𝒙,π’šβˆˆ{0,1}n.

Also, let 𝑿=(X1,X2,…,Xn) be a vector of n independent Bernoulli random variables where Xi∼Ber⁑(pi) for each i∈[n] and π’‘βˆˆ[0,1]n. For simplicity, we denote this as π‘ΏβˆΌBer⁑(𝒑).

We are interested in how well f⁒(𝑿) concentrates on its upper tail. By McDiarmid’s inequality (TheoremΒ 28), for every t>0,

𝐏𝐫[f⁒(𝑿)β‰₯𝐄[f⁒(𝑿)]+t]≀eβˆ’2⁒t2n,

Unfortunately, the bound depends on the dimension n, whereas our application in SectionΒ 4 requires a dimension-free bound that is independent from n. In fact, it is known that dimension-free concentration inequality does not exist for f in general (see, e.g., [27]).

The good news is that, for our application, it suffices to consider another π‘Ώβ€²βˆΌBer⁑(𝒑′) with slightly smaller parameters 𝒑′<𝒑 and show f⁒(𝑿′) does not exceed 𝐄[f⁒(𝑿)] by much, with high probability. Formally, we define 𝑿(s) with a scaling factor s as follows:

Definition 20 (Scaling).

Given n independent Bernoulli random variables π—βˆΌBer⁑(𝐩), for any scaling factor sβ‰₯0, define 𝐗(s)∼Ber⁑(eβˆ’s⁒𝐩). In other words, Xi(s)∼Ber⁑(eβˆ’s⁒pi) for all i∈[n].

And we prove TheoremΒ 3, a bicriterion concentration inequality, where the bound depends on both the scaling factor s and the deviation size t.

See 3

For our application in SectionΒ 4, we basically set s=log⁑kk,t=k⁒log⁑k for some kβ‰ˆπ„[f⁒(𝑿)] and the inequality gives us 𝐏𝐫[f⁒(𝑿(s))β‰₯k+k⁒log⁑k]≀1k. Note that this bound is sharp up to a constant factor in the exponent: even in the case where f⁒(𝒙)=βˆ‘i=1nxi, the Chernoff bound of f⁒(𝑿(s)) only yields 𝐏𝐫[f⁒(𝑿(s))β‰₯k+k⁒log⁑k]≀O⁒(1kc) for some constant c.

5.1 Technical overview

Before getting into the proof, let us first outline our approach and highlight the main difficulty. Our proof utilizes the entropy method for self-bounding functions [5, 6, 22, 7]. Roughly speaking, to prove a exponential concentration inequality for some Z=f⁒(𝑿), the plan is to establish a differential inequality for the moment-generating function 𝐄[eλ⁒Z] based on the following modified logarithmic Sobolev inequality. If this differential inequality implies strong bounds for 𝐄[eλ⁒Z], a concentration inequality can be subsequently obtained.

Lemma 21 (A modified logarithmic Sobolev inequality [20]).

Given n independent Bernoulli random variables 𝐗 and a function f:{0,1}n→ℝ. Let Z=f⁒(𝐗) and Zi=fi⁒(X1,…,Xiβˆ’1,Xi+1,…,Xn) for an arbitrary function fi:{0,1}nβˆ’1→ℝ. For any Ξ»βˆˆβ„,

λ⁒𝐄[Z⁒eλ⁒Z]βˆ’π„[eλ⁒Z]⁑log⁒𝐄[eλ⁒Z]β‰€βˆ‘i=1n𝐄[eλ⁒Z⁒ϕ⁒(βˆ’Ξ»β’(Zβˆ’Zi))]

where ϕ⁒(x)=exβˆ’xβˆ’1.

Whether 21 can be effectively converted into a useful differential inequality for 𝐄[eλ⁒Z] depends on the choice of {Zi}i∈[n]. For a monotone function f, a typical choice is Zi=f⁒(X1,…,Xiβˆ’1,0,Xi+1,…,Xn), and previous works have demonstrated that such a conversion is possible if f is 1-Lipschitz and the following condition holds almost surely for some constants a,bβ‰₯0:Β 999In this case, f is a so-called (a,b)-self-bounding function [22, 7].

βˆ‘i=1nZβˆ’Zi≀a⁒Z+b. (†)

Now, given Z(s)=f⁒(𝑿(s)) under a scaling factor s>0, one might attempt to similarly derive a differential inequality of 𝐄[eλ⁒Z(s)] based on 21 if the condition († β€£ 5.1) can be satisfied. In fact, if we define Zi(s)=f⁒(X1(s),…,Xiβˆ’1(s),0,Xi+1(s),…,Xn(s)), the following holds:

𝐄[βˆ‘i=1nZ(s)βˆ’Zi(s)]=βˆ’dd⁒s⁒𝐄[Z(s)].

Thus, if βˆ’dd⁒s⁒𝐄[Z(s)]≀a⁒Z(s)+b, then († β€£ 5.1) holds in expectation for Z(s); otherwise, E⁒[Z(s)] is decreasing rapidly with respect to s at that point.

As a result, either there exists some sβˆ—βˆˆ(0,s) such that († β€£ 5.1) holds in expectation for Z(sβˆ—), or 𝐄[Z(s)] becomes significantly smaller than 𝐄[Z(0)]. Intuitively, the latter case should directly imply a bicriterion concentration result, leaving only the former case to be addressed.101010If we do not aim for an exponential tail bound, these observations indeed suffice to get a Chebyshev-type bicriterion concentration inequality for Z(s), by using Efron-Stein inequality to bound its variance in the former case. However, it turns out that such a use of 21 crucially depends on († β€£ 5.1) holding almost surely, which is not applicable to such Z(sβˆ—) in the former case.111111Specifically, applying the entropy method for 𝐄[eλ⁒Z] requires βˆ‘i=1n𝐄[eλ⁒Z⁒(Zβˆ’Zi)]≀𝐄[eλ⁒Z⁒(a⁒Z+b)] for every Ξ», which might be false even if († β€£ 5.1) holds with very high probability.

Given this limitation, rather than working with moment-generating functions directly, we propose an alternative approach. Our key idea is to relate 21 with the following unconventional function, defined for every Ξ»β‰₯0:

F⁒(Ξ»)=𝐄[eλ⁒Z(Ξ»)].

Note that this is not a moment-generating function, as λ here also serves as the scaling factor of Z, causing the random variable Z(λ) to change with it. Surprisingly, we can obtain the following upper bound for the derivative of F⁒(λ) that aligns well with 21.

Lemma 22.

Given n independent Bernoulli random variables 𝐗 and a monotone 1-Lipschitz function f:{0,1}n→ℝ. For any λ∈(0,1],

F′⁒(Ξ»)≀𝐄[Z(Ξ»)⁒eλ⁒Z(Ξ»)]βˆ’1Ξ»β’βˆ‘i=1n𝐄[eλ⁒Z(Ξ»)⁒ϕ⁒(βˆ’Ξ»β’(Z(Ξ»)βˆ’Zi(Ξ»)))]

where Z(Ξ»)=f⁒(𝐗(Ξ»)), Zi(Ξ»)=f⁒(X1(Ξ»),…,Xiβˆ’1(Ξ»),0,Xi+1(Ξ»),…,Xn(Ξ»)), and F⁒(Ξ»)=𝐄[eλ⁒Z(Ξ»)].

By combining 22 with 21, we can conclude that for all λ∈(0,1],

λ⁒F′⁒(Ξ»)≀F⁒(Ξ»)⁒log⁑F⁒(Ξ»).

Solving this differential inequality provides an upper bound for F⁒(Ξ»). TheoremΒ 3 then follows by applying Markov’s inequality to the random variable es⁒Z(s).

5.2 Proof of TheoremΒ 3

Let Z(Ξ»)=f⁒(𝑿(Ξ»)) and Zi(Ξ»)=f⁒(X1(Ξ»),…,Xiβˆ’1(Ξ»),0,Xi+1(Ξ»),…,Xn(Ξ»)) throughout the proof. Given 21 and 22, it is not hard to show the bicriterion concentration inequality.

Proof of TheoremΒ 3.

For any λ>0, we apply 21 to Z(λ) and {Zi(λ)}i∈[n] and obtain

λ⁒𝐄[Z(Ξ»)⁒eλ⁒Z(Ξ»)]βˆ’π„[eλ⁒Z(Ξ»)]⁑log⁒𝐄[eλ⁒Z(Ξ»)]β‰€βˆ‘i=1n𝐄[eλ⁒Z(Ξ»)⁒ϕ⁒(βˆ’Ξ»β’(ZΞ»βˆ’Zi(Ξ»)))]

where ϕ⁒(x)=exβˆ’xβˆ’1. Rearranging the inequality, we have

λ⁒(𝐄[Z(Ξ»)⁒eλ⁒Z(Ξ»)]βˆ’1Ξ»β’βˆ‘i=1n𝐄[eλ⁒Z(Ξ»)⁒ϕ⁒(βˆ’Ξ»β’(ZΞ»βˆ’Zi(Ξ»)))])≀𝐄[eλ⁒Z(Ξ»)]⁑log⁒𝐄[eλ⁒Z(Ξ»)].

Together with 22, this gives us the following differential inequality for F⁒(Ξ»)=𝐄[eλ⁒Z(Ξ»)]:

λ⁒F′⁒(Ξ»)≀F⁒(Ξ»)⁒log⁑F⁒(Ξ»),βˆ€Ξ»βˆˆ(0,1].

And by letting G⁒(λ)=log⁑F⁒(λ), we can rewrite the inequality as

λ⁒G′⁒(Ξ»)≀G⁒(Ξ»),βˆ€Ξ»βˆˆ(0,1].

Note that G0⁒(Ξ»)=λ⁒𝐄[Z(0)] is a solution to λ⁒G′⁒(Ξ»)=G⁒(Ξ») for λ∈(0,1]. Define g⁒(Ξ»)=G⁒(Ξ»)βˆ’G0⁒(Ξ»)Ξ» and we have

g′⁒(Ξ»)=G′⁒(Ξ»)βˆ’G0′⁒(Ξ»)Ξ»βˆ’G⁒(Ξ»)βˆ’G0⁒(Ξ»)Ξ»2=(λ⁒G′⁒(Ξ»)βˆ’G⁒(Ξ»))βˆ’(λ⁒G0′⁒(Ξ»)βˆ’G0⁒(Ξ»))Ξ»2≀0.

Also note that limΞ»β†’0+G⁒(Ξ»)Ξ»=G′⁒(0)=F′⁒(0)F⁒(0)=𝐄[Z(0)] (where the last equality holds by 23), therefore limΞ»β†’0+g⁒(Ξ»)=0. Combining this with g′≀0, we conclude that g is non-positive on (0,1]. In other words, for λ∈(0,1],

G⁒(Ξ»)≀G0⁒(Ξ»)=λ⁒𝐄[Z(0)].

Finally, by Markov’s inequality, we conclude that for any λ∈(0,1] and t>0,

𝐏𝐫[Z(Ξ»)β‰₯𝐄[Z(0)]+t]=𝐏𝐫[eλ⁒Z(Ξ»)β‰₯eλ⁒(𝐄[Z(0)]+t)]≀𝐄[eλ⁒Z(Ξ»)]eλ⁒(𝐄[Z(0)]+t)≀eβˆ’Ξ»β’t.

β—€

We are left to prove 22. Let us first compute F′⁒(Ξ») by definition.

Lemma 23.

For any Ξ»β‰₯0, F′⁒(Ξ»)=𝐄[Z(Ξ»)⁒eλ⁒Z(Ξ»)]βˆ’βˆ‘i=1n𝐄[eλ⁒Z(Ξ»)βˆ’eλ⁒Zi(Ξ»)].

Proof.

Define a function h:ℝ×[0,1]n→ℝ as

h⁒(t,𝒒)=π„π’€βˆΌBer⁑(𝒒)[et⁒f⁒(𝒀)].

For each i∈[n] and b∈{0,1}, denote fi,b⁒(𝒀)=f⁒(Y1,…,Yiβˆ’1,b,Yi+1,…,Yn) and we can compute the partial derivative of h with respect to qi as

βˆ‚βˆ‚qi⁒h⁒(t,𝒒) =βˆ‚βˆ‚qi⁒(qiβ’π„π’€βˆΌBer⁑(𝒒)[et⁒fi,1⁒(𝒀)]+(1βˆ’qi)β’π„π’€βˆΌBer⁑(𝒒)[et⁒fi,0⁒(𝒀)])
=π„π’€βˆΌBer⁑(𝒒)[et⁒fi,1⁒(𝒀)βˆ’et⁒fi,0⁒(𝒀)].

Recall that π‘ΏβˆΌBer⁑(𝒑) and F⁒(Ξ»)=𝐄[eλ⁒Z(Ξ»)]=h⁒(Ξ»,eβˆ’Ξ»β’π’‘). Therefore,

F′⁒(Ξ») =d⁒tdβ’Ξ»β‹…βˆ‚t⁒h⁒(Ξ»,eβˆ’Ξ»β’π’‘)+βˆ‘i=1nd⁒qidβ’Ξ»β‹…βˆ‚βˆ‚qi⁒h⁒(Ξ»,eβˆ’Ξ»β’π’‘)
=1⋅𝐄[f⁒(𝑿(Ξ»))⁒eλ⁒f⁒(𝑿(Ξ»))]+βˆ‘i=1n(βˆ’eβˆ’Ξ»β’pi)β‹…π„π’€βˆΌBer⁑(eβˆ’Ξ»β’π’‘)[eλ⁒fi,1⁒(𝒀)βˆ’eλ⁒fi,0⁒(𝒀)]
=𝐄[f⁒(𝑿(Ξ»))⁒eλ⁒f⁒(𝑿(Ξ»))]βˆ’βˆ‘i=1nπ„π’€βˆΌBer⁑(eβˆ’Ξ»β’π’‘)[eλ⁒f⁒(𝒀)βˆ’eλ⁒fi,0⁒(𝒀)]
=𝐄[Z(Ξ»)⁒eλ⁒Z(Ξ»)]βˆ’βˆ‘i=1n𝐄[eλ⁒Z(Ξ»)βˆ’eλ⁒Zi(Ξ»)].

β—€

Then we further derive a lower bound to the latter term, βˆ‘i=1n𝐄[eλ⁒Z(Ξ»)βˆ’eλ⁒Zi(Ξ»)].

Lemma 24.

For any Ξ»β‰₯0, βˆ‘i=1n𝐄[eλ⁒Z(Ξ»)βˆ’eλ⁒Zi(Ξ»)]β‰₯λ⁒eβˆ’Ξ»β’βˆ‘i=1n𝐄[eλ⁒Z(Ξ»)⁒(Z(Ξ»)βˆ’Zi(Ξ»))].

Proof.

We prove the inequality for each term separately and without expectation. For any i∈[n], note that

eλ⁒Z(Ξ»)βˆ’eλ⁒Zi(Ξ»)=eλ⁒Zi(Ξ»)⁒(eλ⁒(Z(Ξ»)βˆ’Zi(Ξ»))βˆ’1)β‰₯eλ⁒Zi(Ξ»)⋅λ⁒(Z(Ξ»)βˆ’Zi(Ξ»))

since exβˆ’1β‰₯x. Meanwhile, we know Zi(Ξ»)β‰₯Z(Ξ»)βˆ’1 by 1-Lipschitzness of f. Therefore,

eλ⁒Z(Ξ»)βˆ’eλ⁒Zi(Ξ»)β‰₯λ⁒eβˆ’Ξ»β‹…eλ⁒Z(Ξ»)⁒(Z(Ξ»)βˆ’Zi(Ξ»)).

β—€

The following two facts of the function ϕ⁒(x)=exβˆ’xβˆ’1 will also be used.

Fact 25.

For any λ∈(0,1], ϕ⁒(βˆ’Ξ»)λ≀λ⁒eβˆ’Ξ».

Fact 26.

For any Ξ»βˆˆβ„ and x∈[0,1], ϕ⁒(βˆ’Ξ»β’x)≀ϕ⁒(βˆ’Ξ»)⁒x.

Now we are ready to prove the lemma.

Proof of 22.

We upper bound F′⁒(Ξ») step-by-step as follows:

F′⁒(Ξ») =𝐄[Z(Ξ»)⁒eλ⁒Z(Ξ»)]βˆ’βˆ‘i=1n𝐄[eλ⁒Z(Ξ»)βˆ’eλ⁒Zi(Ξ»)] (23)
≀𝐄[Z(Ξ»)⁒eλ⁒Z(Ξ»)]βˆ’Ξ»β’eβˆ’Ξ»β’βˆ‘i=1n𝐄[eλ⁒Z(Ξ»)⁒(Z(Ξ»)βˆ’Zi(Ξ»))] (24)
≀𝐄[Z(Ξ»)⁒eλ⁒Z(Ξ»)]βˆ’Ο•β’(βˆ’Ξ»)Ξ»β’βˆ‘i=1n𝐄[eλ⁒Z(Ξ»)⁒(Z(Ξ»)βˆ’Zi(Ξ»))] (25)
≀𝐄[Z(Ξ»)⁒eλ⁒Z(Ξ»)]βˆ’1Ξ»β’βˆ‘i=1n𝐄[eλ⁒Z(Ξ»)⁒ϕ⁒(βˆ’Ξ»β’(Z(Ξ»)βˆ’Zi(Ξ»)))] (26)

where in the last step we also use the fact that Z(Ξ»)βˆ’Zi(Ξ»)∈[0,1], as f is monotone and 1-Lipschitz. β—€

References

  • [1] Saeed Alaei. Bayesian combinatorial auctions: Expanding single buyer mechanisms to many buyers. SIAM Journal on Computing, 43(2):930–972, 2014. doi:10.1137/120878422.
  • [2] Saeed Alaei, HuΒ Fu, Nima Haghpanah, Jason Hartline, and Azarakhsh Malekian. Bayesian optimal auctions via multi- to single-agent reduction. In Proceedings of the 13th ACM Conference on Electronic Commerce, EC ’12, pageΒ 17, New York, NY, USA, 2012. Association for Computing Machinery. doi:10.1145/2229012.2229017.
  • [3] C.Β J. Argue, Anupam Gupta, Marco Molinaro, and Sahil Singla. Robust secretary and prophet algorithms for packing integer programs. In JosephΒ (Seffi) Naor and Niv Buchbinder, editors, Proceedings of the 2022 ACM-SIAM Symposium on Discrete Algorithms, SODA 2022, Virtual Conference / Alexandria, VA, USA, January 9 - 12, 2022, pages 1273–1297. SIAM, SIAM, 2022. doi:10.1137/1.9781611977073.53.
  • [4] PabloΒ Daniel Azar, Robert Kleinberg, and S.Β Matthew Weinberg. Prophet inequalities with limited information. In Proceedings of the Twenty-Fifth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2014, Portland, Oregon, USA, January 5-7, 2014, pages 1358–1377, 2014. doi:10.1137/1.9781611973402.100.
  • [5] StΓ©phane Boucheron, GΓ‘bor Lugosi, and Pascal Massart. A sharp concentration inequality with applications. Random Structures & Algorithms, 16(3):277–292, 2000. doi:10.1002/(SICI)1098-2418(200005)16:3\%3C277::AID-RSA4\%3E3.0.CO;2-1.
  • [6] StΓ©phane Boucheron, GΓ‘bor Lugosi, and Pascal Massart. Concentration inequalities using the entropy method. The Annals of Probability, 31(3):1583–1614, 2003.
  • [7] Stephane Boucheron, Gabor Lugosi, and Pascal Massart. On concentration of self-bounding functions. Electronic Journal of Probability, 14:1884–1899, 2009.
  • [8] Shuchi Chawla, JasonΒ D. Hartline, DavidΒ L. Malec, and Balasubramanian Sivan. Multi-parameter mechanism design and sequential posted pricing. In LeonardΒ J. Schulman, editor, Proceedings of the 42nd ACM Symposium on Theory of Computing, STOC 2010, Cambridge, Massachusetts, USA, 5-8 June 2010, pages 311–320. ACM, 2010. doi:10.1145/1806689.1806733.
  • [9] Chandra Chekuri, Junkai Song, and Weizhong Zhang. Contention resolution for the l-fold union of a matroid via the correlation gap. In 2024 Symposium on Simplicity in Algorithms (SOSA), pages 396–405. SIAM, 2024. doi:10.1137/1.9781611977936.36.
  • [10] Atanas Dinev and S.Β Matthew Weinberg. Simple and Optimal Online Contention Resolution Schemes for k-Uniform Matroids. In Venkatesan Guruswami, editor, 15th Innovations in Theoretical Computer Science Conference (ITCS 2024), volume 287 of Leibniz International Proceedings in Informatics (LIPIcs), pages 39:1–39:23, Dagstuhl, Germany, 2024. Schloss Dagstuhl – Leibniz-Zentrum fΓΌr Informatik. doi:10.4230/LIPIcs.ITCS.2024.39.
  • [11] Paul DΓΌtting, Michal Feldman, Thomas Kesselheim, and Brendan Lucier. Prophet inequalities made easy: Stochastic optimization by pricing nonstochastic inputs. SIAM J. Comput., 49(3):540–582, 2020. doi:10.1137/20M1323850.
  • [12] Moran Feldman, Ola Svensson, and Rico Zenklusen. Online contention resolution schemes. In Robert Krauthgamer, editor, Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2016, Arlington, VA, USA, January 10-12, 2016, pages 1014–1033. SIAM, 2016. doi:10.1137/1.9781611974331.ch72.
  • [13] MohammadΒ Taghi Hajiaghayi, Robert Kleinberg, and Tuomas Sandholm. Automated online mechanism design and prophet inequalities. In AAAI, volumeΒ 7, pages 58–65, 2007. URL: http://www.aaai.org/Library/AAAI/2007/aaai07-009.php.
  • [14] Jiashuo Jiang, Will Ma, and Jiawei Zhang. Tight guarantees for multi-unit prophet inequalities and online stochastic knapsack. In Proceedings of the 2022 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1221–1246. SIAM, 2022. doi:10.1137/1.9781611977073.51.
  • [15] Thomas Kesselheim, Andreas TΓΆnnis, Klaus Radke, and Berthold VΓΆcking. Primal beats dual on online packing lps in the random-order model. In Proceedings of the forty-sixth annual ACM symposium on Theory of computing, pages 303–312, 2014. doi:10.1145/2591796.2591810.
  • [16] Robert Kleinberg and S.Β Matthew Weinberg. Matroid prophet inequalities. In Proceedings of the 44th Symposium on Theory of Computing Conference, STOC 2012, New York, NY, USA, May 19 - 22, 2012, pages 123–136, 2012. doi:10.1145/2213977.2213991.
  • [17] Ulrich Krengel and Louis Sucheston. On semiamarts, amarts, and processes with finite value. Advances in Probability and Related Topics, 4:197–266, 1978.
  • [18] Felix Lazebnik, VasiliyΒ A Ustimenko, and AndrewΒ J Woldar. A new series of dense graphs of high girth. Bulletin of the American mathematical society, 32(1):73–79, 1995.
  • [19] Euiwoong Lee and Sahil Singla. Optimal Online Contention Resolution Schemes via Ex-Ante Prophet Inequalities. In 26th Annual European Symposium on Algorithms (ESA 2018), volume 112, pages 57:1–57:14. Schloss Dagstuhl – Leibniz-Zentrum fΓΌr Informatik, 2018. doi:10.4230/LIPIcs.ESA.2018.57.
  • [20] Pascal Massart. About the constants in talagrand’s concentration inequalities for empirical processes. The Annals of Probability, 28(2):863–884, 2000.
  • [21] Colin McDiarmid etΒ al. On the method of bounded differences. Surveys in combinatorics, 141(1):148–188, 1989.
  • [22] Colin McDiarmid and Bruce Reed. Concentration for self-bounding functions and an inequality of talagrand. Random Structures & Algorithms, 29(4):549–557, 2006. doi:10.1002/rsa.20145.
  • [23] Aviad Rubinstein and Sahil Singla. Combinatorial prophet inequalities. In Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2017, Barcelona, Spain, Hotel Porta Fira, January 16-19, pages 1671–1687, 2017. doi:10.1137/1.9781611974782.110.
  • [24] Aviad Rubinstein and S.Β Matthew Weinberg. Simple mechanisms for a subadditive buyer and applications to revenue monotonicity. In Proceedings of the Sixteenth ACM Conference on Economics and Computation, EC ’15, Portland, OR, USA, June 15-19, 2015, pages 377–394, 2015. doi:10.1145/2764468.2764510.
  • [25] Ester Samuel-Cahn. Comparison of threshold stop rules and maximum for independent nonnegative random variables. Annals of Probability, 12(4):1213–1216, 1984.
  • [26] Gideon Schechtman. Concentration, results and applications. In Handbook of the geometry of Banach spaces, volumeΒ 2, pages 1603–1634. Elsevier, 2003.
  • [27] Jan VondrΓ‘k. A note on concentration of submodular functions. CoRR, abs/1005.2791, 2010. doi:10.48550/arXiv.1005.2791.

Appendix A Useful Concentration Inequalities

Theorem 27 (Multiplicative Chernoff bound).

Given n independent Bernoulli random variables X1,X2,…,Xn, let X=βˆ‘i=1nXi denote their sum. For any Ξ΄>0, we have

𝐏𝐫[Xβ‰₯(1+Ξ΄)⁒𝐄[X]]≀exp⁑(βˆ’Ξ΄2⁒𝐄[X]2+Ξ΄).
Theorem 28 (McDiarmid’s inequality).

Given n independent random variables X1,X2,…,Xnβˆˆπ’³ and a function f:𝒳n→ℝ. If for every i∈[n] and x1,x2,…,xn,xiβ€²βˆˆπ’³, the function f satisfies

|f⁒(x1,…,xiβˆ’1,xi,xi+1,…,xn)βˆ’f⁒(x1,…,xiβˆ’1,xiβ€²,xi+1,…,xn)|≀ci,

then for any t>0, we have

𝐏𝐫[f⁒(X)β‰₯𝐄[f⁒(X)]+t]≀exp⁑(βˆ’2⁒t2βˆ‘i=1nci2).

Appendix B Missing Proofs and Examples

Proof of 13.

It is straightforward to check β„³βˆ— in 12 is a matroid, and hence its k-fold union β„³βˆ—k remains a matroid by the closure property of matroid union. Also, note that the restriction of β„³βˆ—k to EΓ—{1}, β„³βˆ—k∣EΓ—{1}, is isomorphic to the k-fold union β„³k of β„³, as there exists a simple bijection (e,1)↦e between EΓ—{1},β„βˆ—|EΓ—{1} and E,ℐ. Therefore, an Ξ±-selectable OCRS for β„³βˆ—k can also be used as an Ξ±-selectable OCRS forΒ β„³k. β—€

Proof of 15.

Note that the rank function for any matroid is a submodular function. Therefore, rank⁑(Sβˆͺ({e}Γ—[k]))βˆ’rank⁑(S)β‰₯rank⁑(Tβˆͺ({e}Γ—[k]))βˆ’rank⁑(T) for every SβŠ†T by a simple induction, and thus Ο‰e⁒(β‹…) is monotone.

Also, we know the rank function is monotone, and the rank of a set can increase by at most 1 after adding an element. Therefore, rank⁑(Sβˆͺ{a})β‰₯rank⁑(S) and rank⁑(Sβˆͺ{a}βˆͺ({e}Γ—[k]))≀rank⁑(Sβˆͺ({e}Γ—[k]))+1 for every a∈Eβˆ— and SβŠ†Eβˆ—. As a result, Ο‰e⁒(β‹…) is 1-Lipschitz. β—€

Proof of 16.

When Ο‰e⁒(S)<k, we have rank⁑(Sβˆͺ({e}Γ—[k]))>rank⁑(S) and there exists (at least) one element (e,j)∈{e}Γ—[k] such that (e,j)βˆ‰span⁑(S). By definition of extended k-fold union, it further implies (e,i)βˆ‰span⁑(Sβˆ–{(e,i)}). β—€

Proof of 18.

It is straightforward to see S=S0Γ—[k] when the algorithm terminates, and thus β„³βˆ—k|S0Γ—[k] is the extended k-fold union of β„³|S0 by definition. It remains to prove S⊊Eβˆ—. Since S must be a subset of the universe Eβˆ—, it suffices to show Sβ‰ Eβˆ—. Our plan is to show that S is not full rank in β„³βˆ—k, even after combined with all active elements R⁒(π’™βˆ—) and take the expectation, i.e.,

𝐄[rankβ„³βˆ—k⁑(R⁒(π’™βˆ—)βˆͺS)]<rankβ„³βˆ—k⁑(Eβˆ—).

This would directly imply Sβ‰ Eβˆ— by the monotonicity of the rank function.

Denote r=rankℳ⁑(S0). Let e1,e2,…,er∈S0 be the elements from β„³ that increase the rank of S0 in β„³ during the execution of AlgorithmΒ 3, and denote ei (for 1≀i≀r) as the specific element that increases rankℳ⁑(S0) from iβˆ’1 to i. By definition, spanℳ⁑({e1,e2,…,er})=S0. In fact, we also have

spanβ„³βˆ—k⁑({e1,e2⁒…,er}Γ—[k])=S.

This is because {e1,e2,…,er}Γ—[k]βŠ†S is an independent set of size k⁒r in β„³βˆ—k by definition of the extended k-fold union, and we can further show it is a basis of S. Suppose it is not, then there must be another independent set TβŠ†S of size larger than k⁒r. Since one can partition T into k disjoint independent sets T1,T2,…,Tk in β„³βˆ— where βˆ‘j∈[k]|Tj|>k⁒r, we know there exists some Tj of size larger than r, which leads to a contradiction as TjβŠ†S0Γ—[k] and rankβ„³βˆ—β‘(S0Γ—[k])=r.

Also, by the way the algorithm picks elements to be added to S0, for every 1≀i≀r we have

𝐄[Ο‰ei⁒(R⁒(π’™βˆ—)βˆͺ({e1,e2,…,eiβˆ’1}Γ—[k]))]>b⁒k.

Equivalently, we have

𝐄[rankβ„³βˆ—k⁑(R⁒(π’™βˆ—)βˆͺ({e1,e2,…,ei}Γ—[k]))βˆ’rankβ„³βˆ—k⁑(R⁒(π’™βˆ—)βˆͺ({e1,e2,…,eiβˆ’1}Γ—[k]))]<(1βˆ’b)⁒k.

Together with these observations, we can upper bound 𝐄[rankβ„³βˆ—k⁑(R⁒(π’™βˆ—)βˆͺS)] by a telescoping sum as follows:

𝐄[rankβ„³βˆ—k⁑(R⁒(π’™βˆ—)βˆͺS)] =𝐄[rankβ„³βˆ—k⁑(R⁒(π’™βˆ—)βˆͺ{e1,e2,…,er}Γ—[k])]
=𝐄[rankβ„³βˆ—k(R(π’™βˆ—))]+βˆ‘i=1r𝐄[rankβ„³βˆ—k(R(π’™βˆ—)βˆͺ({e1,e2,…,ei}Γ—[k]))
βˆ’rankβ„³βˆ—k(R(π’™βˆ—)βˆͺ({e1,e2,…,eiβˆ’1})Γ—[k])]
<𝐄[rankβ„³βˆ—k⁑(R⁒(π’™βˆ—))]+(1βˆ’b)⁒k⁒r.

The former term 𝐄[rankβ„³βˆ—k⁑(R⁒(π’™βˆ—))] can be trivially upper bounded by 𝐄[|R⁒(π’™βˆ—)|] and further by b⁒rankβ„³βˆ—k⁑(Eβˆ—) due to π’™βˆ—βˆˆbβ‹…π’«β„³βˆ—k. For the latter term involving k⁒r, we already know k⁒r=rankβ„³βˆ—k⁑(S)≀rankβ„³βˆ—k⁑(Eβˆ—). In conclusion, we have

𝐄[rankβ„³βˆ—k⁑(R⁒(π’™βˆ—)βˆͺS)]<b⁒rankβ„³βˆ—k⁑(Eβˆ—)+(1βˆ’b)⁒rankβ„³βˆ—k⁑(Eβˆ—)=rankβ„³βˆ—k⁑(Eβˆ—).

β—€

Example 29 (A counterexample to (βˆ—β£βˆ— β€£ 19)).

Fix parameters n,k where n≫k, and consider the case when β„³ is an n-uniform matroid of size 2⁒n. Its extended k-fold union β„³βˆ—k is a k⁒n-uniform matroid of size 2⁒k⁒n. Similar to 17, for every e∈E and SβŠ†Eβˆ— we can derive

Ο‰e⁒(S)={0,|S|≀k⁒nβˆ’k|S|βˆ’(k⁒nβˆ’k),k⁒nβˆ’k<|S|<k⁒nk,|S|β‰₯k⁒n.

Since no protection is needed for uniform matroids, let f⁒(β‹…)=Ο‰e⁒(β‹…) for some fixed e∈E. When π±βˆ—=(12βˆ’12⁒n)β‹…πŸEβˆ— (namely, every element in Eβˆ— is active with probability 12βˆ’12⁒n), |X| will follow a binomial distribution with k⁒nβˆ’k as both its mean and median. As a result,

𝐄[f⁒(X)] ≀k⁒𝐏𝐫[f⁒(X)>0]=k⁒𝐏𝐫[|X|>k⁒nβˆ’k]≀k2,
while𝐏𝐫[f⁒(X)β‰₯k] =𝐏𝐫[|X|β‰₯k⁒n]β‰₯Ω⁒(1), (n≫k)

which is a counterexample to the claim (βˆ—β£βˆ— β€£ 19).

Note that this is not an actual counterexample to AlgorithmΒ 4 (even without the extra scaling) since π±βˆ—βˆ‰(1βˆ’O⁒(log⁑kk))β‹…π’«β„³βˆ—k. But it shows that the condition 𝐄[f⁒(X)]≀kβˆ’O⁒(k⁒log⁑k) alone is not enough to derive a good enough upper bound for 𝐏𝐫[f⁒(X)β‰₯k], and it is crucial to also rely on the scaling applied to π±βˆ—.