Abstract 1 Introduction 2 Preliminaries 3 A (𝟏/𝟐𝜺)-approximation random-order streaming algorithm 4 Interlude: continuous greedy filtering 5 A (𝟏𝟏/𝒆𝜺)-approximation offline algorithm References

Fixed-Parameter Tractable Submodular Maximization over a Matroid

Shamisa Nematollahi ORCID CNRS, IRIF, Université Paris Cité, Paris, France Adrian Vladu ORCID CNRS, IRIF, Université Paris Cité, Paris, France Junyao Zhao ORCID CNRS, IRIF, Université Paris Cité, Paris, France
Abstract

In this paper, we design fixed-parameter tractable (FPT) algorithms for (non-monotone) submodular maximization subject to a matroid constraint, where the matroid rank r is treated as a fixed parameter that is independent of the total number of elements n. We provide two FPT algorithms: one for the offline setting and another for the random-order streaming setting. Our streaming algorithm achieves a 1/2ε approximation using 𝒪~(rpoly(ε)) memory, while our offline algorithm obtains a 11/eε approximation with n2𝒪~(rpoly(ε)) runtime and 𝒪~(rpoly(ε)) memory. Both approximation factors are near-optimal in their respective settings, given existing hardness results. In particular, our offline algorithm demonstrates that – unlike in the polynomial-time regime – there is essentially no separation between monotone and non-monotone submodular maximization under a matroid constraint in the FPT framework.

Keywords and phrases:
Submodular maximization, matroids, parameterized complexity, streaming algorithms
Funding:
Shamisa Nematollahi: Partially supported by the French Agence Nationale de la Recherche (ANR) under grant ANR-21-CE48-0016 (project COMCOPT).
Adrian Vladu: Partially supported by the French Agence Nationale de la Recherche (ANR) under grant ANR-21-CE48-0016 (project COMCOPT).
Junyao Zhao: Supported by a postdoctoral fellowship from the Fondation Sciences Mathématiques de Paris (FSMP).
Copyright and License:
[Uncaptioned image] © Shamisa Nematollahi, Adrian Vladu, and Junyao Zhao; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Design and analysis of algorithms
Related Version:
Full paper: https://arxiv.org/abs/2509.01591
Editor:
Shubhangi Saraf

1 Introduction

Submodular maximization under a matroid constraint is a fundamental problem in combinatorial optimization, where we are given111We assume that the submodular function f is given by a value oracle that outputs f(X) for input X[n], and the matroid is given by a membership oracle that answers whether X for input X[n]. a submodular function f:2[n]0 and a matroid 2[n] with rank r+, and our task is to find a set of elements X that maximizes the function value f(X). For this problem, there is a well-known gap between monotone (i.e., non-decreasing) and non-monotone submodular functions: For monotone submodular functions, the optimal polynomial-time algorithm achieves a 11/e approximation to the optimal solution [8], whereas for non-monotone submodular functions, no polynomial-time algorithm can obtain a 0.478 approximation [17, 12], even for the special case of the cardinality constraint [25], where the matroid simply consists of all subsets of [n] with size at most r.

One of the most important paradigms in theoretical computer science for overcoming the limitations of polynomial-time algorithms is parameterized complexity, which treats a specific component of the problem as a parameter that is fixed at a small value. In the context of matroid-constrained submodular maximization, it is natural to assume that the matroid rank r is a small parameter222For example, in data summarization [23], given a large image dataset, an algorithm needs to choose a representative subset of images that is small enough to fit the screen of our smartphones. and design fixed-parameter tractable (FPT) algorithms, i.e., algorithms with runtime h(r)poly(n), where h can be any finite function.

For the special case of the cardinality constraint, [26] designed an FPT algorithm that guarantees a 0.539 approximation for non-monotone submodular functions, showing a substantial advantage of FPT algorithms over polynomial-time algorithms. On the hardness side, essentially the only known query complexity lower bound that applies to FPT algorithms is from the classic work of [24]. They proved that any algorithm that achieves a better-than-(11/e) approximation requires nΩ(r) queries to the value oracle, a result that holds for both monotone and non-monotone submodular maximization with the cardinality constraint. These results seem to suggest that the gap between monotone and non-monotone submodular maximization (at least for the cardinality constraint) could be narrowed further, or perhaps even closed, in the FPT regime. This brings us to the question:

For matroid/cardinality-constrained non-monotone submodular maximization, what is the best possible approximation ratio that can be achieved by FPT algorithms333The special case of this question for the cardinality constraint was posed by [26].? Is there a separation between monotone and non-monotone submodular maximization under a matroid/cardinality constraint in the FPT framework?

In this paper, we reveal a surprising answer to this question: There is essentially no separation between monotone and non-monotone submodular maximization under a matroid constraint in the FPT regime. Specifically, we design a nearly (11/e)-approximation FPT algorithm (Algorithm 3) for general (non-monotone) submodular maximization subject to a matroid constraint.

Theorem 1 (Restatement of Theorem 28).

There is an FPT algorithm that achieves a 11/eε approximation for maximizing any non-negative submodular function under a matroid constraint, with n2𝒪~(rpoly(ε)) runtime and 𝒪~(rpoly(ε)) memory for any ε>0.

The bedrock of our (11/eε)-approximation FPT algorithm is a subroutine (Subroutine 2) that can be simplified into a random-order semi-streaming algorithm (Algorithm 1), and it has significant implications for the streaming setting as well. Briefly, a random-order streaming algorithm makes a single pass over elements of [n] in a uniformly random order. During the pass, the algorithm maintains a subset of visited elements in a memory buffer of bounded size, and it is allowed to make unlimited queries to the value oracle of the function f and the membership oracle of the matroid for any subset of elements stored in its memory. In the literature, there is particular interest in streaming algorithms that use 𝒪~(r) memory, which are known as semi-streaming algorithms.

Our random-order semi-streaming algorithm (Algorithm 1) achieves a nearly 1/2 approximation. (We note that a nearly 1/2 approximation is known to be achievable by a streaming algorithm with 2𝒪(r) memory, for the more general setting of matroid intersection constraints and adversarial-order streams [15].) This matches the lower bound established by [26], which asserts that any random-order streaming algorithm exceeding 1/2 approximation must use Ω(nr2) memory, even for the special case of the cardinality constraint.

Theorem 2 (Restatement of Theorem 16).

There is a random-order semi-streaming algorithm that achieves a 1/2ε approximation for maximizing any non-negative submodular function under a matroid constraint, using 𝒪~(rpoly(ε)) memory for any ε>0.

In comparison, the previous best random-order semi-streaming algorithm for maximizing monotone submodular functions under a matroid constraint achieves a (1e220.432) approximation [27]. For non-monotone functions, the best known semi-streaming algorithm obtains a 0.1921 approximation [14], though this algorithm applies to the more restrictive adversarial-order streaming setting. We highlight that the algorithms of [27] and [14] run in polynomial time, whereas our algorithm runs in FPT time.

1.1 Overview of our approach

Deferring a more detailed exposition to the main sections, we provide an overview of our algorithms and techniques. As mentioned previously, the core of our (11/eε)-approximation algorithm (Algorithm 3) is Subroutine 2, which can be simplified into a (1/2ε)-approximation semi-streaming algorithm (Algorithm 1). We call this subroutine continuous greedy filtering. It is inspired by a filtering technique developed by [26] for cardinality-constrained non-monotone submodular maximization, and a fast continuous greedy algorithm designed by [4] for matroid-constrained monotone submodular maximization.

Continuous greedy filtering

The initial phase of this subroutine maintains a fractional solution over 1ε epochs. In each epoch t[1ε], the subroutine constructs an incremental sequence of integral solutions S1t,,Srt through r selection steps. At each step i[r], it samples a small subset of elements. Among the elements that can be added to Si1t, it selects the element sit that maximizes the marginal value relative to the current fractional solution xi1t (a “weighted average” of the integral solutions Sr1,,Srt1,Si1t). The integral solution Sit is then updated to be Si1t{sit}.

In the second phase, the subroutine uses the integral solutions constructed in the first phase as a filter to select a subset H of elements from [n]. Specifically, for each element e[n], e is included in H if it can be added to Si1t and its marginal value with respect to xi1t exceeds that of element sit, for some t[1ε] and i[r]. A minor detail is that the subroutine rounds the marginal values of e and sit to the closest power of 1+ε before comparing them. This detail will keep the subroutine’s memory usage nearly linear in the matroid rank r.

This subroutine achieves a key property: We partition the optimal solution O into two sets OL:=OH and OH:=OH. Then, the subroutine guarantees that there is a feasible subset of t=11/εSrt that is a 11/eε approximation to the partial optimal solution OL.

(𝟏𝟏/𝒆𝜺)-approximation: a recursive approach

Our (11/eε)-approximation algorithm applies the continuous greedy filtering subroutine recursively on refined instances. Specifically, after the first invocation of the subroutine on the input instance (f,), if the value of OH is negligible, then by submodularity, OL must account for almost all the optimal value. In this case, because of the aforementioned property, our algorithm can find a (11/eε)-approximate solution through an exhaustive search over all subsets of t=11/εSrt. If otherwise, our algorithm enumerates all subsets of H to identify OH (the algorithm will encounter OH in some iteration of the enumeration, even though it does not know what OH is), and then, it recursively applies the above process to a refined instance (f,), where the matroid is obtained by contracting the original matroid by the set OH (see Definition 9), and the function f is defined by f(X):=f(XOH). The high-level intuition is that, after sufficiently many recursions, we will eventually reach the first case, where the value of OH (the analogue of OH in the refined instance) is negligible.

(𝟏/𝟐𝜺)-approximation semi-streaming algorithm: single-epoch filtering

Our random-order semi-streaming algorithm simplifies continuous greedy filtering by performing a single-epoch greedy procedure to construct an integral solution in the first ε fraction of the random stream, followed by a similar filtering phase in the remaining stream. The final solution is obtained by enumerating all subsets of elements selected during these two phases and choosing the best one. In the main body of the paper, we will first present this simpler algorithm and its analysis.

Comparison to [26]

Briefly, [26] first designed a (1/2ε)-approximation random-order streaming algorithm with quadratic memory for cardinality-constrained non-monotone submodular maximization, based on the filtering technique. They then identified cases where their streaming algorithm only achieves a 1/2 approximation, and introduced an offline postprocessing procedure to handle those cases and improve the approximation ratio. Their analysis of this procedure relies on a factor-revealing program to balance different cases. To better leverage this factor-revealing program, they extended their postprocessing procedure to a recursive one, which led to a 0.539-approximation FPT algorithm.

Our semi-streaming algorithm is a generalization of their streaming algorithm to the matroid constraint, with improved memory usage. Compared to their 0.539-approximation FPT algorithm, the two main novelties of our (11/eε)-approximation algorithm, as highlighted above, are (i) the combination of the filtering technique and the continuous greedy algorithm, which enables our subroutine to achieve the aforementioned key property under the matroid constraint, and (ii) a natural recursive procedure that applies the subroutine repeatedly to refine the problem instance. This differs significantly from the recursive procedure of [26], which is used only to postprocess an output produced once by their streaming algorithm.

1.2 Additional related work

FPT submodular maximization

The parameterized complexity of submodular maximization was first studied by [28], who provided FPT approximation schemes for maximizing p-separable monotone submodular functions under the cardinality constraint, assuming that both the cardinality constraint and p are fixed parameters. If the cardinality constraint is the only parameter, without further assumptions, even FPT algorithms cannot obtain a better-than-(11/e) approximation for monotone submodular functions, both in terms of query complexity [24] and computational complexity under the Gap-Exponential Time Hypothesis [11, 22]. For non-monotone submodular maximization under the cardinality constraint, [26] gave a 0.539-approximation FPT algorithm, while the current best polynomial-time algorithm achieves a 0.401 approximation [6], which also applies to the matroid constraint.

Streaming submodular maximization

In the random-order streaming setting, [1] designed a nearly (11/e)-approximation semi-streaming algorithm for monotone submodular maximization subject to a cardinality constraint, which was later simplified by [21] to reduce the memory size. Moreover, under the cardinality constraint, [26] designed a quadratic-memory streaming algorithm that achieves a 0.5029 approximation and a 1/2 approximation for symmetric and asymmetric submodular maximization respectively. Under the matroid constraint, [27] gave a 1e22-approximation semi-streaming algorithm for monotone submodular maximization. All these random-order streaming algorithms (and ours) also apply to the (similar but incomparable) secretary with shortlists model [1].

In the adversarial-order streaming setting, [3] designed a nearly 1/2-approximation semi-streaming algorithm for cardinality-constrained monotone submodular maximization, which was later improved by [20] to achieve a linear memory size. On the hardness side, [19] showed that any streaming algorithm for monotone submodular maximization that achieves an approximation better than 22+2 under a cardinality constraint must use Ω(nr2) memory, while attaining an approximation better than r2r1 under a partition matroid constraint requires Ω(nr) memory. Later, [16] proved that any streaming algorithm achieving a better-than-1/2 approximation for cardinality-constrained monotone submodular maximization requires Ω(nr3) memory. More recently, [2] designed a nearly 1/2-approximation semi-streaming algorithm for cardinality-constrained non-monotone submodular maximization, which is also an FPT algorithm (although not explicitly stated in their paper). Furthermore, matroid-constrained submodular maximization in the adversarial-order streaming setting was first studied by [9], who provided a 1/4-approximation semi-streaming algorithm for monotone functions. This was recently improved to a 0.3178 approximation by [14], who also gave a 0.1921 approximation for non-monotone functions. Finally, we mention that [14] developed a multi-pass nearly (11/e)-approximation semi-streaming algorithm for matroid-constrained monotone submodular maximization, which also builds on the continuous greedy algorithm of [4]. However, their approach is very different from ours.

2 Preliminaries

2.1 Submodular functions and matroids

We first introduce submodular functions, matroids, and their associated concepts and properties. Given a ground set [n] and a non-negative discrete function f:2[n]0, for any X,Y[n], we let f(X|Y) denote the marginal value of X with respect to Y, i.e., f(X|Y):=f(XY)f(Y).

Definition 3 (submodular function).

A non-negative function f:2[n]0 is submodular, if it satisfies that f(X|Y)f(X|Z) for any X[n] and any YZ[n].

For any vector x[0,1]n, we use the notation (x) to refer to a random set such that each element of [n] appears in (x) independently with probability xi, and we define the multi-linear extension of a submodular function as follows.

Definition 4 (multi-linear extension).

The multi-linear extension F:[0,1]n0 of a submodular function f:2[n]0 is given by

F(x):=𝔼[f((x))]=S[n]iSxii[n]S(1xi)f(S) for all x[0,1]n.

Moreover, for any x,y[0,1]n such that x+y[0,1]n, we let F(x|y) denote the marginal value of x with respect to y, i.e., F(x|y):=F(x+y)F(y).

We are interested in the problem of maximizing a non-negative submodular function subject to a matroid constraint. A matroid is a set system that satisfies certain independence structure.

Definition 5 (matroid).

A set system 2[n] is a matroid444In the literature, a matroid is often represented as a pair ([n],), where [n] is the ground set and 2[n] is the family of independent sets. For simplicity, we refer to the matroid simply as throughout this paper. if the following conditions hold:

  1. i.

    .

  2. ii.

    If X, then Y for all YX.

  3. iii.

    If X,Y and |Y|<|X|, then there exists iXY such that Y{i}.

Given a matroid, we define its rank, bases, restriction and contraction as follows.

Definition 6 (rank).

Given a matroid 2[n], the rank of is r:=maxX|X|.

Definition 7 (base).

Given a matroid 2[n], we say that a set B is a base of , if B{i} for all i[n].

Definition 8 (restriction).

Given a matroid 2[n] and a set S[n], we define the restriction of to S as S:={XSX}. It is well-known that the restriction S is a matroid.

Definition 9 (contraction).

Given a matroid 2[n] and a set S[n], we let /S denote the contraction of by S, which is given by /S:={X[n]SXS for all SS such that S}. It is well-known that the contraction /S is a matroid, and its membership oracle can be efficiently evaluated using the membership oracle of .

In matroid-constrained submodular maximization, an algorithm ALG is given a non-negative submodular function f:2[n]0 and a matroid 2[n], and its task is to find a feasible set X that maximizes f(X). Throughout the paper, we denote the optimal solution by O, i.e., O:=argmaxXf(X) (and we reserve 𝒪 for the Big-O notation to avoid confusion). We say that ALG achieves an α approximation with α[0,1], if its solution set XALG satisfies that 𝔼[f(XALG)]αf(O), where the expectation is taken over the randomness of ALG (and the arrival order of the elements, in case ALG is a random-order streaming algorithm, which we will introduce shortly). Moreover, we assume that the matroid rank r is a fixed parameter that does not depend on the total number of elements n (and hence, r=o(n)555If r=Ω(n), then an FPT algorithm with 2𝒪(r) runtime can simply enumerate all subsets of [n], and a streaming algorithm with 𝒪(r) memory can simply store all elements of [n], rendering the problem trivial.), and we are interested in fixed-parameter tractable (FPT) algorithms, i.e., algorithms with runtime h(r)poly(n), where h can be any finite function.

2.2 Random-order streaming model

Besides the standard offline setting (where all elements in the ground set [n] are presented to the algorithm from the outset), we are also interested in the random-order streaming setting, where elements in [n] arrive in a uniformly random order as a data stream.

In this setting, an algorithm has a memory buffer of a limited size (which is 𝒪~(r) for semi-streaming algorithms). Each time when a new element arrives, the algorithm decides how to update its memory buffer, i.e., whether to store the new element, remove some previously stored elements, or modify other stored information. At any point, the algorithm can make any number of queries to the value oracle of the submodular function and the membership oracle of the matroid, provided that the input for the query is a subset of elements stored in its memory. At the end of the stream, the algorithm outputs a subset of elements from its memory that satisfies the matroid constraint.

2.3 Other useful notions and lemmata

We use the notations 𝟏S{0,1}n and 𝟏e{0,1}n to represent the indicator vectors for any set S[n] and for any element e[n] respectively, and let 𝟎 denote the all-zero vector. Moreover, we define a rounding operator that will be the key to reducing the memory usage of our algorithms.

Definition 10 (rounding operator).

Given any finite non-empty set of real numbers I, we use the notation I to denote the rounding operator that maps any real number a to its greatest lower bound in I, or to the smallest number in I if no lower bound exists, i.e., aI:=max{iIia} if aminI, and aI:=minI otherwise.

Then, we present two base exchange lemmas for matroids and two sampling lemmas for non-negative submodular functions.

Lemma 11 ([18, 29]).

Given two bases B1 and B2 of a matroid 2[n], and a partition B1=X1ΓY1, there exists a partition B2=X2ΓY2 such that X1ΓY2 and X2ΓY1 are both bases of .

Lemma 12 ([5]).

Given two bases B1 and B2 of a matroid 2[n], there exists a bijection h:B1B2B2B1 such that for all eB1B2, (B1{e}){h(e)} is a base of .

Lemma 13 ([13, Lemma 2.2]).

Given a non-negative submodular function f:2[n]0 and a set X[n], if R is a random subset of X such that every element of X appears in R with probability exactly p (not necessarily independently), then we have that 𝔼[f(R)](1p)f()+pf(X).

Lemma 14 ([7, Lemma 2.2]).

Given a non-negative submodular function f:2[n]0 and a set X[n], if R is a random subset of X such that every element of X appears in R with probability at most p (not necessarily independently), then we have that 𝔼[f(R)](1p)f().

Finally, we state a lemma that follows from standard rounding schemes [8, 10] for matroid-constrained submodular maximization.

Lemma 15.

Given a matroid 2[n], a submodular function f:2[n]0, and its multi-linear extension F:[0,1]n0, for any x=i=1N1N𝟏Si such that Si for all i[N], there exists Xi=1NSi such that X and f(X)F(x).

3 A (𝟏/𝟐𝜺)-approximation random-order streaming algorithm

In this section, we present our semi-streaming algorithm, Greedy-Filtering (Algorithm 1), which achieves a 1/2ε approximation using 𝒪~(rpoly(ε)) memory for any submodular function f:2[n]0 and any matroid with rank r=o(n). Algorithm 1 operates in three phases.

Algorithm 1 Greedy-Filtering(f,,r,π,ε).

Phase 1: Estimating the highest singleton value

Algorithm 1 first identifies the element with the highest singleton value in the initial ε fraction of the random stream π, and sets w to be rε times the value of this element. It treats w as an upper bound on the value of the most valuable element in [n]. If w fails to upper bound this value, we can show that w.h.p., only a few of the most valuable elements contribute non-negligibly to the optimal solution. Hence, to address this, throughout the stream, Algorithm 1 also maintains a set T of the top ln(1/ε)ε elements with highest singleton values among the elements that have appeared.

Phase 2: Greedily constructing a solution set

Then, Algorithm 1 divides the second ε fraction of the stream into r windows of equal sizes, and greedily constructs a feasible solution set Sr as follows: For each i[r], among all elements in the i-th window Vi that can be added to the current solution set Si1 without violating the matroid constraint, Algorithm 1 identifies the element si with the highest marginal value with respect to Si1, and updates Si=Si1{si}. If no element in Vi can be added to Si1 without violating the matroid constraint (or all elements in Vi that can be added have negative marginal values with respect to Si1), the algorithm sets si to be s1, which acts as a dummy element with zero marginal value with respect to Si1.

Phase 3: Filtering the remaining stream

Algorithm 1 filters elements in the rest of the stream, keeping only a subset H of them: For each remaining element e, Algorithm 1 adds element e to H if, for some i[r], element e has a strictly higher marginal value with respect to solution set Si1 than element si. Notably, when comparing the marginal values of e and si, Algorithm 1 first applies the rounding operator defined in Definition 10 with respect to a set I of geometrically increasing numbers, upper bounded by w. As we will show in the analysis, this coarse comparison of marginal values, based on the rounding operator, is crucial for the algorithm to achieve 1/2ε approximation while keeping the memory cost nearly linear in the matroid rank w.h.p. In the algorithm, we also set a hard memory limit at Line 17.

Finally, Algorithm 1 finds the most valuable subset of TSrH that satisfies the matroid constraint through an exhaustive search and returns it as the final solution. The approximation guarantee of Algorithm 1, along with its memory cost, is formally stated in Theorem 16.

Theorem 16.

Given any input submodular function f:2[n]0, matroid 2[n] with rank r+, and parameter ε(0,12), Algorithm 1 achieves a 1/2ε approximation using 𝒪(rln(r/ε)2ε2) memory, where ε=82ε+2rn.

The memory bound follows from a simple calculation: Throughout the data stream, Algorithm 1 only needs to maintain three sets of elements T,Sr,H (note that it can infer S1,,Sr1 from Sr). By the construction of T and Sr, we have that |T|=ln(1/ε)ε=𝒪(1ε2) and |Sr|r. Moreover, the break condition at Line 17 of Algorithm 1 ensures that |H|=𝒪(rln(r/ε)|I|ε)=𝒪(rln(r/ε)2ε2), since |I|=2log1+ε(rε)+1=𝒪(ln(r/ε)ε). Hence, the total memory cost is 𝒪(rln(r/ε)2ε2).

Next, we prove the approximation guarantee in three main steps: First, we show that unless maxe[n]f({e})rεmineTf({e}) (in which case we prove that the most valuable and feasible subset of T is a 1ε approximation of the optimal solution), w.h.p., the value w in Algorithm 1 is an upper bound on the value of the most valuable element in [n]. Then, we show that not many elements pass the filter in the third phase of Algorithm 1, meaning that w.h.p., the break condition at Line 17 is never met. Finally, in the regular case where w caps the value of the most valuable element and only a few elements pass the filter, we prove that w.h.p., the most valuable and feasible subset of SrH is almost a 1/2 approximation of the optimal solution. We implement these three steps in the following three subsections.

3.1 The simple case: 𝐦𝐚𝐱𝒆[𝒏]𝒇({𝒆})𝒓𝜺𝐦𝐢𝐧𝒆𝑻𝒇({𝒆})

We first analyze the simple case where maxe[n]f({e})rεmineTf({e}). We note that the set T in Algorithm 1 eventually consists of the top ln(1/ε)ε elements in [n] with the highest singleton values (and we will only consider this final version of T in the analysis). In Lemma 17, we show that Algorithm 1 achieves a 1ε approximation in this case.

Lemma 17.

If maxe[n]f({e})rεmineTf({e}), Algorithm 1 obtains a 1ε approximation.

The proof is provided in the full version.

Given Lemma 17, we can now focus on the cases where maxe[n]f({e})<rεmineTf({e}). Under this assumption, we show that w.h.p., the value w in Algorithm 1 is an upper bound on maxe[n]f({e}). Specifically, we define the set T+:={e[n]f({e})mineTf({e})} (note that if there are multiple elements with the same singleton value, T could be a random set because of the random stream, but T+ is always deterministic). In Lemma 18, we show that w.h.p., at least one element of T+ appears in the first ε fraction of the random stream π, which implies that wmaxe[n]f({e}), assuming maxe[n]f({e})<rεmineTf({e}).

Lemma 18.

Let E1 denote the event that there exists some element of T+ that appears in V0={π(1),,π(εn)}, i.e., the first ε fraction of the random stream π. Then, we have that Pr[E1]1ε. Moreover, if maxe[n]f({e})<rεmineTf({e}), then conditioned on event E1, it holds that wmaxe[n]f({e}).

The proof is provided in the full version.

3.2 The unlikely case: |𝑯|>𝒓𝐥𝐧(𝒓/𝜺)|𝑰|𝜺

Recall that the set H in Algorithm 1 consists of elements that pass the filter in the third phase (throughout the analysis, we will only consider the final version of H). In Lemma 19, we show that w.h.p., |H|rln(r/ε)|I|ε, which implies that the break condition at Line 17 of Algorithm 1 is unlikely to be reached.

Lemma 19.

Let E2 denote the event that |H|rln(r/ε)|I|ε. Then, Pr[E2]1ε.

Now we introduce several key concepts that will play important roles in the proof of Lemma 19. First, we notice that in Algorithm 1, each element si chosen in the second phase acts as a selector in the third phase: for each element e in the third phase of Algorithm 1, the algorithm will include e in set H if it satisfies the selection condition posed by si, namely, if Si1{e} and f({e}|Si1)I>f({si}|Si1)I (we say that an element e[n] is selected by the selector si if it satisfies this condition). For each i[r], we let Gi denote the set of elements selected by si among all elements appearing after the i-th window in the second phase of Algorithm 1, i.e.,

Gi:={e[n](j=0iVj)Si1{e} and f({e}|Si1)I>f({si}|Si1)I} (1)

(recall that V0 is the set of elements in the first ε fraction of the stream, and for each j[r], Vj is the set of elements in the j-th window during the second phase of Algorithm 1).

We order the selectors according to their indices: s1,,sr, and for each i[r], we define

Hi:=j=1iGj (and we let H0:= for completeness). (2)

We say that a selector si is ineffective if there is an earlier selector sj for some j<i such that f({si}|Si1)If({sj}|Sj1)I (otherwise we call si an effective selector). In Claim 20, we show that, as the name suggests, any element that is selected by an ineffective selector si would have already been selected by an earlier selector sj for some j<i, which implies that Hi=Hi1.

Claim 20.

Suppose that si is an ineffective selector, i.e., there exists some j<i such that f({si}|Si1)If({sj}|Sj1)I. Then, any element e[n] that satisfies the selection condition of si, i.e., Si1{e} and f({e}|Si1)I>f({si}|Si1)I, also satisfies the selection condition of sj, i.e., Sj1{e} and f({e}|Sj1)I>f({sj}|Sj1)I. In particular, this implies that Hi=Hi1.

Proof.

We establish the first part of the claim by showing that Si1{e} implies Sj1{e}, and f({e}|Si1)I>f({si}|Si1)I implies f({e}|Sj1)I>f({sj}|Sj1)I.

First, since is a matroid and Sj1{e}Si1{e}, the condition that Si1{e} implies the condition that Sj1{e}.

Moreover, because f is a submodular function and Sj1Si1, it holds that f({e}|Sj1)f({e}|Si1), which implies that f({e}|Sj1)If({e}|Si1)I, by monotonicity of the rounding operator I (Definition 10). Hence, the condition that f({e}|Si1)I>f({si}|Si1)I implies that f({e}|Sj1)I>f({si}|Si1)I. Since we assume that f({si}|Si1)If({sj}|Sj1)I, it follows that f({e}|Sj1)I>f({sj}|Sj1)I.

Next, we prove the second part of the claim: Hi=Hi1. Recall that every element eGi appears after the i-th window and satisfies the selection condition of si, which implies that element e also appears after the j-th window (because j<i) and satisfies the selection condition of sj (because of the first part of the claim). It follows that every element eGi belongs to Gj. Hence, we have that GiGjHi1, since Hi1==1i1G and ji1. Finally, Hi=Hi1 follows from Hi=GiHi1 and GiHi1.

Now observe that the set H in Algorithm 1 is a subset of Hr. Hence, to upper bound |H|, it suffices to upper bound |Hr|. If we could prove that for each i[r], |HiHi1|=𝒪(rln(r/ε)ε) holds w.h.p., conditioned on si being an effective selector, then Lemma 19 would follow immediately. Indeed, Claim 20 states that HiHi1= for every ineffective selector si, and thus, we have that Hr=i[r] s.t.si is an effective selector(HiHi1). Notice that there can be at most |I| effective selectors because of the rounding operator I (specifically, any two distinct effective selectors si and sj must satisfy that f({si}|Si1)If({sj}|Sj1)I, and there are only |I| distinct values in the range of I). Therefore, if |HiHi1|=𝒪(rln(r/ε)ε) holds for every effective selector si, then it would follow that |Hr|=𝒪(rln(r/ε)|I|ε).

However, there is a caveat to the above argument: |HiHi1|=𝒪(rln(r/ε)ε) does not always hold w.h.p., conditioned on si being an effective selector. It is possible to construct instances where |HiHi1|=ω(rln(r/ε)ε) always holds, conditioned on si being an effective selector. Fortunately, if this is the case, we can show that the probability that si is an effective selector is negligible. We formalize this intuition in Lemma 21. The proof of Lemma 21 is provided in the full version.

Lemma 21.

For each i[r], let ki denote the number of effective selectors among the first i selectors s1,,si, and let k0=0. Then, for all i[r], we have that

Pr[|Hi|>kirln(r/ε)ε|j{0,,i1},|Hj|kjrln(r/ε)ε]εr. (3)

We are ready to complete the proof of Lemma 19.

Proof of Lemma 19.

First, following the notations in Lemma 21, we upper bound the probability Pr[|Hr|krrln(r/ε)ε] as follows,

Pr[|Hr|krrln(r/ε)ε]
Pr[i[r],|Hi|kirln(r/ε)ε]
= i[r]Pr[|Hi|kirln(r/ε)ε|j{0,,i1},|Hj|kjrln(r/ε)ε]
(1εr)r
 1ε. (4)

Moreover, we notice that the total number of effective selectors kr is at most |I|, because any two distinct effective selectors si and sj must satisfy that f({si}|Si1)If({sj}|Sj1)I (and there are only |I| distinct values in the range of I). Thus, Ineq. (3.2) implies that Pr[|Hr|rln(r/ε)|I|ε]1ε. The proof finishes by noticing that the set H in Algorithm 1 is a subset of Hr.

3.3 The regular case: 𝐦𝐚𝐱𝒆[𝒏]𝒇({𝒆})<𝒓𝜺𝐦𝐢𝐧𝒆𝑻𝒇({𝒆}) and |𝑯|𝒓𝐥𝐧(𝒓/𝜺)|𝑰|𝜺

Now we consider the regular case where maxe[n]f({e})<rεmineTf({e}) and |H|rln(r/ε)|I|ε. To analyze this case, we first establish two simple claims. Claim 22 shows that w.h.p., the value of the optimal solution O does not decrease significantly, when combined with any subset of elements in a small fraction of the random stream π.

Claim 22.

For any η,δ[0,1], let V be a random subset of [n] such that every element of [n] appears in V with probability at most δ (not necessarily independently). Then, with probability at least 1δη, it holds for all XV that f(X|O)ηf(O).

Proof.

We define g:2[n]0 such that g(Y)=f(YO) for all Y[n]. Because f is a non-negative submodular function, it follows that g is also non-negative and submodular. Moreover, we define Xmin:=argminXVf(X|O). Since Xmin is a subset of V and each element of [n] appears in V with probability at most δ, it follows that each element of [n] appears in Xmin with probability at most δ. Hence, by Lemma 14, we have that 𝔼[g(Xmin)](1δ)g(), which is equivalent to 𝔼[f(Xmin|O)+f(O)](1δ)f(O). By rearranging, we obtain that 𝔼[f(Xmin|O)]δf(O). Then, by applying Markov’s inequality to the random variable f(Xmin|O) (note that this random variable is non-negative because f(Xmin|O)f(|O)=0), we have that

Pr[f(Xmin|O)ηf(O)]𝔼[f(Xmin|O)]ηf(O)δf(O)ηf(O)=δη.

Thus, f(Xmin|O)ηf(O) holds with probability at least 1δη, which implies the claim.

Claim 23 bounds the error incurred by the rounding operator I. The proof is provided in the full version.

Claim 23.

Given I={(1+ε)iγwi{0,,log1+ε(1/γ)}} with w0 and ε,γ(0,1), for any two real numbers a0 and bw, if aIbI, then we have that a(1ε)bγw.

Next, we denote by Vfinal the set of elements that appear in the third phase of Algorithm 1, i.e., Vfinal:={π(i)i=εn+rεnr+1,,n}. Recall that XALG denotes the solution set returned by Algorithm 1, and O denotes the optimal solution. In Lemma 24, we show that XALG is almost a 1/2 approximation of OVfinal, conditioned on three high-probability events in the regular case.

Lemma 24.

Let E1 and E2 be the events defined in Lemma 18 and Lemma 19 respectively. Moreover, let E3 denote the event that f(X|O)ηf(O) for all Xi=1rVi, where η:=2εn+2rn. Suppose that the input parameter ε for Algorithm 1 is in the range (0,12), and that maxe[n]f({e})<rεmineTf({e}). Then, conditioned on events E1,E2,E3, Algorithm 1 guarantees that f(XALG)(12ε)(f(OVfinal)5ηf(O)).

We provide the proofs of Lemma 24 and Theorem 16 in the full version. Here, we give a simplified proof sketch to illustrate the main idea.

Proof sketch.

To highlight the main idea, we make the simplifying assumption that both the set Sr constructed in the second phase of Algorithm 1 and the optimal solution O are bases of matroid . We further assume that all elements in the optimal solution O arrive in the third phase of Algorithm 1 (in reality, we expect a random 12ε fraction of them to appear in the third phase, which, by Lemma 13, accounts for 12ε fraction of the optimal value in expectation).

First, we partition the optimal solution into two sets: O=OLΓOH, where OL:=OH and OH:=OH. That is, OH contains the optimal elements that are included in H by Algorithm 1, and OL consists of those that are not included in H (note that conditioned on the high-probability event E2 defined in Lemma 19, these elements were not added to H because they did not satisfy the filter condition at Line 14 of Algorithm 1, not because of the break condition at Line 17). By Lemma 11, we can find a partition Sr=SLΓSH such that SLΓOH and OLΓSH are also bases. Moreover, because both Sr=SLΓSH and OLΓSH are bases, by Lemma 12, there exists a bijection h:SLOL such that for all siSL, substituting element si in Sr with element h(si) yields a new base. In particular, this implies that element h(si) could be added to set Si1 without violating the matroid constraint. Despite this, element h(si)OL was not selected by the selector si (since otherwise it would have been included in H). Thus, it must hold that f({h(si)}|Si1)f({si}|Si1) (barring the rounding error incurred by the rounding operator I, which can be handled by Claim 23). Hence, we can derive that

f(SL) =i[r] s.t.siSLf({si}|Si1SL) (By telescoping sum)
i[r] s.t.siSLf({si}|Si1) (By submodularity)
i[r] s.t.siSLf({h(si)}|Si1)
=oOLf({o}|Si1) (Since h:SLOL is a bijection)
f(OL|Si1)f(OL|Sr) (By submodularity). (5)

Then, we consider the following two solution sets: Sr and SLΓOH. Recall that both Sr and SLΓOH are bases, and thus, they are feasible solutions. Moreover, both Sr and SLΓOH are subsets of SrH, and hence, they are candidate solutions in the final exhaustive search of Algorithm 1. Now we lower bound the sum of the solution values of Sr and SLΓOH, as follows,
f(Sr)+f(SLΓOH) =f(Sr)+f(SL)+f(OH|SL) f(Sr)+f(OL|Sr)+f(OH|SL) (By Ineq. (3.3)) f(Sr)+f(OL|Sr)+f(OH|OLSr) (By submodularity and SLSr) =f(OSr) (Since OLΓOH=O) (1η)f(O),

where the last inequality holds because of the event E3 in the statement of Lemma 24 (which is a high-probability event by Claim 22) and the fact that Sri=1rVi. It follows that one of Sr and SLΓOH must be a nearly 1/2 approximation of the optimal solution O. This establishes Lemma 24.

To complete the proof of Theorem 16, we can assume w.l.o.g. that maxe[n]f({e})<rεmineTf({e}) (because otherwise Theorem 16 follows immediately from Lemma 17), and then combine Lemma 24 with the high-probability bounds we have established for events E1,E2,E3.

4 Interlude: continuous greedy filtering

In this section, we present Continuous-Greedy-Filtering (Subroutine 2), a subroutine that will serve as a building block of our final (11/eε)-approximation offline algorithm. Subroutine 2 combines the techniques used in our streaming algorithm (Algorithm 1) with a fast continuous greedy algorithm introduced by [4]. It operates in three phases.

Algorithm 2 Continuous-Greedy-Filtering(f,,r,ε).

Phase 1: Computing the highest singleton value

Subroutine 2 first identifies the element in [n] with the highest singleton value and assigns this value to v. Since Subroutine 2 is designed for the offline setting, v can be computed exactly by iterating through all elements in [n].

Phase 2: Constructing a fractional solution using the continuous greedy algorithm

Then, Subroutine 2 uses the continuous greedy algorithm to construct a fractional solution from rε subsampled sets: It runs for 1ε epochs. In each epoch t[1ε], it builds a new integral solution Srt from scratch in r steps. In each step i[r], it samples a small subset of elements Vit. Among the elements in Vit that can be added to the current integral solution without violating the matroid constraint, Subroutine 2 identifies the element with the highest “marginal value” (as specified at Line 11) with respect to the current fractional solution. If such an element exists and its marginal value is non-negative, Subroutine 2 includes it in the current integral solution and adds an ε fraction of it to the current fractional solution.

Phase 3: Filtering all elements in [𝒏]

Finally, Subroutine 2 filters all elements in [n], selecting a subset H from them: For each element e[n], it adds element e to H if, for some step i[r] of some epoch t[1ε] in the second phase, (i) element e can be added to the integral solution at that step without violating the matroid constraint, and (ii) it has a strictly higher marginal value than element sit with respect to the fractional solution at that step (or has a strictly positive marginal value in case sit does not exist). Here, the marginal values are also compared after applying the rounding operator I with some well-chosen discretization I, in order to keep the memory usage nearly linear in the matroid rank w.h.p. In the subroutine, we also impose a hard memory limit at Line 22. At the end of Subroutine 2, it returns the set H, along with the union t=11/εSrt of all the integral solutions constructed during the second phase.

Now we let O denote any optimal solution, and define OH:=OH and OL:=OH (i.e., OH contains the optimal elements that are selected in the third phase of Subroutine 2, and OL consists of those that are not selected). In Theorem 25, we show that Subroutine 2 satisfies a property that is crucial for proving the 11/eε approximation guarantee of our final algorithm.

Theorem 25.

Given any input submodular function f:2[n]0, matroid 2[n] with rank r0, and parameter ε<14 such that 1ε+, Subroutine 2 guarantees that with probability at least 12ε, there exists a solution set XALGS such that XALG and

f(XALG)(11e)(f(OL)8εf(O)),

where the set S is the first output of Subroutine 2.

To prove Theorem 25, we assume w.l.o.g. that the matroid rank r is non-zero, as the statement holds trivially if r=0 (we include this edge case in Theorem 25 because our final algorithm might invoke Subroutine 2 on a matroid with rank zero). We first establish Lemma 26, which shows that w.h.p., the break condition at Line 22 of Subroutine 2 is never reached. The proof of Lemma 26 relies on techniques similar to those developed in the proof of Lemma 19, which we defer to the full version.

Lemma 26.

Let E1 denote the event that |H|rln(r/ε2)|I|ε4. Then, we have that Pr[E1]1ε.

Then, in Lemma 27, we show that conditioned on event E1 and another high-probability event E2, the fractional solution x1/ε constructed in Subroutine 2 is a nearly 11e approximation of OL.

Lemma 27.

Suppose that the input parameter ε to Subroutine 2 is in the range (0,14). Let E1 be the event defined in Lemma 26, and let E2 denote the event that f(X|O)εf(O) for all Xt[1ε],i[r]Vit, where the sets Vit are the subsampled sets in Subroutine 2. Then, conditioned on events E1 and E2, Subroutine 2 guarantees that F(x1/ε)(11e)(f(OL)8εf(O)).

We defer the proof of Lemma 27 to the full version and provide a simplified proof sketch.

Proof sketch.

To illustrate the main idea, we make the simplifying assumption that the integral solution sets Srt constructed in Subroutine 2 for all t[1ε], and the optimal solution O are all bases of matroid .

Recall that the optimal solution O is split into OL and OH, where OH contains the optimal elements that are included in H by Subroutine 2, and OL consists of those that are not included in H (note that conditioned on the high-probability event E1 defined in Lemma 26, these elements were not added to H because they did not satisfy the filter condition at Line 20 of Subroutine 2, not because of the break condition at Line 22). Given the partition O=OLΓOH, for each t[1ε], we can find a partition Srt=SLtΓSHt such that SLtΓOH and OLΓSHt are also bases. Moreover, because both Srt=SLtΓSHt and OLΓSHt are bases, by Lemma 12, there exists a bijection ht:SLtOL such that for all sitSLt, replacing element sit in Srt with element ht(sit) results in a new base. In particular, this implies that element ht(sit) could be added to set Si1t without violating the matroid constraint. Despite this, element ht(sit)OL did not satisfy the selection condition at Line 20 corresponding to Δit (since otherwise it would have been included in H). This implies that

F(ε𝟏ht(sit)|xt1+ε𝟏Si1t)F(εΔit|xt1+ε𝟏Si1t)=F(ε𝟏sit|xt1+ε𝟏Si1t) (6)

(ignoring the error incurred by the rounding operator I, which can be addressed by Claim 23). Now we derive that

F(xt)F(xt1) (7)
= i[r]F(ε𝟏sit|xt1+ε𝟏Si1t) (By telescoping sum)
i[r] s.t.sitSLtF(ε𝟏sit|xt1+ε𝟏Si1t) (By the if condition at Line 10)
i[r] s.t.sitSLtF(ε𝟏ht(sit)|xt1+ε𝟏Si1t) (By Ineq. (6))
i[r] s.t.sitSLtF(ε𝟏ht(sit)|xt) (By submodularity)
= oOLF(ε𝟏o|xt) (Since ht:SLtOL is a bijection). (8)

By Definition 4, F(ε𝟏o|xt) represents the additional value gained by increasing the probability of element o appearing in the random set (xt) by ε, and thus, F(ε𝟏o|xt)=ε𝔼[f({o}|(xt){o})]. Combining this with Ineq. (7), we obtain that

F(xt)F(xt1) oOLε𝔼[f({o}|(xt){o})]
oOLε𝔼[f({o}|(xt))] (By submodularity)
ε𝔼[f(OL|(xt))] (By submodularity)
=ε(𝔼[f(OL(xt))]F(xt)) (By Definition 4)
ε(f(OL)εf(O)F(xt)), (9)

where the last inequality follows because conditioned on event E2 in the statement of Lemma 27, the random set (xt) (which is always a subset of t[1ε],i[r]Vit in Subroutine 2) satisfies that f((xt)|O)εf(O).

Then, by iteratively applying Ineq. (4) from t=1 to t=1ε (as in the analysis of the continuous greedy algorithm, with the difference being that the approximation guarantee here is with respect to f(OL)εf(O) rather than f(O)), we can show that F(x1/ε)(11e)(f(OL)cεf(O)) for some constant c, which establishes Lemma 27.

Finally, Theorem 25 follows by combining Lemma 27 with Lemma 26 and Claim 22.

Proof of Theorem 25.

We assume w.l.o.g. that the matroid rank r is non-zero. First, we show that the event E2 defined in Lemma 27 occurs w.h.p. Notice that in Subroutine 2, for any t[1ε] and i[r], the probability that an element e[n] appears in the subsampled set Vit is ε3r. By a union bound, the probability that element e appears in t[1ε],i[r]Vit is at most rεε3r=ε2. Therefore, it follows from Claim 22 that event E2 occurs with probability at least 1ε. Then, combining this with Lemma 26 through a union bound, the probability that both events E1 and E2 occur is at least 12ε. Thus, by Lemma 27, it holds with probability at least 12ε that F(x1/ε)(11e)(f(OL)8εf(O)). This implies Theorem 25 by Lemma 15, because x1/ε=t=11/εε𝟏Srt and S=t=11/εSrt.

On a separate note, Subroutine 2 itself does not guarantee a better-than-1/2 approximation. This is because Subroutine 2 can be interpreted as an 𝒪~(rpoly(ε))-memory random-order streaming algorithm (by estimating the highest singleton value as in Algorithm 1 and replacing the subsampled sets with small segments of the random stream), and the hardness result by [26] asserts that such random-order streaming algorithms cannot exceed 1/2 approximation.

5 A (𝟏𝟏/𝒆𝜺)-approximation offline algorithm

In this section, we present our final algorithm (Algorithm 3) that achieves a nearly 11/e approximation by recursively applying Continuous-Greedy-Filtering (Subroutine 2) to progressively refined instances. Specifically, Algorithm 3 performs a recursive process consisting of 1α levels.

Algorithm 3 Recursive-Continuous-Greedy-Filtering(f,,r,Sacc,OHacc,α,k).

Initial phase

We initiate the first call to Algorithm 3 by setting k=1, Sacc=, and OHacc=. We denote the optimal solution to the input instance (f,) by O(1). At the root level k=1, Algorithm 3 first invokes Subroutine 2 with parameter ε=α2 on the input instance (f,), which returns two sets S(1) and H(1). Then, Algorithm 3 enumerate all subsets of H(1) in the for loop at Line 9. Crucially, there will be an iteration of this loop where OH(1)=O(1)H(1) (all other iterations will be irrelevant to the analysis). Within this particular iteration (as well as in all other iterations), Algorithm 3 makes a recursive call to itself by setting Sacc=S(1) and OHacc=OH(1). These sets will be accumulated throughout the recursion.

Recursion phase

At each internal recursion level k{2,,1α1}, Algorithm 3 receives the accumulated sets Sacc=j=1k1S(j) and OHacc=j=1k1OH(j) from the previous k1 levels. It first constructs a refined instance (f(k),(k)), where the matroid (k) is the contraction of the original matroid by the accumulated set j=1k1OH(j), and the non-negative submodular function f(k) is constructed by evaluating the original function f on the union of the accumulated set j=1k1OH(j) and the input set to f(k). We define O(k):=O(1)(j=1k1OH(j)). Next, Algorithm 3 calls Subroutine 2 with parameter ε=α2 on the instance (f(k),(k)), which returns two sets S(k) and H(k). Then, Algorithm 3 enumerates all subsets of H(k). Crucially, there will be an iteration of this loop where OH(k)=O(k)H(k) (all other iterations will be irrelevant to the analysis). In this particular iteration (as well as in all other iterations), Algorithm 3 makes a recursive call to itself, passing the updated accumulated sets Sacc=j=1kS(j) and OHacc=j=1kOH(j) as arguments.

It is important to keep in mind that, in the recursion tree of Algorithm 3 (illustrated in Figure 1), there is a unique root-to-leaf path such that for each k[1α1], the (k+1)-th call to Algorithm 3 along the path (corresponding to the (k+1)-th node in the path) is invoked by the iterate OH(k)=O(k)H(k) (from the for loop at Line 9) during the k-th call, where O(k)=O(1)(j=1k1OH(j)). We will focus on this particular path when we prove the approximation guarantee of Algorithm 3.

Figure 1: This figure shows an example depth-2 recursion tree of Algorithm 3. The empty nodes form a unique path (made bold in the figure) in which, for each k[2], the (k+1)-th call is invoked by the iterate OH(k)=O(k)H(k) during the k-th call, where O(k)=O(1)(j=1k1OH(j)).

Return phase

At the leaf level k=1α, Algorithm 3 first calls Subroutine 2 on a refined instance (f(1/α),(1/α)) (constructed similarly to the instances in the previous levels), which returns two sets S(1/α) and H(1/α). Then, it conducts an exhaustive search to find the most valuable subset XALG of SaccOHacc=(j=11/αS(j))(j=11/α1OH(j)) that belongs to matroid , and returns XALG to the previous level. Then, at each level k[1α1], among all solution sets returned by the recursive calls, Algorithm 3 identifies and returns the most valuable one to the previous level.

Time and space complexity

Now we analyze the time and space complexity of Algorithm 3 (assuming that we set k=1, Sacc=, and OHacc= in the initial call). We first observe that at each level k[1α], the sets S(k) and H(k), which are outputs of Subroutine 2 with parameter α2, have sizes |S(k)|rα2 (because S(k) is either the union of 1α2 solution sets constructed in the second phase of Subroutine 2 or an empty set) and |H(k)|=𝒪(rln(r)2poly(α)) (because of the break condition at Line 22 of Subroutine 2). From this, we observe the following:

  1. (i)

    At each leaf node of the recursion tree of Algorithm 3, the accumulated set Sacc=k=11/αS(k) has size |Sacc|rα3 (because each set S(k) has size at most rα2), and the accumulated set OHacc=k=11/αOH(k) has size |OHacc|=𝒪(rln(r)2poly(α)) (because |OH(k)||H(k)|=𝒪(rln(r)2poly(α))).

  2. (ii)

    Each non-leaf node of the recursion tree has at most 2𝒪(rln(r)2poly(α)) child nodes (because each set H(k) has size 𝒪(rln(r)2poly(α))), which implies that the total number of leaves is 2𝒪(rln(r)2poly(α)), since the recursion tree has 1α levels in total.

Then, we notice that the runtime of each call to Subroutine 2 at each node is n2𝒪(r/α2), where the 2𝒪(r/α2) factor accounts for the exact computation of the multi-linear extension (which, if desired, can be replaced by polynomial-time estimation using standard Monte-Carlo sampling). Moreover, the exhaustive search step at each leaf node takes 2𝒪(rln(r)2poly(α)) time, since |Sacc|+|OHacc|=𝒪(rln(r)2poly(α)). Hence, summing the runtime over all nodes, we obtain that the total runtime is n2𝒪(rln(r)2poly(α)). For the space complexity, we assume that Algorithm 3 traverses the recursion tree in depth-first order. We notice that at any node in any level [1α] of the recursion tree, Algorithm 3 only needs to store the sets S(k),H(k),OH(k) for all k[] from its ancestor nodes and itself, and among S(k),H(k),OH(k), the set H(k) has the largest size, which is 𝒪(rln(r)2poly(α)). Therefore, the space complexity is 𝒪(rln(r)2poly(α)).

Next, in Theorem 28, we show that Algorithm 3 achieves a nearly 11/e approximation. At a high level, the argument is as follows: Consider the specific path in the recursion tree of Algorithm 3, as illustrated in Figure 1, but with depth 1α for arbitrarily small α. Suppose that for all k[1α], the marginal value of the set OH(k) with respect to j=1k1OH(j) is at least α fraction of the optimal value, i.e., at least αf(O(1)). Then, the union j=11/αOH(j), which is a candidate solution in the exhaustive search step at the leaf node, must capture the entire optimal value. On the other hand, if for any k[1α], the marginal value of the set OH(k) with respect to j=1k1OH(j) is less than α fraction of the optimal value, then by submodularity, the set O(1)OH(k) should account for the majority of the optimal value. We can show that (using Theorem 28) w.h.p., the union of the set j=1k1OH(j) and a subset of the first output of Subroutine 2 during the k-th call along the path, which is a candidate solution in the exhaustive search step, is a nearly 11/e approximation to O(1)OH(k). The complete proof is provided in the full version.

Theorem 28.

Given any input submodular function f:2[n]0, matroid 2[n] with rank r+, and parameter α(0,12) such that 1α+, by setting Sacc=, OHacc=, and k=1, Algorithm 3 obtains a 11e7α approximation with n2𝒪(rln(r)2poly(α)) runtime and 𝒪(rln(r)2poly(α)) memory.

References

  • [1] Shipra Agrawal, Mohammad Shadravan, and Cliff Stein. Submodular secretary problem with shortlists. In 10th Innovations in Theoretical Computer Science Conference (ITCS 2019). Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019. doi:10.4230/LIPIcs.ITCS.2019.1.
  • [2] Naor Alaluf, Alina Ene, Moran Feldman, Huy L Nguyen, and Andrew Suh. An optimal streaming algorithm for submodular maximization with a cardinality constraint. Mathematics of Operations Research, 47(4), 2022. doi:10.1287/moor.2021.1224.
  • [3] Ashwinkumar Badanidiyuru, Baharan Mirzasoleiman, Amin Karbasi, and Andreas Krause. Streaming submodular maximization: Massive data summarization on the fly. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, 2014. doi:10.1145/2623330.2623637.
  • [4] Ashwinkumar Badanidiyuru and Jan Vondrák. Fast algorithms for maximizing submodular functions. In Proceedings of the twenty-fifth annual ACM-SIAM symposium on Discrete algorithms. SIAM, 2014. doi:10.1137/1.9781611973402.110.
  • [5] Richard A Brualdi. Comments on bases in dependence structures. Bulletin of the Australian Mathematical Society, 1(2), 1969. doi:10.1017/S000497270004140X.
  • [6] Niv Buchbinder and Moran Feldman. Constrained submodular maximization via new bounds for dr-submodular functions. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, 2024. doi:10.1145/3618260.3649630.
  • [7] Niv Buchbinder, Moran Feldman, Joseph Naor, and Roy Schwartz. Submodular maximization with cardinality constraints. In Proceedings of the twenty-fifth annual ACM-SIAM symposium on Discrete algorithms. SIAM, 2014. doi:10.1137/1.9781611973402.106.
  • [8] Gruia Calinescu, Chandra Chekuri, Martin Pal, and Jan Vondrák. Maximizing a monotone submodular function subject to a matroid constraint. SIAM Journal on Computing, 40(6), 2011. doi:10.1137/080733991.
  • [9] Amit Chakrabarti and Sagar Kale. Submodular maximization meets streaming: matchings, matroids, and more. Mathematical Programming, 154, 2015. doi:10.1007/s10107-015-0900-7.
  • [10] Chandra Chekuri, Jan Vondrák, and Rico Zenklusen. Dependent randomized rounding via exchange properties of combinatorial structures. In 2010 IEEE 51st Annual Symposium on Foundations of Computer Science. IEEE, 2010. doi:10.1109/FOCS.2010.60.
  • [11] Vincent Cohen-Addad, Anupam Gupta, Amit Kumar, Euiwoong Lee, and Jason Li. Tight fpt approximations for k-median and k-means. In 46th International Colloquium on Automata, Languages, and Programming (ICALP 2019). Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019. doi:10.4230/LIPIcs.ICALP.2019.42.
  • [12] Shahar Dobzinski and Jan Vondrák. From query complexity to computational complexity. In Proceedings of the forty-fourth annual ACM symposium on Theory of computing, 2012. doi:10.1145/2213977.2214076.
  • [13] Uriel Feige, Vahab S Mirrokni, and Jan Vondrák. Maximizing non-monotone submodular functions. SIAM Journal on Computing, 40(4), 2011. doi:10.1137/090779346.
  • [14] Moran Feldman, Paul Liu, Ashkan Norouzi-Fard, Ola Svensson, and Rico Zenklusen. Streaming submodular maximization under matroid constraints. Mathematics of Operations Research, 2025. doi:10.1287/moor.2023.0276.
  • [15] Moran Feldman, Ashkan Norouzi-Fard, Ola Svensson, and Rico Zenklusen. Submodular Maximization Subject to Matroid Intersection on the Fly. In 30th Annual European Symposium on Algorithms (ESA 2022). Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPIcs.ESA.2022.52.
  • [16] Moran Feldman, Ashkan Norouzi-Fard, Ola Svensson, and Rico Zenklusen. The one-way communication complexity of submodular maximization with applications to streaming and robustness. Journal of the ACM, 70(4), 2023. doi:10.1145/3588564.
  • [17] Shayan Oveis Gharan and Jan Vondrák. Submodular maximization by simulated annealing. In Proceedings of the twenty-second annual ACM-SIAM symposium on Discrete Algorithms. SIAM, 2011. doi:10.1137/1.9781611973082.83.
  • [18] Curtis Greene. A multiple exchange property for bases. Proceedings of the American Mathematical Society, 39(1), 1973. doi:10.1090/S0002-9939-1973-0311494-8.
  • [19] Chien Chung Huang, Naonori Kakimura, Simon Mauras, and Yuichi Yoshida. Approximability of monotone submodular function maximization under cardinality and matroid constraints in the streaming model. SIAM Journal on Discrete Mathematics, 2022. doi:10.1137/20M1357317.
  • [20] Ehsan Kazemi, Marko Mitrovic, Morteza Zadimoghaddam, Silvio Lattanzi, and Amin Karbasi. Submodular streaming in all its glory: Tight approximation, minimum memory and low adaptive complexity. In International Conference on Machine Learning. PMLR, 2019. URL: https://proceedings.mlr.press/v97/kazemi19a.html.
  • [21] Paul Liu, Aviad Rubinstein, Jan Vondrák, and Junyao Zhao. Cardinality constrained submodular maximization for random streams. Advances in Neural Information Processing Systems, 34, 2021. URL: https://proceedings.neurips.cc/paper_files/paper/2021/file/333222170ab9edca4785c39f55221fe7-Paper.pdf.
  • [22] Pasin Manurangsi. Tight running time lower bounds for strong inapproximability of maximum k-coverage, unique set cover and related problems (via t-wise agreement testing theorem). In Proceedings of the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms. SIAM, 2020. doi:10.1137/1.9781611975994.5.
  • [23] Baharan Mirzasoleiman, Stefanie Jegelka, and Andreas Krause. Streaming non-monotone submodular maximization: Personalized video summarization on the fly. In Proceedings of the AAAI Conference on Artificial Intelligence, 2018. doi:10.1609/aaai.v32i1.11529.
  • [24] George L Nemhauser and Laurence A Wolsey. Best algorithms for approximating the maximum of a submodular set function. Mathematics of operations research, 3(3), 1978. doi:10.1287/moor.3.3.177.
  • [25] Benjamin Qi. On maximizing sums of non-monotone submodular and linear functions. Algorithmica, 86(4), 2024. doi:10.1007/s00453-023-01183-3.
  • [26] Aviad Rubinstein and Junyao Zhao. Maximizing non-monotone submodular functions over small subsets: Beyond 1/2-approximation. In 49th International Colloquium on Automata, Languages, and Programming (ICALP 2022). Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPIcs.ICALP.2022.106.
  • [27] Mohammad Shadravan. Improved submodular secretary problem with shortlists. arXiv preprint arXiv:2010.01901, 2020. arXiv:2010.01901.
  • [28] Piotr Skowron. FPT approximation schemes for maximizing submodular functions. Information and Computation, 257, 2017. doi:10.1016/j.ic.2017.10.002.
  • [29] Douglas R Woodall. An exchange theorem for bases of matroids. Journal of Combinatorial Theory, Series B, 16(3), 1974. doi:10.1016/0095-8956(74)90067-7.