A General Framework for Finding Diverse Solutions via Network Flow and Its Applications

Iwamasa, Yuni; Matsuda, Tomoki; Morihira, Shunya; Sumita, Hanna

doi:10.4230/LIPIcs.ISAAC.2025.41

A General Framework for Finding Diverse Solutions via Network Flow and Its Applications

Yuni Iwamasa

Graduate School of Informatics, Kyoto University, Japan Tomoki Matsuda School of Computing, Institute of Science Tokyo, Japan Shunya Morihira Graduate School of Informatics, Kyoto University, Japan Hanna Sumita

School of Computing, Institute of Science Tokyo, Japan

Abstract

In this paper, we present a general framework for efficiently computing diverse solutions to combinatorial optimization problems. Given a problem instance, the goal is to find $k$ solutions that maximize a specified diversity measure – the sum of pairwise Hamming distances or the size of the union of the $k$ solutions. Our framework applies to problems satisfying two structural properties: (i) All solutions are of equal size and (ii) the family of all solutions can be represented by a surjection from the family of ideals of some finite poset. Under these conditions, we show that the problem of computing $k$ diverse solutions can be reduced to the minimum cost flow problem and the maximum $s$ - $t$ flow problem. As applications, we demonstrate that both the unweighted minimum $s$ - $t$ cut problem and the stable matching problem satisfy the requirements of our framework. By utilizing the recent advances in network flows algorithms, we improve the previously known time complexities of the diverse problems, which were based on submodular function minimization.

Keywords and phrases:

Diverse Solutions, Network Flow Algorithm, Lattice Theory

Funding:

Yuni Iwamasa: Partially supported by JSPS KAKENHI Grant Numbers JP22K17854, JP24K02901, JP24K21315, Japan.

Hanna Sumita: Partially supported by JST ERATO Grant Number JPMJER2301, JST ASPIRE Grant Number JPMJAP2302, and JSPS KAKENHI Grant Numbers JP21K17708, JP21H03397, Japan.

Copyright and License:

2012 ACM Subject Classification:

Theory of computation

\rightarrow

Design and analysis of algorithms

Related Version:

Full Version: https://arxiv.org/abs/2504.17633

Acknowledgements:

We are grateful to Yasunori Kinoshita for his insightful suggestion on the start of this work. We also thank Yasuaki Kobayashi and Yutaro Yamaguchi for bibliographic information.

DOI:

10.4230/LIPIcs.ISAAC.2025.41

Event:

36th International Symposium on Algorithms and Computation (ISAAC 2025)

Editors:

Ho-Lin Chen, Wing-Kai Hon, and Meng-Tsung Tsai

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

In modeling real-world problems as optimization problems, many factors are inevitably omitted from the model due to the limited information and the need for tractability. Consequently, finding a single optimal solution to the optimization problem may not be enough. Motivated by this issue, developing algorithms to find diverse multiple solutions has recently gained attention in the field of combinatorial optimization [3, 4, 7, 8, 9, 10, 11, 15, 17].

We formally define the $k$ -diverse problem for a positive integer $k$ . Let $d_{\mathrm{div}}$ be a function that measures the “diversity” for a $k$ -tuple of subsets of some arbitrary fixed ground set $E$ . For a combinatorial problem Prob, the $k$ -diverse problem of Prob asks, given an instance $\mathbf{I}$ of Prob, to find a $k$ -tuple $(S_{1},S_{2},\dots,S_{k})$ of solutions of $\mathbf{I}$ that maximizes the fixed measure $d_{\mathrm{div}}$ . Typical examples of the measure of diversity include the following:

d_{\mathrm{sum}}(S_{1},S_{2},\dots,S_{k})\coloneqq\sum_{1\leq i<j\leq k}\left|% S_{i}\triangle S_{j}\right|,\qquad d_{\mathrm{cov}}(S_{1},S_{2},\dots,S_{k})% \coloneqq\left|\bigcup_{1\leq i\leq k}S_{i}\right|.

(2)

If we use $d_{\mathrm{sum}}$ (resp. $d_{\mathrm{cov}}$ ) as the measure, then we refer to the corresponding $k$ -diverse problem as Sum- $k$ -Diverse (resp. Cov- $k$ -Diverse).

In this paper, we present a general framework for efficiently finding diverse solutions to combinatorial optimization problems. Namely, we focus on Sum- $k$ -Diverse/Cov- $k$ -Diverse of a combinatorial problem Prob having two properties (S) and (R) introduced below. The first property (S) states that all solutions of each instance of Prob have the same size:

(S) For any instance $\mathbf{I}$ of Prob, there exist a finite ground set $E$ and a positive integer $q$ such that the family $\mathcal{S}(\mathbf{I})$ of solutions of $\mathbf{I}$ consists of subsets of $E$ of size $q$ .

The second property (R) indicates that $\mathcal{S}(\mathbf{I})$ admits a lattice structure and $\mathcal{S}(\mathbf{I})$ has a compact representation using the lattice. To state this property precisely, we first introduce some notation. Let $(P,\preceq)$ be a finite poset with minimum element $\bot$ and maximum element $\top$ . For such a poset $P$ , let $P^{*}$ denote the poset obtained from $P$ by removing $\bot$ and $\top$ . For a map $r\colon E\to P^{2}$ that assigns each element $e\in E$ to an ordered pair

\displaystyle r(e)\coloneqq(e^{+},e^{-})

(3)

of elements in $P$ with $e^{+}\preceq e^{-}$ , we define a set ${\sup}_{r}(I)$ by

{\sup}_{r}(I)\coloneqq\{e\in E\mid e^{+}\in I\cup\{\bot\}\not\ni e^{-}\}

(4)

for each ideal $I\in\mathcal{I}(P^{*})$ . Here, an ideal of a poset is a subset $I\subseteq P^{*}$ that is closed downward under $\preceq$ , that is, $y\preceq x\in I$ implies $y\in I$ for all $x,y\in P^{*}$ , and $\mathcal{I}(P^{*})$ denotes the family of ideals of the poset $P^{*}$ . With this notation, the property (R) can be stated as follows:

(R) For any instance $\mathbf{I}$ of Prob, there exist a finite poset $(P,\preceq)$ with distinct minimum element $\bot$ and maximum element $\top$ and a map $r\colon E\to P^{2}$ given by $e\mapsto(e^{+},e^{-})$ with $e^{+}\preceq e^{-}$ such that

\displaystyle\mathcal{S}(\mathbf{I})=\{{\sup}_{r}(I)\mid I\in\mathcal{I}(P^{*}% )\}.

(5)

We refer to a map $r$ appearing in the property (R) as a reduction map. In other words, the property (R) requires that every instance $\mathbf{I}$ of Prob admits a reduction map $r$ such that ${\sup}_{r}$ forms a surjection from the family $\mathcal{I}(P^{*})$ of ideals of $P^{*}$ to the family $\mathcal{S}(\mathbf{I})$ of solutions of $\mathbf{I}$ . A toy example of a reduction map is found in Example 1.

Example 1.

Let $E=\{a,b,c,d\}$ be the ground set. Suppose that the set of solutions is $\mathcal{S}(\mathbf{I})=\{\{a,b\},\{b,c\},\{c,d\}\}$ . We consider a poset $(P=\{\bot,u,v,\top\},\preceq)$ , where $\bot$ is the minimum element, $\top$ is the maximum element, and $u\prec v$ . We define a map $r\colon E\to P^{2}$ by $r(a)=(\bot,u)$ , $r(b)=(\bot,v)$ , $r(c)=(u,\top)$ , $r(d)=(v,\top)$ . Since $\mathcal{I}(P^{*})=\{\emptyset,\{u\},\{u,v\}\}$ , we can see that $r$ is a reduction map. Indeed, we have ${\sup}_{r}(\emptyset)=\{e\in E\mid e^{+}\in\{\bot\}\not\ni e^{-}\}=\{a,b\}$ , ${\sup}_{r}(\{u\})=\{b,c\}$ , and ${\sup}_{r}(\{u,v\})=\{c,d\}$ .

We show that if the problem Prob has the properties (S) and (R), then $\mathcal{S}(\mathbf{I})$ with a partial order defined from a reduction map forms a distributive lattice (see Theorem 11).

Our framework exploits network flow techniques to efficiently compute diverse solutions. To utilize them, we will construct a directed acyclic graph (DAG) to represent a poset. Here, we say that a DAG $G$ represents a poset $(P,\preceq)$ with distinct minimum element $\bot$ and maximum element $\top$ if the vertex set of $G$ is $P$ , $\top$ is a source and $\bot$ is a sink in $G$ , and for any $u,v\in P^{*}$ , there is a $u$ - $v$ path in $G$ if and only if $u\succeq v$ .

Our main result is stated as follows. We focus on deterministic algorithms in this paper.

Theorem 2.

Suppose that a combinatorial problem Prob has the properties (S) and (R). Then, for any instance $\mathbf{I}$ of Prob, the problems Sum- $k$ -Diverse and Cov- $k$ -Diverse of Prob can be solved in $O(T_{P}(\mathbf{I})+T_{r}(\mathbf{I})+(|A|+k|E|)^{1+o(1)})$ time and in $O(T_{P}(\mathbf{I})+T_{r}(\mathbf{I})+(|A|+|E|)^{1+o(1)}\log^{2}k+kq)$ time, respectively. Here, $E$ is the ground set of $\mathbf{I}$ , $q$ is the size of each solution of $\mathbf{I}$ , $T_{P}(\mathbf{I})$ is the time required to construct a DAG representing the poset $P$ in (R), $T_{r}(\mathbf{I})$ is the time required to construct a reduction map $r$ in (R), and $A$ is the set of arcs of a DAG $G$ that represents $P$ .

We obtain Theorem 2 via the reduction of Sum- $k$ -Diverse/Cov- $k$ -Diverse of Prob to a classical network flow problem, called the minimum cost flow problem. This enables us to utilize the state-of-the-art (deterministic) algorithm [25] for the minimum cost flow problem in solving Sum- $k$ -Diverse/Cov- $k$ -Diverse of Prob. To this end, we introduce an intermediate problem called the minimum $k$ -potential problem, and reduce the $k$ -diverse problems to the minimum $k$ -potential problem. Then we further reduce the minimum $k$ -potential problem to the minimum cost flow problem by utilizing the idea of Ahuja, Hochbaum, and Orlin [1], who dealt with a more general problem. Our novelty is to identify the properties (S) and (R) as a sufficient condition to reduce the $k$ -diverse problem to the minimum $k$ -potential problem.

We also develop a reduction of Sum- $k$ -Diverse/Cov- $k$ -Diverse of Prob to the maximum $s$ - $t$ flow problem via the minimum $k$ -potential problem. We describe this in the full version. While the running-time of this reduction is slightly worse than that of the reduction to the minimum cost flow problem, its practical performance may be superior thanks to the simplicity of the reduction and the maximum $s$ - $t$ flow problem.

As applications of our framework, we demonstrate that two classical combinatorial problems, Unweighted Minimum $s$ - $t$ Cut and Stable Matching, have properties (S) and (R). Here, Unweighted Minimum $s$ - $t$ Cut is to find an $s$ - $t$ cut of a given digraph $G$ with the minimum size, and Stable Matching is to find a matching of two parties such that no unmatched pair both prefer each other to their current partners (see Sections 5.2 and 5.3 for the definitions). In fact, both problems are known to possess a distributive lattice structure. By applying Theorem 2 to Sum- $k$ -Diverse/Cov- $k$ -Diverse of these problems, we obtain the following results.

Theorem 3.

1.

The problems Sum- $k$ -Diverse and Cov- $k$ -Diverse of Unweighted Minimum $s$ - $t$ Cut can be solved in $O(n+(km)^{1+o(1)})$ time and in $O(n+m^{1+o(1)}\log^{2}k+kq)$ time, respectively, where $n$ denotes the number of vertices of the input digraph $G$ , $m$ the number of arcs of $G$ , and $q$ the size of a minimum $s$ - $t$ cut of $G$ .
2.

The problems Sum- $k$ -Diverse and Cov- $k$ -Diverse of Stable Matching can be solved in $O((kn^{2})^{1+o(1)})$ time and in $O(n^{2+o(1)}\log^{2}k+kn)$ time, respectively, where $n$ denotes the size of the ground set $U$ (or $V$ ) of the input instance $(U,V;(\leq_{u})_{u\in U},(\leq_{v})_{v\in V})$ .

The polynomial-time solvability of Sum- $k$ -Diverse/Cov- $k$ -Diverse of Unweighted Minimum $s$ - $t$ Cut has already been shown by De Berg, Martínez, and Spieksma [7]. Very recently, independently of our work, the same authors [8] developed a framework for solving Sum- $k$ -Diverse/Cov- $k$ -Diverse of a combinatorial problem having a certain property, and they show that Sum- $k$ -Diverse/Cov- $k$ -Diverse of Stable Matching can be solved in polynomial time. Both of their tractability results in [7, 8] are based on the polynomial-time solvability of the submodular function minimization (SFM) [2, 12]. Even if we use the state-of-the-art algorithm for SFM, given in [18], the running-time of their algorithms is (polynomial but) not very fast; it takes $O(k^{5}n^{5})$ time for Sum- $k$ -Diverse/Cov- $k$ -Diverse of Unweighted Minimum $s$ - $t$ Cut, and $O(k^{5}n^{9})$ time for that of Stable Matching. As in Theorem 3, our proposed algorithms for Sum- $k$ -Diverse/Cov- $k$ -Diverse of Unweighted Minimum $s$ - $t$ Cut/Stable Matching are much faster, thanks to recent advance in network flow algorithms. Moreover, we show that the framework of [8] for Sum- $k$ -Diverse and Cov- $k$ -Diverse can be captured by our framework (see Section 5.4). This allows us to improve the time complexity.

Due to the space limitation, all the proofs (marked with $\star$ ) are omitted, and they can be found in the full version.

Related work.

There exists a vast body of literature on the problem of finding diverse solutions; here, we mention only a few representative papers. Finding diverse solutions is generally harder than finding a single one. The $k$ -diverse problems with respect to $d_{\mathrm{sum}}$ of some polynomial-time solvable problems – such as the maximum matching problem [10] and the (global) minimum cut problem [15] – are known to be NP-hard. Thus, in recent years, there has been a growing body of work on fixed-parameter tractable (FPT) algorithms for NP-hard diverse problems [3, 4, 9, 10, 11, 17]. Meanwhile, polynomial-time algorithms are known for several other $k$ -diverse problems with respect to the diversity measures $d_{\mathrm{sum}}$ or $d_{\mathrm{cov}}$ , including the $k$ -diverse variants of the spanning-tree problem [17], the shortest $s$ - $t$ path problem [16], as well as Unweighted Minimum $s$ - $t$ Cut [7] and Stable Matching [8], which have already been mentioned.

2 Preliminaries

Let $\mathbb{Z}$ , $\mathbb{Z}_{+}$ , $\mathbb{R}$ , and $\mathbb{R}_{+}$ denote the set of integers, nonnegative integers, real numbers, and nonnegative real numbers, respectively. For a nonnegative integer $k$ , let $[k]\coloneqq\{1,2,\dots,k\}$ and $[0,k]\coloneqq\{0,1,\dots,k\}$ . The symmetric difference $(S\setminus T)\cup(T\setminus S)$ of sets $S$ and $T$ is denoted by $S\triangle T$ . For a finite set $E$ and a nonnegative integer $q\in\mathbb{Z}_{+}$ , let $\binom{E}{q}$ denote the family of all subsets of $E$ of size $q$ , i.e., $\binom{E}{q}\coloneqq\{S\subseteq E\mid|S|=q\}$ . For any finite set $E$ , element $e\in E$ , and $k$ -tuple $\mathbf{S}=(S_{1},S_{2},\dots,S_{k})$ of subsets of $E$ , the multiplicity of $e$ with respect to $\mathbf{S}$ , denoted by $\mu_{e}(\mathbf{S})$ , is defined as the number of subsets $S_{i}$ that contain $e$ , i.e., $\mu_{e}(\mathbf{S})\coloneqq|\{i\in[k]\mid e\in S_{i}\}|$ .

Posets.

A partially ordered set (or poset) is a pair $(P,\preceq)$ of a set $P$ and a binary relation $\preceq$ over $P$ satisfying, for $x,y,z\in P$ , that $x\preceq x$ (reflexivity), $x\preceq y$ and $y\preceq x$ imply $x=y$ (antisymmetry), and $x\preceq y$ and $y\preceq z$ imply $x\preceq z$ (transitivity). By $x\prec y$ we mean $x\preceq y$ and $x\neq y$ . Such a binary relation is called a partial order. If no confusion arises, we denote by $P$ a poset and its underlying set interchangeably. In this paper, we consider only a finite poset, i.e., posets whose underlying sets are finite. Hence, by a poset we mean a finite poset. A partial order $\preceq$ on $P$ is called a total order if $x\preceq y$ or $y\preceq x$ holds for any $x,y\in P$ . For a poset $(P,\preceq)$ , a subset $I\subseteq P$ is called an ideal if $I$ is closed under $\preceq$ , i.e., for any $v\in P$ and $u\preceq v$ , we have $u\in P$ . Let $\mathcal{I}(P)$ denote the set of all ideals of $P$ . If $P$ has the minimum element $\bot$ and the maximum element $\top$ , then we denote by $P^{*}$ the poset obtained from $P$ by removing $\bot$ and $\top$ .

Network flows.

Let $G=(V,A)$ be a digraph. For a vertex subset $X\subseteq V$ , let $\Delta^{+}_{G}(X)$ (resp. $\Delta^{-}_{G}(X)$ ) denote the set of outgoing arcs from (resp. incoming arcs to) $X$ . If no confusion arises, we omit the subscript $G$ from $\Delta^{+}_{G}(X)$ and $\Delta^{-}_{G}(X)$ . If $X$ consists of a single vertex $v$ , i.e., $X=\{v\}$ , we simply write $\Delta^{+}(v)$ and $\Delta^{-}(v)$ instead of $\Delta^{+}(\{v\})$ and $\Delta^{-}(\{v\})$ , respectively. A function $f\colon A\to\mathbb{Z}_{+}$ from the arc set $A$ to the nonnegative integers is called a flow of $G$ . In the case where the digraph $G$ has an arc capacity $c\colon A\to\mathbb{Z}_{+}$ , a flow $f\colon A\to\mathbb{Z}_{+}$ is said to be feasible (with respect to $c$ ) if $f(a)\leq c(a)$ for all $a\in A$ . For a feasible flow $f$ with respect to an arc capacity $c$ , its residual graph, denoted by $G_{f}$ , is the digraph whose vertex set is $V$ and arc set is $\{a\mid a\in A,\ f(a)<c(a)\}\cup\{a\mid\bar{a}\in A,\ 0<f(\bar{a})\}$ , where $\bar{a}$ denotes the reverse arc $(v,u)$ of $a=(u,v)$ . This plays an important role in algorithms for network flow problems (and our algorithms). For a flow $f\colon A\to\mathbb{Z}_{+}$ , its boundary $\partial f\colon V\to\mathbb{Z}$ is defined by $\partial f(v)\coloneqq\sum_{a\in\Delta^{+}(v)}f(a)-\sum_{a\in\Delta^{-}(v)}f(a)$ for each $v\in V$ .

In the following (except for Section 5.4), we assume that Prob is a combinatorial problem having the properties (S) and (R). For an instance $\mathbf{I}$ of Prob, we denote by $\mathcal{S}(\mathbf{I})$ the family of solutions of $\mathbf{I}$ , $T_{P}(\mathbf{I})$ the time required to construct a DAG representing the poset $P$ in (R), and $T_{r}(\mathbf{I})$ the time required to construct a reduction map $r$ in (R).

3 Reduction to the minimum $𝒌$ -potential problem

In this section, we provide a reduction from the $k$ -diverse problem of Prob to the minimum $k$ -potential problem, which we introduce later.

In Sum- $k$ -Diverse and Cov- $k$ -Diverse of Prob, we can regard the diversity measures $d_{\mathrm{sum}}$ and $d_{\mathrm{cov}}$ as the functions over $(\mathcal{S}(\mathbf{I}))^{k}$ for each instance $\mathbf{I}$ of Prob. Since $\mathcal{S}(\mathbf{I})\subseteq\binom{E}{q}$ holds by the property (S), we have $\sum_{e\in E}\mu_{e}(\mathbf{S})=\sum_{i=1}^{k}|S_{i}|=kq$ for any $\mathbf{S}=(S_{1},S_{2},\dots,S_{k})\in(\mathcal{S}(\mathbf{I}))^{k}$ , which is a constant. Hence, the functions $d_{\mathrm{sum}}$ and $d_{\mathrm{cov}}$ are representable as

\displaystyle\begin{array}[]{l}\displaystyle d_{\mathrm{sum}}(\mathbf{S})=\sum% _{e\in E}\mu_{e}(\mathbf{S})(k-\mu_{e}(\mathbf{S}))=\mathrm{Const.}-\sum_{e\in E% }\mu_{e}(\mathbf{S})^{2},\\[15.0pt] \displaystyle d_{\mathrm{cov}}(\mathbf{S})=\sum_{e\in E}\min\{1,\mu_{e}(% \mathbf{S})\}=\mathrm{Const.}-\sum_{e\in E}\max\{0,\mu_{e}(\mathbf{S})-1\}\end% {array}

(8)

for each $\mathbf{S}\in(\mathcal{S}(\mathbf{I}))^{k}$ . Thus, the problems Sum- $k$ -Diverse and Cov- $k$ -Diverse of Prob, namely, the problems of maximizing the functions $d_{\mathrm{sum}}$ and $d_{\mathrm{cov}}$ over $(\mathcal{S}(\mathbf{I}))^{k}$ , are equivalent to those of minimizing

\displaystyle d_{\mathrm{sum}}^{*}(\mathbf{S})\coloneqq\sum_{e\in E}\mu_{e}(% \mathbf{S})^{2},\qquad d_{\mathrm{cov}}^{*}(\mathbf{S})\coloneqq\sum_{e\in E}% \max\{0,\mu_{e}(\mathbf{S})-1\},

(9)

respectively.

By using the concept of discrete convex functions, we can uniformly handle these functions $d_{\mathrm{sum}}^{*}$ and $d_{\mathrm{cov}}^{*}$ as follows. A function $\varphi\colon\mathbb{Z}\to\mathbb{Z}$ is said to be discrete convex [20, Chapter 3.4] if $\varphi(x-1)+\varphi(x+1)\geq 2\varphi(x)$ for all $x\in\mathbb{Z}$ , and said to be non-decreasing on $\mathbb{Z}_{+}$ if $\varphi(x)\leq\varphi(x+1)$ for all $x\in\mathbb{Z}_{+}$ . For a discrete convex function $\varphi$ with $\varphi(0)=0$ that is non-decreasing on $\mathbb{Z}_{+}$ , we define

\displaystyle d_{\varphi}^{*}(\mathbf{S})\coloneqq\sum_{e\in E}\varphi(\mu_{e}% (\mathbf{S}))

(10)

for $\mathbf{S}\in(\mathcal{S}(\mathbf{I}))^{k}$ . Since the functions $x\mapsto x^{2}$ and $x\mapsto\max\{0,x-1\}$ are discrete convex functions that are non-decreasing on $\mathbb{Z}_{+}$ and satisfy $0\mapsto 0$ , both of $d_{\mathrm{sum}}^{*}$ and $d_{\mathrm{cov}}^{*}$ admit such representations.

Our framework can be applied to the $k$ -diverse problem with respect to the diversity measure of the form

d_{\varphi}(\mathbf{S})\coloneqq\mathrm{Const.}-d_{\varphi}^{*}(\mathbf{S})

(11)

for $\mathbf{S}\in(\mathcal{S}(\mathbf{I}))^{k}$ , where $\varphi$ is a discrete convex function with $\varphi(0)=0$ that is non-decreasing on $\mathbb{Z}_{+}$ . In the following, we consider the $k$ -diverse problem with respect to $d_{\varphi}$ of Prob, or equivalently, the problem of minimizing $d_{\varphi}^{*}$ over $(\mathcal{S}(\mathbf{I}))^{k}$ for a discrete convex function $\varphi$ .

We then introduce the minimum $k$ -potential problem, to which we reduce the problem of minimizing $d_{\varphi}^{*}$ later. Let $G=(V,A)$ be a DAG having unique source vertex ${\top}$ and sink vertex ${\bot}$ with $\top\neq\bot$ . We refer to an assignment $p\colon V\to\mathbb{Z}_{+}$ of integers to vertices as a $k$ -potential if $p$ satisfies the following conditions:

(P1) $p(\bot)=k$ and $p(\top)=0$ .

(P2) $0\leq p(v)\leq k$ for each $v\in V$ .

(P3) $p$ is monotone non-increasing with respect to $A$ , i.e., $p(u)\leq p(v)$ for each $(u,v)\in A$ .
In the minimum $k$ -potential problem, we are given a DAG $G=(V,A)$ with unique source ${\top}$ and sink ${\bot}$ , an arc weight $w\colon A\to\mathbb{Z}_{+}$ , and a discrete convex function $\varphi\colon\mathbb{Z}\to\mathbb{Z}$ with $\varphi(0)=0$ that is non-decreasing on $\mathbb{Z}_{+}$ , and asked to find a $k$ -potential $p$ of $G$ that minimizes

\displaystyle H(p)\coloneqq\sum_{a=(u,v)\in A}w(a)\varphi(p(v)-p(u)).

(12)

We note that the function $H$ does not change even if we remove vertices $v\in V\setminus\{\bot,\top\}$ with $\Delta^{+}(v)=\Delta^{-}(v)=\emptyset$ , called isolated vertices, from $G$ .

Finally, we reduce, for an instance $\mathbf{I}$ of Prob, the problem of minimizing $d_{\varphi}^{*}$ over $(\mathcal{S}(\mathbf{I}))^{k}$ to the minimum $k$ -potential problem by utilizing the property (R). Let $(P,\preceq)$ be a poset having minimum element $\bot$ and maximum element $\top$ , and let $r\colon E\to P^{2}$ be a reduction map as in the property (R). Then, we construct a DAG $G_{\mathbf{I}}$ whose vertex set is $V=P$ and whose arc set is $A=A_{P}\cup A_{E}$ , where $A_{P}$ is an arc set such that a DAG $(P,A_{P})$ represents the poset $(P,\preceq)$ , and $A_{E}\coloneqq\{(e^{-},e^{+})\mid e\in E\}$ . The resulting $G_{\mathbf{I}}$ is still a DAG that represents $P$ and has unique source $\top$ and sink $\bot$ ; each arc $(e^{-},e^{+})\in A_{E}$ is compatible with the partial order $\preceq$ of $P$ , since $e^{-}\succeq e^{+}$ . An arc weight $w_{\mathbf{I}}\colon A\to\mathbb{Z}_{+}$ is defined by $w_{\mathbf{I}}(a)\coloneqq|\{e\in E\mid a=(e^{-},e^{+})\}|$ for $a\in A$ . Note that $w_{\mathbf{I}}(a)=0$ for each $a\in A\setminus A_{E}$ . We set $\varphi$ , which satisfies the non-decreasing property on $\mathbb{Z}_{+}$ and $\varphi(0)=0$ , as the input discrete convex function of the minimum $k$ -potential problem. Then, the triple $(G_{\mathbf{I}},w_{\mathbf{I}},\varphi)$ is an instance of the minimum $k$ -potential problem; its construction time is $|E|$ .

Example 4.

Recall the toy example in Example 1. A graph $G=(P=\{\bot,u,v,\top\},A_{P})$ with $A_{P}=\{(u,\bot),(v,u),(\top,v)\}$ is a DAG representing $(P,\preceq)$ . By construction, $A_{E}=\{(u,\bot),(v,\bot),(\top,u),(\top,v)\}$ . Then, the DAG $G_{\mathbf{I}}=(P,A_{P}\cup A_{E})$ is illustrated in Figure 1.

Figure 1: A DAG

G_{\mathbf{I}}

for the toy example. Arcs in

A_{P}

and

A_{E}\setminus A_{P}

are shown as solid and dashed lines, respectively.

The arc weights are $w_{\mathbf{I}}(v,u)=0$ , and $w_{\mathbf{I}}(a)=1$ for all other arcs $a\in(A_{P}\cup A_{E})\setminus\{(v,u)\}$ .

Intuitively, any $k$ -potential represents the direct sum of $k$ ideals $I_{1},I_{2},\dots,I_{k}\in\mathcal{I}(P^{*})$ as a multiset. Conversely, for any $k$ ideals in $\mathcal{I}(P^{*})$ , there exists a $k$ -potential that represents their direct sum as a multiset. Furthermore, by the property (R), each ideal of $P^{*}$ corresponds to a solution of $\mathbf{I}$ via ${\sup}_{r}$ . The following lemma verifies this intuition.

Lemma 5 ( $\star$ ).

For each $k$ -tuple $\mathbf{S}\in(\mathcal{S}(\mathbf{I}))^{k}$ , there is a $k$ -potential $p_{\mathbf{S}}$ of $G_{\mathbf{I}}$ such that $H(p_{\mathbf{S}})=d_{\varphi}^{*}(\mathbf{S})$ . Conversely, for each $k$ -potential $p$ of $G_{\mathbf{I}}$ , there is a $k$ -tuple $\mathbf{S}_{p}\in(\mathcal{S}(\mathbf{I}))^{k}$ of solutions of $\mathbf{I}$ such that $H(p)=d_{\varphi}^{*}(\mathbf{S}_{p})$ , and we can construct $\mathbf{S}_{p}$ from $p$ in $O(|E|+kq)$ time.

Lemma 5 immediately implies that we can construct in $O(|E|+kq)$ time a minimizer $\mathbf{S}_{p}$ of $d_{\varphi}^{*}$ over $(\mathcal{S}(\mathbf{I}))^{k}$ from a minimum $k$ -potential $p$ for the instance $(G_{\mathbf{I}},w_{\mathbf{I}},\varphi)$ of the minimum $k$ -potential problem. Therefore, we obtain the following.

Theorem 6 ( $\star$ ).

We can solve the $k$ -diverse problem with respect to $d_{\varphi}$ of Prob in $O(T_{P}(\mathbf{I})+T_{r}(\mathbf{I})+T_{\textup{mp}}(\mathbf{I})+|E|+kq)$ time, where $\mathbf{I}$ is a given instance of Prob and $T_{\mathrm{mp}}(\mathbf{I})$ is the time of solving the instance of the minimum $k$ -potential problem reduced from $\mathbf{I}$ .

4 Solving the minimum $𝒌$ -potential problem via minimum cost flow

Here, we reduce the minimum $k$ -potential problem to the minimum cost flow problem.

Let us first recall the minimum cost flow problem (see e.g., [19, Chapter 9] and [23, Chapter 12] for details). In this problem, we are given a digraph $G=(V,A)$ , arc cost $\gamma\colon A\to\mathbb{Z}$ , arc capacity $c\colon A\to\mathbb{Z}_{+}$ , and vertex demand $d\colon V\to\mathbb{Z}$ , and asked to find a feasible flow $f\colon A\to\mathbb{Z}_{+}$ (i.e., $f(a)\leq c(a)$ for $a\in A$ ) that minimizes $\sum_{a\in A}\gamma(a)f(a)$ subject to $\partial f(v)=d(v)$ for all $v\in V$ . An optimal solution of the minimum cost flow problem is called a minimum cost flow. We denote by $T_{\mathrm{mcf}}(n,m,\Gamma,C,D)$ the time required to solve the minimum cost flow problem for a network of $n$ vertices and $m$ arcs with cost at most $\Gamma$ in absolute values, capacity at most $C$ , and a demand vector with values at most $D$ in absolute values. Using the state-of-the-art algorithm for the minimum cost flow problem given in [25], we have $T_{\mathrm{mcf}}(n,m,\Gamma,C,D)=O(m^{1+o(1)}\log(\max\{C,D\})\log\Gamma)$ .

Our reduction follows the work by Ahuja, Hochbaum, and Orlin [1], who dealt with a more general problem called the convex cost integer dual network flow problem. They showed that the Lagrangian dual of their problem is reduced to the minimum cost flow problem. For the complexity analysis, we explicitly provide the reduction in our case and the construction of a minimum $k$ -potential from a minimum cost flow.

For a discrete convex function $\psi\colon\mathbb{Z}\to\mathbb{Z}$ , an integer $x\in\mathbb{Z}$ is called a breakpoint of $\psi$ if $\psi(x+1)+\psi(x-1)>2\psi(x)$ , i.e., the left slope $\psi(x)-\psi(x-1)$ and right slope $\psi(x+1)-\psi(x)$ of $\psi$ at $x$ are different. We can observe that, for each $x\in\mathbb{Z}$ , the left slope of $\psi$ at $x$ is at most the right slope of $\psi$ at $x$ . Let $B(\psi)$ denote the set of breakpoints of $\psi$ .

Let $(G=(V,A),w\colon A\to\mathbb{Z}_{+},\varphi\colon\mathbb{Z}\to\mathbb{Z})$ be an instance of the minimum $k$ -potential problem. We define $B_{k}(\varphi)\coloneqq\left(B(\varphi)\cap[0,k]\right)\cup\{0,k\}$ and suppose that $B_{k}(\varphi)=\{b_{0},b_{1},\dots,b_{z}\}$ with $(0=)b_{0}<b_{1}<\cdots<b_{z}(=k)$ . Let $s_{i}^{-}$ (resp. $s_{i}^{+}$ ) denote the left (resp. right) slope of $\varphi$ at $b_{i}\in B(\varphi)\cap[k-1]$ ; note that $s_{i}^{+}=s_{i+1}^{-}<s_{i+1}^{+}$ . We set $M\in\mathbb{Z}_{+}$ as a sufficiently large integer satisfying $M>H(p)$ for any $k$ -potential $p$ of $G$ , e.g., $M=\sum_{a\in A}w(a)\varphi(k)+1$ .

We construct an instance of the minimum cost flow problem. The vertex set of the input digraph $\bar{G}$ is $\bar{V}\coloneqq V\cup\{0\}$ . We set the arc set $\bar{A}$ of $\bar{G}$ , arc cost $\bar{\gamma}\colon\bar{A}\to\mathbb{Z}$ , and capacity $\bar{c}\colon\bar{A}\to\mathbb{Z}_{+}$ by creating

$\blacksquare$

$|B_{k}(\varphi)|=z+1$ copies of each $a\in A$ satisfying $w(a)>0$ with costs $b_{0},b_{1},\dots,b_{z-1},b_{z}$ and capacities $w(a)s_{1}^{-}+M,w(a)(s_{1}^{+}-s_{1}^{-}),w(a)(s_{2}^{+}-s_{2}^{-}),\dots,w(a)% (s_{z-1}^{+}-s_{z-1}^{-}),M-w(a)s_{z-1}^{+}$ , respectively,
$\blacksquare$

two copies of each $a\in A$ satisfying $w(a)=0$ or each $a=(0,u)$ for $u\in V\setminus\{\bot,\top\}$ with costs $0,k$ and capacities $M, M$ , respectively,
$\blacksquare$

arcs $(0,\bot)$ and $(0,\top)$ with cost $k$ and $0$ , respectively, and capacity $2M$ .

The vertex demand $\bar{d}\colon\bar{V}\to\mathbb{Z}$ is set as $\bar{d}(v)\coloneqq M(|\Delta^{+}_{G^{\prime}}(v)|-|\Delta^{-}_{G^{\prime}}(v)|)$ for each $v\in\bar{V}$ , where $G^{\prime}\coloneqq(\bar{V},A^{\prime})$ and $A^{\prime}\coloneqq A\cup\{(0,v)\mid v\in V\}$ .

Let $f^{*}$ be a minimum cost flow of the resulting instance $(\bar{G},\bar{\gamma},\bar{c},\bar{d})$ . Then we construct the residual graph $\bar{G}_{f^{*}}$ of $\bar{G}$ with respect to $f^{*}$ , and set the arc length $\ell$ of $\bar{G}_{f^{*}}$ as $\ell(a)\coloneqq\gamma(a)$ if $a\in\bar{A}$ , and $\ell(a)\coloneqq-\gamma(a)$ if $\bar{a}\in\bar{A}$ . Let us define $\bar{p}^{*}\colon\bar{V}\to\mathbb{Z}$ as a feasible potential with $\bar{p}^{*}(0)=0$ in $\bar{G}_{f^{*}}$ with respect to arc length $\ell$ , i.e., an assignment $\bar{p}^{*}\colon\bar{V}\to\mathbb{Z}$ satisfying $\bar{p}^{*}(0)=0$ and $\ell(a)\geq\bar{p}^{*}(v)-\bar{p}^{*}(u)$ for each arc $a=(u,v)\in\bar{A}$ . The following lemma justifies our reduction.

Lemma 7 ( $\star$ ).

The restriction $p^{*}\colon V\to\mathbb{Z}$ of $\bar{p}^{*}$ to $V$ forms a minimum $k$ -potential of $(G,w,\varphi)$ .

The validity of Lemma 7 follows from exactly the same argument given in [1]. For the sake of completeness, we provide a direct proof in the full version.

We then show the time complexity of solving the minimum $k$ -potential problem via our reduction. The digraph $\bar{G}$ has $|V|+1=O(|V|)$ vertices, and at most $2(|V|-1+|A_{0}|)+|B_{k}(\varphi)||A_{+}|=O(|V|+|A_{0}|+|B_{k}(\varphi)||A_{+}|)$ arcs. Here, $A_{0}$ and $A_{+}$ denote the set of arcs $a$ with $w(a)=0$ and $w(a)>0$ , respectively. Thus, we can construct the minimum cost flow instance $(\bar{G},\bar{\gamma},\bar{c},\bar{d})$ in $O(|V|+|A_{0}|+|B_{k}(\varphi)||A_{+}|)$ time. The costs and capacities are nonnegative integers at most $k$ and $\bar{C}\coloneqq\max\{2M,\max_{a\in A}w(a)\varphi(k)+M\}$ , respectively. The demands are at most $M|V|$ in absolute values. Hence, we can find a minimum cost flow $f^{*}\colon\bar{A}\to\mathbb{Z}_{+}$ of $(\bar{G},\bar{\gamma},\bar{c},\bar{d})$ in $O\left(T_{\mathrm{mcf}}(|V|,|V|+|A_{0}|+|B_{k}(\varphi)||A_{+}|,k,\bar{C},M|V|% )\right)$ time. The residual graph $\bar{G}_{f^{*}}$ of $\bar{G}$ with respect to $f^{*}$ is constructed in $O(|\bar{A}|)=O(|V|+|A_{0}|+|B_{k}(\varphi)||A_{+}|)$ time. A feasible potential $\bar{p}^{*}$ in $\bar{G}_{f^{*}}$ with respect to $\ell$ is found in $O\left(T_{\mathrm{mcf}}(|V|,|V|+|A_{0}|+|B_{k}(\varphi)||A_{+}|,k,|V|,|V|)\right)$ time by computing shortest distances from a supernode to vertices in $\bar{V}$ . Therefore, by Lemma 7, we can obtain a minimum $k$ -potential $p^{*}$ , which is a restriction of $\bar{p}^{*}$ to $V$ , in $O\left(T_{\mathrm{mcf}}(|V|,|V|+|A_{0}|+|B_{k}(\varphi)||A_{+}|,k,\bar{C},M|V|% )\right)$ time.

By applying the algorithm in [25], we obtain the following, which implies Theorem 2.

Theorem 8 ( $\star$ ).

We can solve the $k$ -diverse problem with respect to $d_{\varphi}$ of Prob for an instance $\mathbf{I}$ in $O(T_{P}(\mathbf{I})+T_{r}(\mathbf{I})+(|A_{P}|+|B_{k}(\varphi)||E|)^{1+o(1)}% \log(\varphi(k))\log k+kq)$ time. In particular, the problems Sum- $k$ -Diverse of Prob and Cov- $k$ -Diverse of Prob can be solved in $O(T_{P}(\mathbf{I})+T_{r}(\mathbf{I})+(|A_{P}|+k|E|)^{1+o(1)})$ time and in $O(T_{P}(\mathbf{I})+T_{r}(\mathbf{I})+(|A_{P}|+|E|)^{1+o(1)}\log^{2}k+kq)$ time, respectively.

5 Applications

In this section, we introduce two applications of our framework; one is the $k$ -diverse problems of Unweighted Minimum $s$ - $t$ Cut and the other is that of Stable Matching.

In order to apply our framework to the $k$ -diverse problem of a concrete combinatorial problem, we need to construct a poset and a reduction map appearing in the property (R) for each instance. In Section 5.1, we develop a common strategy of building these components. In fact, it is known that both families of minimum $s$ - $t$ cuts and stable matchings are naturally identified with set families called ring families. A ring family with inclusion order forms a poset, particularly a distributive lattice. We will utilize these facts to construct a reduction map for Unweighted Minimum $s$ - $t$ Cut and Stable Matching in Sections 5.2 and 5.3, respectively. In Section 5.4, we briefly describe a framework for the $k$ -diverse problem recently developed by De Berg, Martínez, and Spieksma in [8], and show that our framework can capture theirs in the case of Sum- $k$ -Diverse/Cov- $k$ -Diverse.

5.1 How to construct a poset and a reduction map

We start this subsection with introducing terminology on lattices (see e.g., [6] for details). A poset $(L,\preceq)$ is called a lattice if, for any two elements $x,y\in L$ , their least upper bound (join) $x\vee y$ and greatest lower bound (meet) $x\wedge y$ exist in $L$ . A lattice $L$ is said to be distributive if the distributive law $x\wedge(y\vee z)=(x\wedge y)\vee(x\wedge z)$ holds for any $x,y,z\in L$ .

A typical example of a distributive lattice is the family $\mathcal{I}(P)$ of ideals of a poset over $P$ with inclusion order $\subseteq$ . In this case, the join of two ideals $I$ and $I^{\prime}$ is their union $I\cup I^{\prime}$ , and the meet is their intersection $I\cap I^{\prime}$ ; we can easily see that they are also ideals. More generally, a ring family, which is a nonempty family $\mathcal{R}\subseteq 2^{R}$ of subsets of a nonempty finite set $R$ such that it is closed under the union and intersection, endowed with inclusion order $\subseteq$ forms a distributive lattice $(\mathcal{R},\subseteq)$ , where $X\vee Y=X\cup Y$ and $X\wedge Y=X\cap Y$ for $X,Y\in\mathcal{R}$ .

The celebrated Birkhoff’s representation theorem [5] states that every distributive lattice $(L,\preceq)$ is isomorphic to the distributive lattice $(\mathcal{I}(P),\subseteq)$ for some poset $P$ . Here, two lattices $(L,\preceq)$ and $(L^{\prime},\preceq^{\prime})$ are said to be isomorphic if there is a bijection $h\colon L\to L^{\prime}$ such that $x\preceq y$ if and only if $h(x)\preceq h(y)$ for any $x,y\in L$ . For a distributive lattice $(L,\preceq)$ , we refer to a poset $P$ such that $(L,\preceq)$ and $(\mathcal{I}(P),\subseteq)$ are isomorphic as a Birkhoff representation of $(L,\preceq)$ .

A Birkhoff representation of the distributive lattice $(\mathcal{R},\subseteq)$ over a ring family $\mathcal{R}$ can be obtained as follows. The ring family $\mathcal{R}$ has the unique minimal set $X_{\bot}\coloneqq\bigcap_{X\in\mathcal{R}}X$ and unique maximal set $X_{\top}\coloneqq\bigcup_{X\in\mathcal{R}}X$ . Take any maximal chain $X_{\bot}\eqqcolon X_{0}\subsetneq X_{1}\subsetneq\cdots\subsetneq X_{n}% \coloneqq X_{\top}$ from $X_{\bot}$ to $X_{\top}$ in $\mathcal{R}$ ; namely, there are no $X\in\mathcal{R}$ and $i\in[n]$ with $X_{i-1}\subsetneq X\subsetneq X_{i}$ . Then, $\Pi^{*}(\mathcal{R})\coloneqq\{X_{i}\setminus X_{i-1}\mid i\in[n]\}$ forms a partition of $X_{\top}\setminus X_{\bot}$ . We define a partial order $\preceq$ on $\Pi^{*}(\mathcal{R})$ by setting $\hat{X}\preceq\hat{Y}$ if and only if every $Z\in\mathcal{R}$ with $Z\supseteq\hat{Y}$ also includes $\hat{X}$ . The resulting $(\Pi^{*}(\mathcal{R}),\preceq)$ is actually a poset and is independent of the choice of a maximal chain from $X_{\bot}$ to $X_{\top}$ . It is known (see e.g., [21, Chapter 2.2.2]) that $(\Pi^{*}(\mathcal{R}),\preceq)$ is a Birkhoff representation of $(\mathcal{R},\subseteq)$ ; more precisely, the map $\mathcal{Y}\mapsto(\bigcup_{\hat{X}\in\mathcal{Y}}\hat{X})\cup X_{\bot}$ is an isomorphism from $(\mathcal{I}(\Pi^{*}(\mathcal{R})),\subseteq)$ to $(\mathcal{R},\subseteq)$ . In this paper, we refer to $(\Pi^{*}(\mathcal{R}),\preceq)$ as the Birkhoff representation of $(\mathcal{R},\subseteq)$ .

We are ready to develop a strategy to construct a reduction map by using a ring family. Let $\mathcal{R}\subseteq 2^{R}$ be a ring family over a nonempty finite set $R$ . We may assume that the minimal set $X_{\bot}$ is nonempty and the maximal set $X_{\top}$ is a proper subset of $R$ ; otherwise we add two elements $\bot$ and $\top$ to $R$ and update each subset $X\in\mathcal{R}$ as $X\cup\{\bot\}$ , which makes $\mathcal{R}$ satisfy $\bot\in X_{\bot}$ and $\top\in\overline{X}_{\top}\coloneqq R\setminus X_{\top}$ . Let $\Pi(\mathcal{R})$ denote the partition $\Pi^{*}(\mathcal{R})\cup\{X_{\bot},\overline{X}_{\top}\}$ of $R$ . We extend the partial order $\preceq$ on $\Pi^{*}(\mathcal{R})$ to that on $\Pi(\mathcal{R})$ by setting $X_{\bot}\prec\hat{X}\prec\overline{X}_{\top}$ for any $\hat{X}\in\Pi^{*}(\mathcal{R})$ . For a map $\hat{r}\colon E\to R^{2}$ given by $e\mapsto(\hat{e}^{+},\hat{e}^{-})$ , we define

\displaystyle{\sup}_{\hat{r}}(X)\coloneqq\{e\in E\mid\hat{e}^{+}\in X\not\ni% \hat{e}^{-}\}

(13)

for $X\in\mathcal{R}$ . We say that $\hat{r}\colon E\to R^{2}$ is a pre-reduction map if $\hat{e}^{-}\in X$ implies $\hat{e}^{+}\in X$ for all $X\in\mathcal{R}$ and $e\in E$ , and $\mathcal{S}(\mathbf{I})=\{{\sup}_{\hat{r}}(X)\mid X\in\mathcal{R}\}$ . We show that we can construct a reduction map as long as we have a pre-reduction map.

Lemma 9 ( $\star$ ).

Suppose that $\hat{r}\colon E\to R^{2}$ given by $e\mapsto(\hat{e}^{+},\hat{e}^{-})$ is a pre-reduction map. Then, the map $r\colon E\to\Pi(\mathcal{R})^{2}$ defined by

\displaystyle r(e)\coloneqq(\Pi(\hat{e}^{+}),\Pi(\hat{e}^{-}))\qquad(e\in E)

(14)

is a reduction map, where $\Pi(x)$ ( $x\in R$ ) denotes the unique member of $\Pi(\mathcal{R})$ that contains $x$ .

Thus, we obtain the following:

Proposition 10.

Let $R$ be a nonempty finite set, and let $\mathcal{R}$ be a ring family over $R$ . Suppose we are given a pre-reduction map $\hat{r}\colon E\to R^{2}$ and the partition $\Pi(\mathcal{R})$ of $R$ . Then, we can construct a reduction map $r\colon E\to\Pi(\mathcal{R})^{2}$ in constant time.

The property (R) requires that, for each instance $\mathbf{I}$ of Prob, the family $\mathcal{S}(\mathbf{I})$ of its solutions is closely related to the family $\mathcal{I}(P^{*})$ of ideals of the poset $P^{*}$ . Hence, it is natural to expect that $\mathcal{S}(\mathbf{I})$ admits the distributive lattice structure. We conclude this subsection by showing that this indeed holds, which might be viewed as a necessary condition on Prob for our framework to be applicable.

Recall the notation in the property (R). We may assume that $e^{+}\prec e^{-}$ for each $e\in E$ , since if $e^{+}=e^{-}$ , then no member $S$ in $\mathcal{S}(\mathbf{I})$ contains $e$ , i.e., $\mathcal{S}(\mathbf{I})\subseteq\binom{E\setminus\{e\}}{q}$ , and hence we can remove $e$ from $E$ . Let $\leq$ denote a partial order on $E$ such that $e\leq e^{\prime}$ if and only if $e=e^{\prime}$ or $e^{-}\preceq e^{\prime+}$ . This $\leq$ can be extended for $\mathcal{S}(\mathbf{I})$ by setting $S\leq T$ if and only if there exists a bijection $\pi\colon S\to T$ such that $e\leq\pi(e)$ for all $e\in S$ . Then, the following holds.

Theorem 11 ( $\star$ ).

Suppose that a combinatorial problem Prob has the properties (S) and (R). For any instance $\mathbf{I}$ of Prob, the poset $(\mathcal{S}(\mathbf{I}),\leq)$ forms a distributive lattice.

5.2 The $𝒌$ -diverse unweighted minimum $𝒔$ - $𝒕$ cut problem

In this subsection, we consider the $k$ -diverse problem of Unweighted Minimum $s$ - $t$ Cut. Here, this is the minimum $s$ - $t$ cut problem for a digraph $G=(V,A)$ with unit arc capacity $c\colon A\to\mathbb{Z}_{+}$ , i.e., $c(a)=1$ for all $a\in A$ . Clearly, Unweighted Minimum $s$ - $t$ Cut has the property (S) by setting the ground set $E$ as the arc set $A$ and the integer $q$ as the size of minimum $s$ - $t$ cuts. Our aim is to construct, for some ring family $\mathcal{R}$ , a DAG $D$ representing the Birkhoff representation $(\Pi^{*}(\mathcal{R}),\preceq)$ , the partition $\Pi(\mathcal{R})$ , and a pre-reduction map $\hat{r}$ based on $\mathcal{R}$ , which implies that Unweighted Minimum $s$ - $t$ Cut has the property (R) by Lemma 9.

Let $G=(V,A)$ with $s,t\in V$ be a digraph that is an instance of Unweighted Minimum $s$ - $t$ Cut. Then, the family $\mathcal{S}(G)\subseteq 2^{A}$ of solutions of the instance $G$ is the set of all minimum $s$ - $t$ cuts, which is a subset of the family $\{\Delta^{+}(X)\mid\text{$X\subseteq V$ with $s\in X\not\ni t$}\}$ . Let $\mathcal{R}_{st}$ denote the family of vertex subsets $X\subseteq V$ with $s\in X\not\ni t$ such that its outgoing arcs form a minimum $s$ - $t$ cut of $G$ , i.e.,

\displaystyle\mathcal{R}_{st}\coloneqq\{X\subseteq V\mid\text{$s\in X\not\ni t% $, $\Delta^{+}(X)\in\mathcal{S}(G)$}\}.

(15)

It is well-known that $\mathcal{R}_{st}$ forms a ring family, which directly follows from Lemma 12 below. Moreover, since the unique minimal set $X_{\bot}$ in $\mathcal{R}_{st}$ contains $s$ and the unique maximal set $X_{\top}$ in $\mathcal{R}_{st}$ excludes $t$ , we have $X_{\bot}\neq\emptyset\neq\overline{X}_{\top}\coloneq V\setminus X_{\top}$ . We will construct a pre-reduction map based on $\mathcal{R}_{st}$ .

We utilize the representation introduced by Picard and Queyranne [22], which is used for enumerating all minimum $s$ - $t$ cuts of a digraph (with arbitrary positive capacities). In our case, we consider the unit capacity case, and use this representation to construct a DAG representing the Birkhoff representation $(\Pi^{*}(\mathcal{R}_{st}),\preceq)$ of the distributive lattice $(\mathcal{R}_{st},\subseteq)$ , as well as the partition $\Pi(\mathcal{R}_{st})$ of $V$ .

To begin, we briefly describe the representation proposed by Picard and Queyranne. Let $f$ be an arbitrary maximum $s$ - $t$ flow of $G$ , where each arc has unit capacity. Let $G_{f}$ denote the residual graph of $f$ . Then, the following characterization is known.

Lemma 12 ([22]).

An $s$ - $t$ cut $\Delta^{+}(X)$ is a minimum $s$ - $t$ cut in $G$ if and only if $X$ is a closed set under reachability in $G_{f}$ , containing $s$ but not $t$ .

Using this, Picard and Queyranne constructed a DAG $D=(V^{\prime},A^{\prime})$ from $G$ through the following steps: (1) Contract each strongly connected component in $G_{f}$ into a single vertex, (2) remove the component containing $s$ along with all vertices reachable from $s$ , and (3) remove the component containing $t$ along with all vertices reachable to $t$ . For each vertex $v^{\prime}\in V^{\prime}$ , let $R(v^{\prime})$ denote the set of vertices in $G_{f}$ contracted into $v^{\prime}$ . Similarly, let $R(s^{\prime})$ and $R(t^{\prime})$ denote the sets of vertices in $G_{f}$ removed in steps (2) and (3), respectively. By Lemma 12, the DAG $D$ represents all minimum $s$ - $t$ cuts in $G$ . Specifically, for any $X\subseteq V^{\prime}$ that is closed under reachability in $D$ , the set of outgoing arcs from $R(X)\coloneqq(\bigcup_{v^{\prime}\in X}R(v^{\prime}))\cup R(s^{\prime})$ in $G$ forms a minimum $s$ - $t$ cut in $G$ . Conversely, for any minimum $s$ - $t$ cut $\Delta^{+}(X)$ in $G$ , the set of components corresponding to $X\setminus R(s^{\prime})$ is closed under reachability in $D$ .

In fact, $D$ is the desired DAG that represents the Birkhoff representation $(\Pi^{*}(\mathcal{R}_{st}),\preceq)$ of $(\mathcal{R}_{st},\subseteq)$ . Let $(V^{\prime},\preceq)$ be a poset where reachability in $D$ defines the partial order i.e., for any $u,v\in V^{\prime}$ , $u\preceq v$ if and only if $u$ is reachable from $v$ in $D$ . From the above discussion, the map $R\colon\mathcal{I}(V^{\prime})\to\mathcal{R}_{st}$ is an isomorphism between $(\mathcal{I}(V^{\prime}),\subseteq)$ and $(\mathcal{R}_{st},\subseteq)$ , because for all $X,X^{\prime}\in\mathcal{I}(V^{\prime})$ , we have $X\subseteq X^{\prime}$ if and only if $R(X)\subseteq R(X^{\prime})$ . Moreover, choose an arbitrary maximal chain $\emptyset=X_{0}\subsetneq X_{1}\subsetneq\cdots\subsetneq X_{n}=V^{\prime}$ in $\mathcal{I}(V^{\prime})$ . Since $R(X_{0})\subsetneq R(X_{1})\subsetneq\cdots\subsetneq R(X_{n})$ forms a maximal chain from $X_{\bot}$ to $X_{\top}$ in $\mathcal{R}_{st}$ , and $|X_{i}\setminus X_{i-1}|=1$ for all $i\in[n]$ , we obtain an isomorphism that maps each $v\in V^{\prime}$ to $R(v)$ , implying the equivalence between $(V^{\prime},\preceq)$ and $(\Pi^{*}(\mathcal{R}_{st}),\preceq)$ .

Thus, as all strongly connected components in $G$ can be found in linear time by [24], the partition $\Pi(\mathcal{R}_{st})$ and a DAG representing $(\Pi^{*}(\mathcal{R}_{st}),\preceq)$ can be constructed from $G_{f}$ in $O(|V|+|A|)$ time. Furthermore, the number of vertices and arcs representing $(\Pi^{*}(\mathcal{R}_{st}),\preceq)$ are at most $|V|$ and $|A|$ , respectively.

We then define a map $\hat{r}\colon A\to V^{2}$ by

\displaystyle\hat{r}(a)\coloneqq\begin{cases}a&\text{if $f(a)=1$},\\ (t,t)&\text{if $f(a)=0$}.\end{cases}

(16)

We confirm that $\hat{r}$ is a pre-reduction map.

Lemma 13 ( $\star$ ).

The map $\hat{r}\colon A\to V^{2}$ defined as (16) is a pre-reduction map.

This together with Proposition 10 and the above argument implies that $T_{P}(G)+T_{r}(G)=O(|V|+|A|)$ . Thus, by Theorem 8, we obtain the following, which implies Theorem 3 (1).

Theorem 14 ( $\star$ ).

The $k$ -diverse problem with respect to $d_{\varphi}$ of Unweighted Minimum $s$ - $t$ Cut can be solved in $O(n+(|B_{k}(\varphi)|m)^{1+o(1)}\log(\varphi(k))\log k+kq)$ time, where $n$ denotes the number of vertices in the input digraph $G$ , $m$ the number of arcs, and $q$ the size of any minimum $s$ - $t$ cut in $G$ . In particular, Sum- $k$ -Diverse and Cov- $k$ -Diverse of Unweighted Minimum $s$ - $t$ Cut can be solved in $O(n+(km)^{1+o(1)})$ time and in $O(n+m^{1+o(1)}\log^{2}k+kq)$ time, respectively.

5.3 The $𝒌$ -diverse stable matching problem

In this subsection, we consider the $k$ -diverse problem of Stable Matching. Let us first introduce the problem Stable Matching (see e.g., [14] for details). Let $U$ and $V$ be disjoint finite sets with the same size $n$ endowed with total orders $\leq_{u}$ on $V$ ( $u\in U$ ) and $\leq_{v}$ on $U$ ( $v\in V$ ). Intuitively, the total order $\leq_{u}$ (resp. $\leq_{v}$ ) represents the preference of $u$ on $V$ (resp. $v$ on $U$ ); $v<_{u}v^{\prime}$ means that “ $u$ prefer $v$ to $v^{\prime}$ .” A subset $M=\{(u_{1},v_{1}),(u_{2},v_{2}),\dots,(u_{n},v_{n})\}\subseteq U\times V$ is called a matching if $M$ provides the one-to-one correspondence between $U$ and $V$ , i.e., all $u_{i},v_{j}$ are different, $U=\{u_{1},u_{2},\dots,u_{n}\}$ , and $V=\{v_{1},v_{2},\dots,v_{n}\}$ . For a matching $M$ , we denote by $p_{M}(u)$ (resp. $p_{M}(v)$ ) the partner of $u\in U$ (resp. $v\in V$ ) in $M$ , i.e., $(u,p_{M}(u))\in M$ (resp. $(p_{M}(v),v)\in M$ ). A matching $M$ is said to be stable if there is no pair $(u,v)\in U\times V$ such that $v<_{u}p_{M}(u)$ and $u<_{v}p_{M}(v)$ . In Stable Matching, we are given a tuple $(U,V;(\leq_{u})_{u\in U},(\leq_{v})_{v\in V})$ of finite sets $U, V$ with the same size $n$ and the total orders $(\leq_{u})_{u\in U}$ on $V$ and $(\leq_{v})_{v\in V}$ on $U$ , and asked to find a stable matching $M$ . A stable matching always exists and can be found in $O(n^{2})$ time by the Gale–Shapley algorithm [13].

Clearly, Stable Matching has the property (S), in which $q=|U|=|V|$ . Similarly to Section 5.2, our aim is to construct, for some ring family $\mathcal{R}$ , a DAG representing the Birkhoff representation $(\Pi^{*}(\mathcal{R}),\preceq)$ , the partition $\Pi(\mathcal{R})$ , and a pre-reduction map $\hat{r}$ based on $\mathcal{R}$ .

Let $\mathbf{I}\coloneqq(U,V;(\leq_{u})_{u\in U},(\leq_{v})_{v\in V})$ be an instance of Stable Matching, and let $n\coloneqq|U|=|V|$ . Then, the family $\mathcal{S}(\mathbf{I})$ of solutions of the instance $\mathbf{I}$ is the set of all stable matchings of $\mathbf{I}$ . It is known [14, Theorem 1.3.2] that $\mathcal{S}(\mathbf{I})$ forms a distributive lattice with the partial order $\preceq$ defined by setting $M\preceq M^{\prime}$ if and only if $p_{M}(u)\leq_{u}p_{M^{\prime}}(u)$ for all $u\in U$ . For each stable matching $M\in\mathcal{S}(\mathbf{I})$ , the P-set of $M$ , denoted by $P(M)$ , is the set of pairs $(u,v)$ such that $v$ is at least as preferred as $p_{M}(u)$ by $u\in U$ , i.e., $P(M)\coloneqq\{(u,v)\in U\times V\mid v\leq_{u}p_{M}(u)\}$ . Let $\mathcal{R}_{\mathbf{I}}$ denote the family of all P-sets for $\mathbf{I}$ , i.e.,

\displaystyle\mathcal{R}_{\mathbf{I}}\coloneqq\{P(M)\mid\text{$M\in\mathcal{S}% (\mathbf{I})$}\}.

(17)

The map that sends a stable matching to its P-set is an isomorphism between $(S(\mathbf{I}),\preceq)$ and $(\mathcal{R}_{\mathbf{I}},\subseteq)$ . Hence, $\mathcal{R}_{\mathbf{I}}$ forms a ring family, which serves the base of our pre-reduction map. We note that the unique minimal set $X_{\bot}$ of $\mathcal{R}_{\mathbf{I}}$ is nonempty. To ensure that $X_{\top}$ is a proper subset of the ground set, we add a new element $\top$ to $V$ , and we regard $U\times(V\cup\{\top\})$ as the ground set in the following.

We can construct the Birkhoff representation $(\Pi^{*}(\mathcal{R}_{\mathbf{I}}),\preceq)$ of the distributive lattice $(\mathcal{R}_{\mathbf{I}},\subseteq)$ following the work of Gusfield and Irving [14, Chapter 3]. For each stable matching $M$ and each $u\in U$ , let $s_{M}(u)$ denote the most preferable element $v\in V$ for $u$ such that $u<_{v}p_{M}(v)$ , if such a $v$ exists. A rotation is an ordered list $\rho=((u_{0},v_{0}),(u_{1},v_{1}),\ldots,(u_{c-1},v_{c-1}))$ of pairs in some stable matching $M$ , satisfying $s_{M}(u_{i})=v_{i+1}$ for all $i\in[0,c-1]$ (where $i+1$ is taken modulo $c$ ). Let $\Lambda_{\mathbf{I}}$ be the set of all rotations of $\mathbf{I}$ . It is shown that the map $d$ defined by

d(\rho)=\{(u_{i},v)\in U\times V\mid i\in[0,c-1],v_{i}<_{u_{i}}v\leq_{u_{i}}v_% {i+1}\}

(18)

is a bijection between $\Lambda_{\mathbf{I}}$ and $\Pi^{*}(\mathcal{R}_{\mathbf{I}})$ . A partial order on $\Lambda_{\mathbf{I}}$ is induced by $\Pi^{*}(\mathcal{R}_{\mathbf{I}})$ through $d$ . Gusfield and Irving present an $O(n^{2})$ -time algorithm that constructs $\Lambda_{\mathbf{I}}$ along with a DAG representing the poset over $\Lambda_{\mathbf{I}}$ having $O(n^{2})$ arcs [14, Lemma 3.3.2 and Corollary 3.3.1]. In particular, their algorithm computes the unique minimal and maximal stable matchings.

We can construct $\Pi^{*}(\mathcal{R}_{\mathbf{I}})$ , which is a partition of $X_{\top}\setminus X_{\bot}$ , from $\Lambda_{\mathbf{I}}$ in $O(n^{2})$ time using $d$ . Clearly, $X_{\bot}$ is the P-set of the unique minimal stable matching. Thus, we can construct the partition $\Pi(\mathcal{R}_{\mathbf{I}})$ and a DAG with $O(n^{2})$ arcs representing $(\Pi^{*}(\mathcal{R}_{\mathbf{I}}),\preceq)$ in $O(n^{2})$ time.

We then construct a pre-reduction map. For each $u\in U$ and $v\in V$ , we denote by $v_{u}^{-}$ the element in $V$ that is the cover of $v$ with respect to $\leq_{u}$ , i.e., $v<_{u}v_{u}^{-}$ and there is no element $v^{\prime}$ in $V$ with $v<_{u}v^{\prime}<_{u}v_{u}^{-}$ . If $v$ is the maximum element with respect to $\leq_{u}$ , then we define $v_{u}^{-}\coloneqq\top$ . We define a map $\hat{r}\colon U\times V\to\left(U\times(V\cup\{\top\})\right)^{2}$ by

\displaystyle\hat{r}(u,v)\coloneqq\left((u,v),(u,v_{u}^{-})\right).

(19)

We show that this is a pre-reduction map.

Lemma 15 ( $\star$ ).

The map $\hat{r}\colon U\times V\to\left((U\times V)\cup\{\top\}\right)^{2}$ defined as (19) is a pre-reduction map.

This together with Proposition 10 and the above argument implies that we have $T_{P}(\mathbf{I})+T_{r}(\mathbf{I})=O(n^{2})$ . Thus, by Theorem 8, we obtain the following, which implies Theorem 3 (2).

Theorem 16 ( $\star$ ).

The $k$ -diverse problem with respect to $d_{\varphi}$ of Stable Matching can be solved in $O((|B_{k}(\varphi)|n^{2})^{1+o(1)}\log(\varphi(k))\log k+kn)$ time. In particular, Sum- $k$ -Diverse of Stable Matching and Cov- $k$ -Diverse of Stable Matching can be solved in $O((kn^{2})^{1+o(1)})$ time and in $O(n^{2+o(1)}\log^{2}k+kn)$ time, respectively.

5.4 The $𝒌$ -diverse problem on the product of total orders

Very recently, De Berg, Martínez, and Spieksma [8] introduce a framework for the $k$ -diverse problem on the product of total orders, which leads to the polynomial-time solvability of Sum- $k$ -Diverse/Cov- $k$ -Diverse of Unweighted Minimum $s$ - $t$ Cut/Stable Matching. In this subsection, we briefly introduce their framework and show that our framework can capture theirs for Sum- $k$ -Diverse and Cov- $k$ -Diverse. Here, we do not assume that Prob has the properties (S) and (R).

We first introduce some terminology. For a distributive lattice $(\mathcal{L},\preceq)$ , an element $x\in\mathcal{L}$ is said to be join-irreducible if $x\neq y\vee z$ for any $y,z\in\mathcal{L}\setminus\{x\}$ . Let $\mathcal{L}_{\textup{ir}}$ denote the set of join-irreducible elements in $\mathcal{L}$ . Then, it is known [5] that the subposet $(\mathcal{L}_{\textup{ir}},\preceq)$ of $\mathcal{L}$ induced by $\mathcal{L}_{\textup{ir}}$ forms a Birkhoff representation of $\mathcal{L}$ ; the map $I\mapsto\bigvee_{x\in I}x$ is an isomorphism from $(\mathcal{I}(\mathcal{L}_{\textup{ir}}),\subseteq)$ to $(\mathcal{L},\preceq)$ . We refer to this Birkhoff representation $(\mathcal{L}_{\textup{ir}},\preceq)$ as the join-irreducible representation of $\mathcal{L}$ . For total orders $(E_{1},\leq_{1}),(E_{2},\leq_{2}),\dots,(E_{q},\leq_{q})$ , their product $(\mathcal{E},\leq)$ is the poset such that its ground set $\mathcal{E}$ is the product $E_{1}\times E_{2}\times\dots\times E_{q}$ of $E_{1},E_{2},\dots,E_{q}$ and the partial order $\leq$ is defined by setting $(e_{1},e_{2},\dots,e_{q})\leq(e_{1}^{\prime},e_{2}^{\prime},\dots,e_{q}^{% \prime})$ if and only if $e_{i}\leq_{i}e_{i}^{\prime}$ for all $i\in[n]$ . Actually, this $\mathcal{E}$ forms a lattice; the meet $(e_{1},e_{2},\dots,e_{q})\wedge(e_{1}^{\prime},e_{2}^{\prime},\dots,e_{q}^{% \prime})$ is given by $(\min\{e_{i},e_{i}^{\prime}\})_{i\in[q]}$ and the join $(e_{1},e_{2},\dots,e_{q})\vee(e_{1}^{\prime},e_{2}^{\prime},\dots,e_{q}^{% \prime})$ is given by $(\max\{e_{i},e_{i}^{\prime}\})_{i\in[q]}$ , where

\displaystyle\min\{e_{i},e_{i}^{\prime}\}\coloneqq\begin{cases}e_{i}&\text{if % $e_{i}\leq_{i}e_{i}^{\prime}$},\\ e_{i}^{\prime}&\text{if $e_{i}^{\prime}<_{i}e_{i}$},\end{cases}\qquad\max\{e_{% i},e_{i}^{\prime}\}\coloneqq\begin{cases}e_{i}^{\prime}&\text{if $e_{i}\leq_{i% }e_{i}^{\prime}$},\\ e_{i}&\text{if $e_{i}^{\prime}<_{i}e_{i}$}\end{cases}

(20)

for each $i\in[n]$ . We say that a subset $\mathcal{L}\subseteq\mathcal{E}$ is a sublattice if $\mathcal{L}$ is closed under the meet $\wedge$ and join $\vee$ , i.e., $x,y\in\mathcal{L}$ implies $x\wedge y,x\vee y\in\mathcal{L}$ . We can observe that a sublattice of the product of total orders is distributive (see the paragraph after Theorem 17).

De Berg, Martínez, and Spieksma [8] impose the following property on Prob:

(T) For any instance $\mathbf{I}$ of Prob, there are $q$ total orders $(E_{1},\leq_{1}),(E_{2},\leq_{2}),\dots,(E_{q},\leq_{q})$ such that the family $\mathcal{S}(\mathbf{I})$ of solutions of $\mathbf{I}$ is a sublattice of the product of those total orders.

Then they show that, if Prob has the property (T) and we can construct (a DAG representing) the join-irreducible representation $(\mathcal{S}(\mathbf{I})_{\textup{ir}},\leq)$ of $\mathcal{S}(\mathbf{I})$ in polynomial time, then we can solve Sum- $k$ -Diverse/Cov- $k$ -Diverse of Prob in polynomial time by using an algorithm for the SFM problem over the distributive lattice $\mathcal{I}(\mathcal{S}(\mathbf{I})_{\textup{ir}})$ . The problem Stable Matching has the property (T). Indeed, for an instance $\mathbf{I}\coloneqq(U,V;(\leq_{u})_{u\in U},(\leq_{v})_{v\in V})$ of Stable Matching, we define $E_{u}\coloneqq\{(u,v)\mid v\in V\}$ for each $u\in U$ and extend the total order $\leq_{u}$ on $V$ to that on $E_{u}$ by setting $(u,v)\leq_{u}(u,v^{\prime})$ if and only if $v\leq_{u}v^{\prime}$ . Then, $\mathcal{S}(\mathbf{I})$ forms a sublattice of the product of $(E_{u},\leq_{u})$ for all $u\in U$ . Similarly, Unweighted Minimum $s$ - $t$ Cut also has the property (T) by introducing the left-right order to $q$ arc disjoint paths, where $q$ denotes the size of a minimum $s$ - $t$ cut (or the maximum number of arc disjoint $s$ - $t$ paths); see [7, 8] for details.

We can show the following, whose proof is almost the same as that in Section 5.3.

Theorem 17 ( $\star$ ).

If a combinatorial problem Prob has the property (T), then it also has the properties (S) and (R).

Here, for a solution $X=(e_{1},e_{2},\dots,e_{q})\in\mathcal{S}(\mathbf{I})$ , we define its P-set $P(X)$ by $P(X)\coloneqq\{(e_{1}^{\prime},e_{2}^{\prime},\dots,e_{q}^{\prime})\mid e_{i}^% {\prime}\leq_{i}e_{i}\ (i\in[q])\}$ and denote by $\mathcal{R}_{\mathbf{I}}$ the family of all P-sets of $\mathbf{I}$ , as in Section 5.3. Then we can easily see that $(\mathcal{S}(\mathbf{I}),\leq)$ and $(\mathcal{R}_{\mathbf{I}},\subseteq)$ are isomorphic, both of which are distributive.

Let us see the result of De Berg, Martínez, and Spieksma [8] in detail to compare the time complexity of their algorithm with ours. Let $E$ be the disjoint union (or direct sum) of $E_{1},E_{2},\dots,E_{q}$ . We denote by $T_{\textup{ir}}(\mathbf{I})$ the time required to construct a DAG representing the join-irreducible representation $(\mathcal{S}(\mathbf{I})_{\textup{ir}},\leq)$ , and by $T_{\textup{SFM}}(n,\textup{EO})$ the time required to minimize a submodular function $f$ with $n$ variables such that one value evaluation of $f$ takes EO time. Then, it is shown in [8] that we can solve Sum- $k$ -Diverse/Cov- $k$ -Diverse of Prob having the property (T) in $O(T_{\textup{ir}}(\mathbf{I})+T_{\textup{SFM}}(k|E|,k^{2}|E|q))$ time. Using the state-of-the-art algorithm for SFM given in [18] with we have $T_{\textup{SFM}}(n,\textup{EO})=O(n^{3}\textup{EO})$ . Hence, the running-time of the algorithms of De Berg, Martínez, and Spieksma is $O(T_{\textup{ir}}(\mathbf{I})+k^{5}|E|^{4}q)$ .

Our framework provides much faster algorithms for Sum- $k$ -Diverse and Cov- $k$ -Diverse of Prob as follows.

Theorem 18 ( $\star$ ).

Suppose that a combinatorial problem Prob has the property (T). Then we can solve Sum- $k$ -Diverse and Cov- $k$ -Diverse of Prob in $O(T_{\textup{ir}}(\mathbf{I})+|E|q+(|A|+k|E|)^{1+o(1)})$ time and in $O(T_{\textup{ir}}(\mathbf{I})+(|E|+k)q+(|A|+|E|)^{1+o(1)}\log^{2}k)$ time for an instance $\mathbf{I}$ of Prob, respectively, where $A$ denotes the arc set of a constructed DAG representing $(\mathcal{S}(\mathbf{I})_{\textup{ir}},\leq)$ .

Since $|\mathcal{S}(\mathbf{I})_{\textup{ir}}|$ is upper-bounded by the length of a maximal chain in the product of total orders $(E_{1},\leq_{1}),(E_{2},\leq_{2}),\dots,(E_{q},\leq_{q})$ , we have $|\mathcal{S}(\mathbf{I})_{\textup{ir}}|=O(|E|)$ . Thus, $|A|$ is upper-bounded by $|\mathcal{S}(\mathbf{I})_{\textup{ir}}|(|\mathcal{S}(\mathbf{I})_{\textup{ir}}% |-1)=O(|E|^{2})$ . Even in the worst case of $|A|=\Theta(|E|^{2})$ , the running-time of our algorithms for Sum- $k$ -Diverse and Cov- $k$ -Diverse of Prob are $O(T_{\textup{ir}}(\mathbf{I})+(|E|^{2}+k|E|)^{1+o(1)})$ time and $O(T_{\textup{ir}}(\mathbf{I})+kq+|E|^{2+o(1)}\log^{2}k)$ time, respectively, which are much faster than the previous ones that take $O(T_{\textup{ir}}(\mathbf{I})+k^{5}|E|^{4}q)$ time.

We also mention that the framework of [8] works for a diversity measure based on lengths of the maximal chains. It may be interesting to generalize our result to treat such a measure.

References

[1] R. K. Ahuja, D. S. Hochbaum, and J. B. Orlin. Solving the convex cost integer dual network flow problem. Management Science, 49(7):950–964, 2003. doi:10.1287/MNSC.49.7.950.16384.
[2] F. Bach. Learning with submodular functions: A convex optimization perspective. Foundations and Trends® in Machine Learning, 6(2-3):145–373, 2013. doi:10.1561/2200000039.
[3] J. Baste, M. R. Fellows, L. Jaffke, T. Masařík, M. de Oliveira Oliveira, G. Philip, and F. A. Rosamond. Diversity of solutions: An exploration through the lens of fixed-parameter tractability theory. Artificial Intelligence, 303(103644):1–15, 2022.
[4] J. Baste, L. Jaffke, T. Masařík, G. Philip, and G. Rote. FPT algorithms for diverse collections of hitting sets. Algorithms, 12(12):254, 2019. doi:10.3390/A12120254.
[5] G. Birkhoff. Rings of sets. Duke Mathematical Journal, 3(3):443–454, 1937.
[6] B. A. Davey and H. A. Priestley. Introduction to Lattices and Order. Cambridge University Press, 2nd edition, 2002.
[7] M. de Berg, A. L. Martínez, and F. Spieksma. Finding diverse minimum s-t cuts. In Proceedings of the 34th International Symposium on Algorithms and Computation (ISAAC 2023), volume 283, pages 24:1–24:17, 2023. doi:10.4230/LIPICS.ISAAC.2023.24.
[8] M. de Berg, A. L. Martínez, and F. Spieksma. Finding diverse solutions in combinatorial problems with a distributive lattice structure. arXiv:2504.02369, 2025. doi:10.48550/arXiv.2504.02369.
[9] E. Eiben, T. Koana, and M. Wahlström. Determinantal sieving. In Proceedings of the 2024 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2024), pages 377–423, 2024.
[10] F. V. Fomin, P. A. Golovach, L. Jaffke, G. Philip, and D. Sagunov. Diverse pairs of matchings. Algorithmica, 86(6):2026–2040, 2024. doi:10.1007/S00453-024-01214-7.
[11] F. V. Fomin, P. A. Golovach, F. Panolan, G. Philip, and S. Saurabh. Diverse collections in matroids and graphs. Mathematical Programming, 204(1):415–447, 2024. doi:10.1007/S10107-023-01959-Z.
[12] S. Fujishige. Submodular Functions and Optimization. Annals of Discrete Mathematics. Elsevier Science, London, England, 2nd edition, 2005.
[13] D. Gale and L. S. Shapley. College admissions and the stability of marriage. The American Mathematical Monthly, 69(1):9–15, 1962.
[14] D. Gusfield and R. W. Irving. The Stable Marriage Problem: Structure and Algorithms. Foundations of Computing Series. The MIT Press, 1989.
[15] T. Hanaka, M. Kiyomi, Y. Kobayashi, Y. Kobayashi, K. Kurita, and Y. Otachi. A framework to design approximation algorithms for finding diverse solutions in combinatorial problems. In Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI 2023), volume 37, pages 3968–3976, 2023. doi:10.1609/AAAI.V37I4.25511.
[16] T. Hanaka, Y. Kobayashi, K. Kurita, S. W. Lee, and Y. Otachi. Computing diverse shortest paths efficiently: A theoretical and experimental study. In Proceedings of the 36th AAAI Conference on Artificial Intelligence (AAAI 2022), volume 36, pages 3758–3766, 2022. doi:10.1609/AAAI.V36I4.20290.
[17] T. Hanaka, Y. Kobayashi, K. Kurita, and Y. Otachi. Finding diverse trees, paths, and more. In Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI 2021), volume 35, pages 3778–3786, 2021. doi:10.1609/AAAI.V35I5.16495.
[18] H. Jiang. Minimizing convex functions with rational minimizers. Journal of the ACM, 70(1):5:1–5:27, 2022. doi:10.1145/3566050.
[19] B. Korte and J. Vygen. Combinatorial Optimization: Theory and Algorithms. Springer, Berlin, Heidelberg, 6th edition, 2018.
[20] K. Murota. Discrete Convex Analysis. Society for Industrial and Applied Mathematics, 2003.
[21] K. Murota. Matrices and Matroids for Systems Analysis. Springer, Berlin, Heidelberg, 2010.
[22] J.-C. Picard and M. Queyranne. On the structure of all minimum cuts in a network and applications, volume 13 of Mathematical Programming Studies, pages 8–16. Springer, 1980.
[23] A. Schrijver. Combinatorial Optimization: Polyhedra and Efficiency. Algorithms and Combinatorics. Springer, 2003.
[24] M. Sharir. A strong-connectivity algorithm and its applications in data flow analysis. Computers & Mathematics with Applications, 7(1):67–72, 1981.
[25] J. van den Brand, L. Chen, R. Kyng, Y. P. Liu, R. Peng, M. P. Gutenberg, S. Sachdeva, and A. Sidford. A deterministic almost-linear time algorithm for minimum-cost flow. In Proceedings of the IEEE 64th Annual Symposium on Foundations of Computer Science (FOCS 2023), pages 503–514, 2023. doi:10.1109/FOCS57990.2023.00037.

[bib.bib1] [1] R. K. Ahuja, D. S. Hochbaum, and J. B. Orlin. Solving the convex cost integer dual network flow problem. Management Science, 49(7):950–964, 2003. doi:10.1287/MNSC.49.7.950.16384.

[bib.bib2] [2] F. Bach. Learning with submodular functions: A convex optimization perspective. Foundations and Trends® in Machine Learning, 6(2-3):145–373, 2013. doi:10.1561/2200000039.

[bib.bib3] [3] J. Baste, M. R. Fellows, L. Jaffke, T. Masařík, M. de Oliveira Oliveira, G. Philip, and F. A. Rosamond. Diversity of solutions: An exploration through the lens of fixed-parameter tractability theory. Artificial Intelligence, 303(103644):1–15, 2022.

[bib.bib4] [4] J. Baste, L. Jaffke, T. Masařík, G. Philip, and G. Rote. FPT algorithms for diverse collections of hitting sets. Algorithms, 12(12):254, 2019. doi:10.3390/A12120254.

[bib.bib5] [5] G. Birkhoff. Rings of sets. Duke Mathematical Journal, 3(3):443–454, 1937.

[bib.bib6] [6] B. A. Davey and H. A. Priestley. Introduction to Lattices and Order. Cambridge University Press, 2nd edition, 2002.

[bib.bib7] [7] M. de Berg, A. L. Martínez, and F. Spieksma. Finding diverse minimum s-t cuts. In Proceedings of the 34th International Symposium on Algorithms and Computation (ISAAC 2023), volume 283, pages 24:1–24:17, 2023. doi:10.4230/LIPICS.ISAAC.2023.24.

[bib.bib8] [8] M. de Berg, A. L. Martínez, and F. Spieksma. Finding diverse solutions in combinatorial problems with a distributive lattice structure. arXiv:2504.02369, 2025. doi:10.48550/arXiv.2504.02369.

[bib.bib9] [9] E. Eiben, T. Koana, and M. Wahlström. Determinantal sieving. In Proceedings of the 2024 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2024), pages 377–423, 2024.

[bib.bib10] [10] F. V. Fomin, P. A. Golovach, L. Jaffke, G. Philip, and D. Sagunov. Diverse pairs of matchings. Algorithmica, 86(6):2026–2040, 2024. doi:10.1007/S00453-024-01214-7.

[bib.bib11] [11] F. V. Fomin, P. A. Golovach, F. Panolan, G. Philip, and S. Saurabh. Diverse collections in matroids and graphs. Mathematical Programming, 204(1):415–447, 2024. doi:10.1007/S10107-023-01959-Z.

[bib.bib12] [12] S. Fujishige. Submodular Functions and Optimization. Annals of Discrete Mathematics. Elsevier Science, London, England, 2nd edition, 2005.

[bib.bib13] [13] D. Gale and L. S. Shapley. College admissions and the stability of marriage. The American Mathematical Monthly, 69(1):9–15, 1962.

[bib.bib14] [14] D. Gusfield and R. W. Irving. The Stable Marriage Problem: Structure and Algorithms. Foundations of Computing Series. The MIT Press, 1989.

[bib.bib15] [15] T. Hanaka, M. Kiyomi, Y. Kobayashi, Y. Kobayashi, K. Kurita, and Y. Otachi. A framework to design approximation algorithms for finding diverse solutions in combinatorial problems. In Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI 2023), volume 37, pages 3968–3976, 2023. doi:10.1609/AAAI.V37I4.25511.

[bib.bib16] [16] T. Hanaka, Y. Kobayashi, K. Kurita, S. W. Lee, and Y. Otachi. Computing diverse shortest paths efficiently: A theoretical and experimental study. In Proceedings of the 36th AAAI Conference on Artificial Intelligence (AAAI 2022), volume 36, pages 3758–3766, 2022. doi:10.1609/AAAI.V36I4.20290.

[bib.bib17] [17] T. Hanaka, Y. Kobayashi, K. Kurita, and Y. Otachi. Finding diverse trees, paths, and more. In Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI 2021), volume 35, pages 3778–3786, 2021. doi:10.1609/AAAI.V35I5.16495.

[bib.bib18] [18] H. Jiang. Minimizing convex functions with rational minimizers. Journal of the ACM, 70(1):5:1–5:27, 2022. doi:10.1145/3566050.

[bib.bib19] [19] B. Korte and J. Vygen. Combinatorial Optimization: Theory and Algorithms. Springer, Berlin, Heidelberg, 6th edition, 2018.

[bib.bib20] [20] K. Murota. Discrete Convex Analysis. Society for Industrial and Applied Mathematics, 2003.

[bib.bib21] [21] K. Murota. Matrices and Matroids for Systems Analysis. Springer, Berlin, Heidelberg, 2010.

[bib.bib22] [22] J.-C. Picard and M. Queyranne. On the structure of all minimum cuts in a network and applications, volume 13 of Mathematical Programming Studies, pages 8–16. Springer, 1980.

[bib.bib23] [23] A. Schrijver. Combinatorial Optimization: Polyhedra and Efficiency. Algorithms and Combinatorics. Springer, 2003.

[bib.bib24] [24] M. Sharir. A strong-connectivity algorithm and its applications in data flow analysis. Computers & Mathematics with Applications, 7(1):67–72, 1981.

[bib.bib25] [25] J. van den Brand, L. Chen, R. Kyng, Y. P. Liu, R. Peng, M. P. Gutenberg, S. Sachdeva, and A. Sidford. A deterministic almost-linear time algorithm for minimum-cost flow. In Proceedings of the IEEE 64th Annual Symposium on Foundations of Computer Science (FOCS 2023), pages 503–514, 2023. doi:10.1109/FOCS57990.2023.00037.

A General Framework for Finding Diverse Solutions via Network Flow and Its Applications

Abstract

Keywords and phrases:

Funding:

Copyright and License:

2012 ACM Subject Classification:

Related Version:

Acknowledgements:

DOI:

Event:

Editors:

Series and Publisher:

1 Introduction

Example 1.

Theorem 2.

Theorem 3.

Related work.

2 Preliminaries

Posets.

Network flows.

3 Reduction to the minimum 𝒌-potential problem

Example 4.

Lemma 5 (⋆).

Theorem 6 (⋆).

4 Solving the minimum 𝒌-potential problem via minimum cost flow

Lemma 7 (⋆).

Theorem 8 (⋆).

5 Applications

5.1 How to construct a poset and a reduction map

Lemma 9 (⋆).

Proposition 10.

Theorem 11 (⋆).

5.2 The 𝒌-diverse unweighted minimum 𝒔-𝒕 cut problem

Lemma 12 ([22]).

Lemma 13 (⋆).

Theorem 14 (⋆).

5.3 The 𝒌-diverse stable matching problem

Lemma 15 (⋆).

Theorem 16 (⋆).

5.4 The 𝒌-diverse problem on the product of total orders

Theorem 17 (⋆).

Theorem 18 (⋆).

References

3 Reduction to the minimum $𝒌$ -potential problem

Lemma 5 ( $\star$ ).

Theorem 6 ( $\star$ ).

4 Solving the minimum $𝒌$ -potential problem via minimum cost flow

Lemma 7 ( $\star$ ).

Theorem 8 ( $\star$ ).

Lemma 9 ( $\star$ ).

Theorem 11 ( $\star$ ).

5.2 The $𝒌$ -diverse unweighted minimum $𝒔$ - $𝒕$ cut problem

Lemma 13 ( $\star$ ).

Theorem 14 ( $\star$ ).

5.3 The $𝒌$ -diverse stable matching problem

Lemma 15 ( $\star$ ).

Theorem 16 ( $\star$ ).

5.4 The $𝒌$ -diverse problem on the product of total orders

Theorem 17 ( $\star$ ).

Theorem 18 ( $\star$ ).