On the Complexity of Constrained Determinantal Point Processes

Celis, L. Elisa; Deshpande, Amit; Kathuria, Tarun; Straszak, Damian; Vishnoi, Nisheeth K.

doi:10.4230/LIPIcs.APPROX-RANDOM.2017.36

Abstract

Determinantal Point Processes (DPPs) are probabilistic models that arise in quantum physics and random matrix theory and have recently found numerous applications in theoretical computer science and machine learning. DPPs define probability distributions over subsets of a given ground set, they exhibit interesting properties such as negative correlation, and, unlike other models of negative correlation such as Markov random fields, have efficient algorithms for sampling. When applied to kernel methods in machine learning, DPPs favor subsets of the given data with more diverse features. However, many real-world applications require efficient algorithms to sample from DPPs with additional constraints on the sampled subset, e.g., partition or matroid constraints that are important from the viewpoint of ensuring priors, resource or fairness constraints on the sampled subset. Whether one can efficiently sample from DPPs in such constrained settings is an important problem that was first raised in a survey of DPPs for machine learning by Kulesza and Taskar and studied in some recent works.

The main contribution of this paper is the first  resolution of the complexity of sampling from DPPs with constraints. On the one hand, we give exact efficient algorithms for sampling from constrained DPPs when the description of the constraints is  in unary; this includes special cases of practical importance such as a small number of partition, knapsack or budget constraints. On the other hand, we prove that when the constraints are specified in binary, this problem is #P-hard via a reduction from the problem of computing mixed discriminants; implying that it may be unlikely that there is an FPRAS. Technically, our algorithmic result benefits from viewing the constrained sampling  problem via the lens of polynomials and we obtain our complexity results by providing an equivalence between computing mixed discriminants and sampling from partition constrained DPPs. As a consequence, we obtain a few  corollaries of independent interest: 1) An  algorithm to count, sample (and, hence, optimize) over the base polytope of regular matroids when there are additional (succinct) budget constraints and, 
2) An algorithm to evaluate and compute mixed characteristic polynomials, that played a central role in the resolution of the Kadison-Singer problem, for certain special cases.

N. Anari, S. O. Gharan, and A. Rezaei. Monte Carlo Markov Chain Algorithms for Sampling Strongly Rayleigh Distributions and Determinantal Point Processes. In COLT, pages 103-115, 2016.
N. Anari, S. O. Gharan, A. Saberi, and N. Srivastava. Approximating the largest root and applications to interlacing families. CoRR, abs/1704.03892, 2017. URL: http://arxiv.org/abs/1704.03892.
R. B. Bapat. Mixed discriminants of positive semidefinite matrices. Lin. Algebra &Applications, 126, 1989.
A. Barvinok. Computing mixed discriminants, mixed volumes, and permanents. Discrete & Computational Geometry, 18, 1997.
D. Bertsimas and S. Vempala. Solving convex programs by random walks. J. ACM, July 2004.
A. Z. Broder and E. W. Mayr. Counting minimum weight spanning trees. J. of Algorithms, 24(1), 1997.
L. E. Celis, A. Deshpande, T. Kathuria, and N. K. Vishnoi. How to be fair and diverse? Fairness, Accountability, and Transparency in Machine Learning, 2016.
C. Chekuri, J. Vondrak, and R. Zenklusen. Dependent randomized rounding via exchange properties of combinatorial structures. In FOCS, 2010.
A. Deshpande, T. Kathuria, D. Straszak, and N. K. Vishnoi. Combinatorial Determinantal Point Processes. ArXiv e-prints, 2016. URL: http://arxiv.org/abs/1608.00554.
A. Deshpande and L. Rademacher. Efficient Volume Sampling for Row/Column Subset Selection. In FOCS, Oct 2010.
M. E. Dyer, A. M. Frieze, and R. Kannan. A random polynomial time algorithm for approximating the volume of convex bodies. J. ACM, 38(1):1-17, 1991.
D. Eppstein. Representing all minimum spanning trees with applications to counting and generation. UC Irvine, 1995.
S. Friedland and D. Levy. A polynomial-time approximation algorithm for the number of k-matchings in bipartite graphs. ArXiv, 2006. URL: http://arxiv.org/abs/0607135.
M. Gartrell, U. Paquet, and N. Koenigstein. Bayesian low-rank determinantal point processes. In Proceedings of the 10th ACM Conference on Recommender Systems, Boston, MA, USA, September 15-19, 2016, pages 349-356, 2016.
L. Gurvits. On the complexity of mixed discriminants and related problems. In MFCS, 2005.
L. Gurvits and A. Samorodnitsky. A deterministic algorithm for approximating the mixed discriminant and mixed volume, and a combinatorial corollary. Discrete &Computational Geometry, 27(4), 2002.
B. Hajek. Cooling schedules for optimal annealing. Mathematics of operations research, 13(2), 1988.
N. Harvey. An introduction to the Kadison-Singer problem and the paving conjecture, 2013.
N. Harvey and N. Olver. Pipage rounding, pessimistic estimators and matrix concentration. In SODA'14, pages 926-945, 2014.
J. B. Hough, M. Krishnapur, Y. Peres, and B. Virág. Determinantal Processes and Independence. ArXiv Mathematics e-prints, March 2005. URL: http://arxiv.org/abs/math/0503110.
M. Jerrum, A. Sinclair, and E. Vigoda. A Polynomial-time Approximation Algorithm for the Permanent of a Matrix with Nonnegative Entries. J. ACM, 51(4), July 2004.
M. R. Jerrum, L. G. Valiant, and V. V. Vazirani. Random generation of combinatorial structures from a uniform distribution. Theoretical Computer Science, 43:169-188, 1986.
A. Krause, A. Singh, and C. Guestrin. Near-Optimal Sensor Placements in Gaussian Processes: Theory, Efficient Algorithms and Empirical Studies. J. Mach. Learn. Res., 9, June 2008.
A. Kulesza and B. Taskar. Determinantal point processes for machine learning. ArXiv, July 2012. URL: http://arxiv.org/abs/1207.6083.
C. Li, S. Jegelka, and S. Sra. Markov chain sampling in discrete probabilistic models with constraints. In NIPS, 2016.
H. Lin and J. Bilmes. A Class of Submodular Functions for Document Summarization. In Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, HLT'11, 2011.
A. W. Marcus, D. A. Spielman, and N. Srivastava. Interlacing families. II: Mixed characteristic polynomials and the Kadison-Singer problem. Ann. Math. (2), 182(1):327-350, 2015.
M. Mezard and A. Montanari. Information, physics, and computation. Oxford University Press, 2009.
R. Pemantle. Uniform random spanning trees. arXiv preprint math/0404099, 2004.
A. Prasad, S. Jegelka, and D. Batra. Submodular meets structured: Finding diverse subsets in exponentially-large structured item sets. In Advances in Neural Information Processing Systems, December 8-13 2014, Montreal, Quebec, Canada, pages 2645-2653, 2014.
P. Rebeschini and A. Karbasi. Fast mixing for discrete point processes. In Proceedings of The 28th Conference on Learning Theory, COLT 2015, Paris, France, July 3-6, 2015, pages 1480-1500, 2015.
Damian Straszak and Nisheeth K. Vishnoi. Real stable polynomials and matroids: Optimization and counting. In Proceedings of the 49th Annual ACM Symposium on Theory of Computing. ACM, 2017.
M. J. Wainwright, M. I. Jordan, et al. Graphical models, exponential families, and variational inference. Foundations and Trendsregistered in Machine Learning, 1(1-2):1-305, 2008.
K. Wei, R. K. Iyer, and J. A. Bilmes. Submodularity in data subset selection and active learning. In Proceedings of the 32nd International Conference on Machine Learning, ICML 2015, Lille, France, 6-11 July 2015, pages 1954-1963, 2015.
Y. Yue and T. Joachims. Predicting Diverse Subsets Using Structural SVMs. In ICML, 2008.
C. X. Zhai, W. W. Cohen, and J. Lafferty. Beyond Independent Relevance: Methods and Evaluation Metrics for Subtopic Retrieval. In SIGIR, 2003.
T. Zhou, Z. Kuscsik, J. Liu, M. Medo, J. R. Wakeling, and Y. Zhang. Solving the apparent diversity-accuracy dilemma of recommender systems. PNAS, 107(10), 2010.

On the Complexity of Constrained Determinantal Point Processes

Authors L. Elisa Celis, Amit Deshpande, Tarun Kathuria, Damian Straszak, Nisheeth K. Vishnoi

File

Document Identifiers

Subject Classification

Keywords

Metrics

Abstract

Cite As Get BibTex

Author Details

References

Thanks for your feedback!

Could not send message