A PTAS for a Class of Stochastic Dynamic Programs

Fu, Hao; Li, Jian; Xu, Pan

doi:10.4230/LIPIcs.ICALP.2018.56

File

LIPIcs.ICALP.2018.56.pdf

Filesize: 0.58 MB
14 pages

Document Identifiers

DOI: 10.4230/LIPIcs.ICALP.2018.56
URN: urn:nbn:de:0030-drops-90609

Author Details

Hao Fu

Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijng, China

Jian Li

Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijng, China

Pan Xu

Department of Computer Science, University of Maryland, College Park, USA

Cite AsGet BibTex

Hao Fu, Jian Li, and Pan Xu. A PTAS for a Class of Stochastic Dynamic Programs. In 45th International Colloquium on Automata, Languages, and Programming (ICALP 2018). Leibniz International Proceedings in Informatics (LIPIcs), Volume 107, pp. 56:1-56:14, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2018)
https://doi.org/10.4230/LIPIcs.ICALP.2018.56

Abstract

We develop a framework for obtaining polynomial time approximation schemes (PTAS) for a class of stochastic dynamic programs. Using our framework, we obtain the first PTAS for the following stochastic combinatorial optimization problems: 1) Probemax [Munagala, 2016]: We are given a set of n items, each item i in [n] has a value X_i which is an independent random variable with a known (discrete) distribution pi_i. We can probe a subset P subseteq [n] of items sequentially. Each time after {probing} an item i, we observe its value realization, which follows the distribution pi_i. We can adaptively probe at most m items and each item can be probed at most once. The reward is the maximum among the m realized values. Our goal is to design an adaptive probing policy such that the expected value of the reward is maximized. To the best of our knowledge, the best known approximation ratio is 1-1/e, due to Asadpour et al. [Asadpour and Nazerzadeh, 2015]. We also obtain PTAS for some generalizations and variants of the problem. 2) Committed Pandora's Box [Weitzman, 1979; Singla, 2018]: We are given a set of n boxes. For each box i in [n], the cost c_i is deterministic and the value X_i is an independent random variable with a known (discrete) distribution pi_i. Opening a box i incurs a cost of c_i. We can adaptively choose to open the boxes (and observe their values) or stop. We want to maximize the expectation of the realized value of the last opened box minus the total opening cost. 3) Stochastic Target [{I}lhan et al., 2011]: Given a predetermined target T and n items, we can adaptively insert the items into a knapsack and insert at most m items. Each item i has a value X_i which is an independent random variable with a known (discrete) distribution. Our goal is to design an adaptive policy such that the probability of the total values of all items inserted being larger than or equal to T is maximized. We provide the first bi-criteria PTAS for the problem. 4) Stochastic Blackjack Knapsack [Levin and Vainer, 2014]: We are given a knapsack of capacity C and probability distributions of n independent random variables X_i. Each item i in [n] has a size X_i and a profit p_i. We can adaptively insert the items into a knapsack, as long as the capacity constraint is not violated. We want to maximize the expected total profit of all inserted items. If the capacity constraint is violated, we lose all the profit. We provide the first bi-criteria PTAS for the problem.

Subject Classification

ACM Subject Classification

Theory of computation → Stochastic approximation

Keywords

stochastic optimization
dynamic program
markov decision process
block policy
approximation algorithm

Metrics

Access Statistics
Total Accesses (updated on a weekly basis)

0

PDF Downloads

0

Metadata Views

References

Marek Adamczyk, Maxim Sviridenko, and Justin Ward. Submodular stochastic probing on matroids. Mathematics of Operations Research, 41(3):1022-1038, 2016.
Arash Asadpour and Hamid Nazerzadeh. Maximizing stochastic monotone submodular functions. Management Science, 62(8):2374-2391, 2015.
Richard Bellman. Dynamic programming. In Princeton University Press, 1957.
Dimitri P. Bertsekas. Dynamic programming and optimal control. Athena scientific Belmont, MA, 1995.
Anand Bhalgat. A (2 + ε)-approximation algorithm for the stochastic knapsack problem. Unpublished Manuscript, 2011.
Anand Bhalgat, Ashish Goel, and Sanjeev Khanna. Improved approximation results for stochastic knapsack problems. Proceedings of the twenty-second annual ACM-SIAM symposium on Discrete Algorithms, pages 1647-1665, 2011.
Kai Chen and Sheldon M. Ross. An adaptive stochastic knapsack problem. European Journal of Operational Research, 239(3):625-635, 2014. URL: http://dx.doi.org/10.1016/j.ejor.2014.06.027.
Wei Chen, Wei Hu, Fu Li, Jian Li, Yu Liu, and Pinyan Lu. Combinatorial multi-armed bandit with general reward functions. Advances in Neural Information Processing Systems, 2016.
Brian C Dean, Michel X Goemans, and Jan Vondrák. Adaptivity and approximation for stochastic packing problems. In Proceedings of the sixteenth annual ACM-SIAM symposium on Discrete algorithms, pages 395-404. Society for Industrial and Applied Mathematics, 2005.
Anupam Gupta and Viswanath Nagarajan. A stochastic probing problem with applications. In International Conference on Integer Programming and Combinatorial Optimization, pages 205-216. Springer, 2013.
Anupam Gupta, Viswanath Nagarajan, and Sahil Singla. Algorithms and adaptivity gaps for stochastic probing. In Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, pages 1731-1747. Society for Industrial and Applied Mathematics, 2016.
Nir Halman, Diego Klabjan, Chung Lun Li, James Orlin, and David Simchi-Levi. Fully polynomial time approximation schemes for stochastic dynamic programs. In Nineteenth Acm-Siam Symposium on Discrete Algorithms, pages 700-709, 2008.
Nir Halman, Diego Klabjan, Chung-Lun Li, James Orlin, and David Simchi-Levi. Fully polynomial time approximation schemes for stochastic dynamic programs. SIAM Journal on Discrete Mathematics, 28(4):1725-1796, 2014.
Nir Halman, Giacomo Nannicini, and James Orlin. A computationally efficient fptas for convex stochastic dynamic programs. SIAM Journal on Optimization, 25(1):317-350, 2015.
Taylan İlhan, Seyed MR Iravani, and Mark S Daskin. The adaptive knapsack problem with stochastic rewards. Operations Research, 59(1):242-248, 2011.
Asaf Levin and Aleksander Vainer. Adaptivity in the stochastic blackjack knapsack problem. Theoretical Computer Science, 516:121-126, 2014.
Jian Li and Wen Yuan. Stochastic combinatorial optimization via poisson approximation. Proceedings of the forty-fifth annual ACM symposium on Theory of computing, pages 971-980, 2013.
Will Ma. Improvements and generalizations of stochastic knapsack and markovian bandits approximation algorithms. Mathematics of Operations Research, 2017. URL: http://dx.doi.org/10.1287/moor.2017.0884.
Kamesh Munagala. Approximation algorithms for stochastic optimization. https://simons.berkeley.edu/talks/kamesh-munagala-08-22-2016-1, Simons Institue for the Theory of Computing, 2016.
Warren B Powell. Approximate Dynamic Programming: Solving the Curses of Dimensionality, volume 842. John Wiley &Sons, 2011.
David B Shmoys and Chaitanya Swamy. An approximation scheme for stochastic linear programming and its application to stochastic integer programs. Journal of the ACM (JACM), 53(6):978-1012, 2006.
Sahil Singla. The price of information in combinatorial optimization. In Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, pages 2523-2532. SIAM, 2018.
Jan Vondrák, Chandra Chekuri, and Rico Zenklusen. Submodular function maximization via the multilinear relaxation and contention resolution schemes. In Proceedings of the forty-third annual ACM symposium on Theory of computing, pages 783-792. ACM, 2011.
Martin L. Weitzman. Optimal search for the best alternative. Econometrica, 47(3):641-654, 1979.

A PTAS for a Class of Stochastic Dynamic Programs

Authors Hao Fu, Jian Li, Pan Xu

File

Document Identifiers

Author Details

Cite AsGet BibTex

Abstract

Subject Classification

ACM Subject Classification

Keywords

Metrics

References

Thanks for your feedback!

Could not send message

A PTAS for a Class of Stochastic Dynamic Programs

Authors Hao Fu, Jian Li, Pan Xu

File

Document Identifiers

Author Details

Funding

Cite AsGet BibTex

Abstract

Subject Classification

ACM Subject Classification

Keywords

Metrics

Related Versions

References

Thanks for your feedback!

Could not send message