Minimal Disclosure in Partially Observable Markov Decision Processes

Authors Nathalie Bertrand, Blaise Genest



PDF
Thumbnail PDF

File

LIPIcs.FSTTCS.2011.411.pdf
  • Filesize: 450 kB
  • 12 pages

Document Identifiers

Author Details

Nathalie Bertrand
Blaise Genest

Cite As Get BibTex

Nathalie Bertrand and Blaise Genest. Minimal Disclosure in Partially Observable Markov Decision Processes. In IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science (FSTTCS 2011). Leibniz International Proceedings in Informatics (LIPIcs), Volume 13, pp. 411-422, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2011) https://doi.org/10.4230/LIPIcs.FSTTCS.2011.411

Abstract

For security and efficiency reasons, most systems do not give the
users a full access to their information.  One key specification
formalism for these systems are the so called Partially Observable Markov Decision Processes (POMDP for short), which have been extensively studied in several research communities, among which AI and model-checking. In this paper we tackle the problem of the minimal information a user needs at runtime to achieve a simple goal, modeled as reaching an objective with probability one.  More precisely, to achieve her goal, the user can at each step either choose to use the partial information, or pay a fixed cost and receive the full information. The natural question is then to minimize the cost the user needs to fulfill her objective. This optimization question gives rise to two different problems, whether we consider to minimize the worst case cost, or the average cost. On
the one hand, concerning the worst case cost, we show that efficient
techniques from the model checking community can be adapted to  compute the optimal worst case cost and give optimal strategies for  the users.  On the other hand, we show that the optimal average price (a question typically considered in the AI community) cannot be computed in general, nor can it be approximated in polynomial time even up to a large approximation factor.

Subject Classification

Keywords
  • Partially Observable Markov Decision Processes
  • Stochastic Games
  • Model-Checking
  • Worst-Case/Average-Case Analysis

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail