Kretínský, Jan ; Pérez, Guillermo A. ; Raskin, Jean-François

@InProceedings{kretnsk_et_al:LIPIcs:2018:9546,
  author    = {Jan Kret{\'i}nsk{\'y} and Guillermo A. P{\'e}rez and Jean-Fran{\c{c}}ois Raskin},
  title     = {{Learning-Based Mean-Payoff Optimization in an Unknown MDP under Omega-Regular Constraints}},
  booktitle = {29th International Conference on Concurrency Theory (CONCUR 2018)},
  pages     = {8:1--8:18},
  series    = {Leibniz International Proceedings in Informatics (LIPIcs)},
  ISBN      = {978-3-95977-087-3},
  ISSN      = {1868-8969},
  year      = {2018},
  volume    = {118},
  editor    = {Sven Schewe and Lijun Zhang},
  publisher = {Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address   = {Dagstuhl, Germany},
  URL       = {http://drops.dagstuhl.de/opus/volltexte/2018/9546},
  URN       = {urn:nbn:de:0030-drops-95468},
  doi       = {10.4230/LIPIcs.CONCUR.2018.8},
  annote    = {Keywords: Markov decision processes, Reinforcement learning, Beyond worst case}
}
Keywords:  Markov decision processes, Reinforcement learning, Beyond worst case  
Collection:  29th International Conference on Concurrency Theory (CONCUR 2018)  
Issue Date:  2018  
Date of publication:  31.08.2018 