TY - JOUR
T1 - Average optimal strategies for zero-sum Markov games with poorly known payoff function on one side
AU - Luque-Vásquez, Fernando
AU - Minjárez-Sosa, J. Adolfo
N1 - Publisher Copyright:
© 2014, American Institute of Mathematical Sciences.
PY - 2014
Y1 - 2014
N2 - We are concerned with two-person zero-sum Markov games with Borel spaces under a long-run average criterion. The payoff function is possibly unbounded and depends on a parameter which is unknown to one of the players. The parameter and the payoff function can be estimated by implementing statistical methods. Thus, our main objective is to combine such estimation procedure with a variant of the so-called vanishing discount approach to construct an average optimal pair of strategies for the game. Our results are applied to a class of zero-sum semi-Markov games.
AB - We are concerned with two-person zero-sum Markov games with Borel spaces under a long-run average criterion. The payoff function is possibly unbounded and depends on a parameter which is unknown to one of the players. The parameter and the payoff function can be estimated by implementing statistical methods. Thus, our main objective is to combine such estimation procedure with a variant of the so-called vanishing discount approach to construct an average optimal pair of strategies for the game. Our results are applied to a class of zero-sum semi-Markov games.
KW - Average payoff criterion
KW - Incomplete information
KW - Payoff estimation
KW - Zero-sum Markov and semi-Markov games
UR - http://www.scopus.com/inward/record.url?scp=85044313173&partnerID=8YFLogxK
U2 - 10.3934/jdg.2014.1.105
DO - 10.3934/jdg.2014.1.105
M3 - Artículo
SN - 2164-6074
VL - 1
SP - 105
EP - 119
JO - Journal of Dynamics and Games
JF - Journal of Dynamics and Games
IS - 1
ER -