TY - JOUR
T1 - Asymptotically optimal strategies for adaptive zero-sum discounted markov games
AU - Minjárez-Sosa, J. Adolfo
AU - Vega-Amaya, Oscar
PY - 2009
Y1 - 2009
N2 - We consider a class of discrete-time two person zero-sum Markov games with Borel state and action spaces, and possibly unbounded payoffs. The ga me evolves according to the recursive equation xn+1 = F(xn,a n,bn,n), n = 0, 1,..., where the disturbance process {n} is formed by independent and identically distributed Rk-valued random vectors, which are observable but whose common density p is unknown to both players. Under certain continuity and compactness conditions, we combine a nonstationary iteration procedure and suitable den sity estimation methods to construct asymptotically discounted optimal strategies for both players.
AB - We consider a class of discrete-time two person zero-sum Markov games with Borel state and action spaces, and possibly unbounded payoffs. The ga me evolves according to the recursive equation xn+1 = F(xn,a n,bn,n), n = 0, 1,..., where the disturbance process {n} is formed by independent and identically distributed Rk-valued random vectors, which are observable but whose common density p is unknown to both players. Under certain continuity and compactness conditions, we combine a nonstationary iteration procedure and suitable den sity estimation methods to construct asymptotically discounted optimal strategies for both players.
KW - Adaptive strategies
KW - Asymptotic opti- Mality
KW - Discounted payoff
KW - Zero-sum markov games
UR - http://www.scopus.com/inward/record.url?scp=67649268464&partnerID=8YFLogxK
U2 - 10.1137/060651458
DO - 10.1137/060651458
M3 - Artículo
SN - 0363-0129
VL - 48
SP - 1405
EP - 1421
JO - SIAM Journal on Control and Optimization
JF - SIAM Journal on Control and Optimization
IS - 3
ER -