TY - JOUR
T1 - Empirical approximation of Nash equilibria in finite Markov games with discounted payoffs
AU - Robles-Aguilar, Alan D.
AU - González-Sánchez, David
AU - Minjárez-Sosa, J. Adolfo
N1 - Publisher Copyright:
© 2022 Chinese Automatic Control Society and John Wiley & Sons Australia, Ltd.
PY - 2023/3
Y1 - 2023/3
N2 - This paper deals with finite nonzero-sum Markov games under a discounted optimality criterion and infinite horizon. The state process evolves according to a stochastic difference equation and depends on players' actions as well as a random disturbance whose distribution is unknown to the players. The actions, the states, and the values of the disturbance are observed by the players, then they use the empirical distribution of the disturbances to estimate the true distribution and make choices based on the available information. In this context, we propose an almost surely convergent procedure—possibly after passing to a subsequence—to approximate Nash equilibria of the Markov game with the true distribution of the random disturbance.
AB - This paper deals with finite nonzero-sum Markov games under a discounted optimality criterion and infinite horizon. The state process evolves according to a stochastic difference equation and depends on players' actions as well as a random disturbance whose distribution is unknown to the players. The actions, the states, and the values of the disturbance are observed by the players, then they use the empirical distribution of the disturbances to estimate the true distribution and make choices based on the available information. In this context, we propose an almost surely convergent procedure—possibly after passing to a subsequence—to approximate Nash equilibria of the Markov game with the true distribution of the random disturbance.
KW - Markov games
KW - Nash equilibrium
KW - discounted criterion
KW - empirical estimation
UR - http://www.scopus.com/inward/record.url?scp=85136520221&partnerID=8YFLogxK
U2 - 10.1002/asjc.2932
DO - 10.1002/asjc.2932
M3 - Artículo
AN - SCOPUS:85136520221
SN - 1561-8625
VL - 25
SP - 722
EP - 734
JO - Asian Journal of Control
JF - Asian Journal of Control
IS - 2
ER -