PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES WITH PARTIALLY OBSERVABLE RANDOM DISCOUNT FACTORS

E. Everardo Martinez-Garcia, J. Adolfo Minjárez-Sosa, Oscar Vega-Amaya

Producción científica: Contribución a una revistaArtículorevisión exhaustiva

Resumen

This paper deals with a class of partially observable discounted Markov decision processes defined on Borel state and action spaces, under unbounded one-stage cost. The discount rate is a stochastic process evolving according to a difference equation, which is also assumed to be partially observable. Introducing a suitable control model and filtering processes, we prove the existence of optimal control policies. In addition, we illustrate our results in a class of GI/GI/1 queueing systems where we obtain explicitly the corresponding optimality equation and the filtering process.

Idioma originalInglés
Páginas (desde-hasta)960-983
Número de páginas24
PublicaciónKybernetika
Volumen58
N.º6
DOI
EstadoPublicada - 2022

Nota bibliográfica

Publisher Copyright:
© 2022 Institute of Information Theory and Automation of The Czech Academy of Sciences. All rights reserved.

Huella

Profundice en los temas de investigación de 'PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES WITH PARTIALLY OBSERVABLE RANDOM DISCOUNT FACTORS'. En conjunto forman una huella única.

Citar esto