Adaptive policies for stochastic systems under a randomized discounted cost criterion

J. González-Hernández*, R. R. López-Martínez, J. A. Minjárez-Sosa

*Autor correspondiente de este trabajo

Producción científica: Contribución a una revistaArtículorevisión exhaustiva

11 Citas (Scopus)

Resumen

The paper deals with a class of discrete-time stochastic control processes under a discounted optimality criterion with random discount rate, and possibly unbounded costs. The state process {x t} and the discount process {a t;} evolve according to the coupled difference equations x t/+i = F(x t,a tt,a tξ t), a t+1 = G(a t,n t) where the state and discount disturbance processes {ξ t} and {η t} are sequences of i.i.d. random variables with unknown distributions θξ and θ η" respectively. Assuming observability of the process {(St, ξ t)), we use the empirical estimator of its distribution to construct asymptotically discounted optimal policie.

Idioma originalInglés
Páginas (desde-hasta)149-163
Número de páginas15
PublicaciónBoletin de la Sociedad Matematica Mexicana
Volumen14
N.º1
EstadoPublicada - abr. 2008

Huella

Profundice en los temas de investigación de 'Adaptive policies for stochastic systems under a randomized discounted cost criterion'. En conjunto forman una huella única.

Citar esto