Adaptive policies for stochastic systems under a randomized discounted cost criterion

J. González-Hernández*, R. R. López-Martínez, J. A. Minjárez-Sosa

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

11 Scopus citations

Abstract

The paper deals with a class of discrete-time stochastic control processes under a discounted optimality criterion with random discount rate, and possibly unbounded costs. The state process {x t} and the discount process {a t;} evolve according to the coupled difference equations x t/+i = F(x t,a tt,a tξ t), a t+1 = G(a t,n t) where the state and discount disturbance processes {ξ t} and {η t} are sequences of i.i.d. random variables with unknown distributions θξ and θ η" respectively. Assuming observability of the process {(St, ξ t)), we use the empirical estimator of its distribution to construct asymptotically discounted optimal policie.

Original languageEnglish
Pages (from-to)149-163
Number of pages15
JournalBoletin de la Sociedad Matematica Mexicana
Volume14
Issue number1
StatePublished - Apr 2008

Keywords

  • Discounted cost criterion
  • Discrete-time stochastic systems
  • Empirical distribution
  • Optimal adaptive policy
  • Random rate

Fingerprint

Dive into the research topics of 'Adaptive policies for stochastic systems under a randomized discounted cost criterion'. Together they form a unique fingerprint.

Cite this