KIT | KIT-Bibliothek | Impressum | Datenschutz

Target Value Criterion in Markov Decision Processes

Moritz, Lars Norman

We apply the Target Value Criterion to an MDP with a random planning horizon, derive an optimality equation and prove the existence of an optimal stationary policy in a generalized state space. The structure of the value function is exploited to approximate the target space by a finite subset and to derive upper and lower bounds as well as nearly optimal policies. As an extension we combine the Total Reward Criterion and the Target Value Criterion in a penalty approach.

Open Access Logo

Volltext §
DOI: 10.5445/IR/1000047288
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Operations Research (IOR)
Publikationstyp Hochschulschrift
Publikationsjahr 2014
Sprache Englisch
Identifikator urn:nbn:de:swb:90-472888
KITopen-ID: 1000047288
Verlag KIT, Karlsruhe
Abschlussart Dissertation
Fakultät Fakultät für Wirtschaftswissenschaften (WIWI)
Institut Institut für Operations Research (IOR)
Prüfungsdaten 04.12.2014
Referent/Betreuer Prof. K.-H. Waldmann
Schlagwörter Markov Decision Processes, Risk Sensitive Optimization, Target Value Criterion
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page