KIT | KIT-Bibliothek | Impressum | Datenschutz

Yet Another Distributional Bellman Equation

Bäuerle, Nicole ORCID iD icon 1; Göll, Tamara ORCID iD icon 1; Jaśkiewicz, Anna
1 Institut für Stochastik (STOCH), Karlsruher Institut für Technologie (KIT)

Abstract:

We consider non-standard Markov Decision Processes (MDPs) where the target function is not only a simple expectation of the accumulated reward. Instead, we consider rather general functionals of the joint distribution of terminal state and accumulated reward which have to be optimized. For finite state and compact action space, we show how to solve these problems by defining a lifted MDP whose state space is the space of distributions over the true states of the process. We derive a Bellman equation in this setting, which can be considered as a distributional Bellman equation. Well-known cases like the standard MDP and quantile MDPs are shown to be special examples of our framework. We also apply our model to a variant of an optimal transport problem.


Volltext §
DOI: 10.5445/IR/1000185824
Veröffentlicht am 17.10.2025
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Stochastik (STOCH)
Publikationstyp Forschungsbericht/Preprint
Publikationsdatum 15.10.2025
Sprache Englisch
Identifikator KITopen-ID: 1000185824
Verlag arxiv
Serie Mathematics - Optimization and Control
Vorab online veröffentlicht am 27.05.2025
Nachgewiesen in arXiv
Dimensions
OpenAlex
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page