KIT | KIT-Bibliothek | Impressum | Datenschutz

Bridging Continuous-time LQR and Reinforcement Learning via Gradient Flow of the Bellman Error

Gießler, Armin ORCID iD icon 1; Malan, Albertus Johannes ORCID iD icon 1; Hohmann, Sören 1
1 Institut für Regelungs- und Steuerungssysteme (IRS), Karlsruher Institut für Technologie (KIT)

Abstract:

In this paper, we present a novel method for computing the optimal feedback gain of the infinite-horizon Linear Quadratic Regulator (LQR) problem via an ordinary differential equation. We introduce a novel continuous-time Bellman error, derived from the Hamilton-Jacobi-Bellman (HJB) equation, which quantifies the suboptimality of stabilizing policies and is parametrized in terms of the feedback gain. We analyze its properties, including its effective domain, smoothness, and coerciveness, and show the existence of a unique stationary point within the stability region. Furthermore, we derive a closed-form gradient expression of the Bellman error that induces a gradient flow. This converges to the optimal feedback and generates a unique trajectory that exclusively comprises stabilizing feedback policies. Additionally, this work advances interesting connections between LQR theory and Reinforcement Learning (RL) by redefining suboptimality of the Algebraic Riccati Equation (ARE) as a Bellman error, adapting a state-independent formulation, and leveraging Lyapunov equations to overcome the infinite-horizon challenge. We validate our method in a simulation and compare it to the state of the art.


Originalveröffentlichung
DOI: 10.1109/CDC57313.2025.11312822
Zugehörige Institution(en) am KIT Institut für Regelungs- und Steuerungssysteme (IRS)
Publikationstyp Proceedingsbeitrag
Publikationsdatum 09.12.2025
Sprache Englisch
Identifikator ISBN: 979-8-3315-2627-6
ISSN: 0743-1546
KITopen-ID: 1000191553
Erschienen in 2025 IEEE 64th Conference on Decision and Control (CDC)
Veranstaltung 64th IEEE Conference on Decision and Control (CDC 2025), Rio de Janeiro, Brasilien, 09.12.2025 – 12.12.2025
Verlag Institute of Electrical and Electronics Engineers (IEEE)
Seiten 3761 - 3768
Nachgewiesen in OpenAlex
Scopus
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page