Adaptive Optimal Control for Reference Tracking Independent of Exo-System Dynamics

Köpf, Florian; Westermann, Johannes; Flad, Michael; Hohmann, Sören

doi:10.1016/j.neucom.2020.04.140

Adaptive Optimal Control for Reference Tracking Independent of Exo-System Dynamics

Köpf, Florian ¹; Westermann, Johannes ²^,3; Flad, Michael ¹; Hohmann, Sören ¹
¹ Institut für Regelungs- und Steuerungssysteme (IRS), Karlsruher Institut für Technologie (KIT)
² Graduiertenkolleg 1126: Intelligente Chirurgie (Graduiertenkolleg 1126), Karlsruher Institut für Technologie (KIT)
³ Institut für Anthropomatik und Robotik (IAR), Karlsruher Institut für Technologie (KIT)

Abstract:

Model-free control based on the idea of Reinforcement Learning is a promising approach that has recently gained extensive attention. However, Reinforcement-Learning-based control methods solely focus on the regulation problem or learn to track a reference that is generated by a time-invariant exo-system. In the latter case, controllers are only able to track the time-invariant reference dynamics which they have been trained on and need to be re-trained each time the reference dynamics change. Consequently, these methods fail in a number of applications which obviously rely on a trajectory not being generated by an exo-system. One prominent example is autonomous driving. This paper provides for the first time an adaptive optimal control method capable to track arbitrary reference trajectories that are provided on a moving horizon. The main innovation is a novel Q-function that directly incorporates the given reference trajectory. This new Q-function exhibits a particular structure which allows the design of an efficient, iterative, provably convergent Reinforcement Learning algorithm that enables optimal tracking. Two real-world examples demonstrate the effectiveness of our new method.

Zugehörige Institution(en) am KIT	Institut für Anthropomatik und Robotik (IAR) Institut für Regelungs- und Steuerungssysteme (IRS)
Publikationstyp	Zeitschriftenaufsatz
Publikationsdatum	10.09.2020
Sprache	Englisch
Identifikator	ISSN: 0925-2312, 1872-8286 KITopen-ID: 1000118892
Erschienen in	Neurocomputing
Verlag	Elsevier
Band	405
Seiten	173-185
Schlagwörter	Adaptive Dynamic Programming, Optimal Tracking, Reinforcement Learning, Optimal Control
Nachgewiesen in	Dimensions Web of Science OpenAlex Scopus
Globale Ziele für nachhaltige Entwicklung

Externe Links

Download

Originalveröffentlichung
DOI: 10.1016/j.neucom.2020.04.140

Scopus
Zitationen: 8

Web of Science
Zitationen: 8

Dimensions
Zitationen: 10

Export

Statistiken

Seitenaufrufe: 222
seit 05.05.2020

Repository KITopen

Adaptive Optimal Control for Reference Tracking Independent of Exo-System Dynamics

Abstract: