Adaptive Dynamic Programming for Model-free Tracking of Trajectories with Time-varying Parameters

Köpf, Florian; Ramsteiner, Simon; Puccetti, Luca; Flad, Michael; Hohmann, Sören

doi:10.1002/acs.3106

Adaptive Dynamic Programming for Model-free Tracking of Trajectories with Time-varying Parameters

Köpf, Florian ¹; Ramsteiner, Simon ¹; Puccetti, Luca

¹; Flad, Michael ¹; Hohmann, Sören ¹
¹ Institut für Regelungs- und Steuerungssysteme (IRS), Karlsruher Institut für Technologie (KIT)

Abstract:

Recently proposed Adaptive Dynamic Programming (ADP) tracking controllers assume that the reference trajectory follows time-invariant exo-system dynamics—an assumption that does not hold for many applications. In order to overcome this limitation, we propose a new Q-function which explicitly incorporates a parametrized approximation of the reference trajectory. This allows learning to track a general class of trajectories by means of ADP. Once our Q-function has been learned, the associated controller handles time-varying reference trajectories without the need for further training and independent of exo-system dynamics. ... mehr

Abstract (englisch):

Model-free control based on the idea of Reinforcement Learning is a promising approach that has recently gained extensive attention. However, Reinforcement-Learning-based control methods solely focus on the regulation problem or learn to track a reference that is generated by a time-invariant exo-system. In the latter case, controllers are only able to track the time-invariant reference dynamics which they have been trained on and need to be re-trained each time the reference dynamics change. Consequently, these methods fail in a number of applications which obviously rely on a trajectory not being generated by an exo-system. ... mehr

Zugehörige Institution(en) am KIT	Institut für Regelungs- und Steuerungssysteme (IRS)
Publikationstyp	Zeitschriftenaufsatz
Publikationsmonat/-jahr	07.2020
Sprache	Englisch
Identifikator	ISSN: 0890-6327, 1099-1115 KITopen-ID: 1000105799
Erschienen in	International journal of adaptive control and signal processing
Verlag	John Wiley and Sons
Band	34
Heft	7
Seiten	839-856
Vorab online veröffentlicht am	03.03.2020
Schlagwörter	Learning Systems, Adaptive Control, Intelligent Control, Adaptive Dynamic Programming, Optimal Tracking, Reinforcement Learning, Optimal Control, Adaptive Dynamic Programming
Nachgewiesen in	Scopus Web of Science OpenAlex Dimensions
Relationen in KITopen	Verweist auf Self-Learning Longitudinal Control for On-Road Vehicles. Puccetti, Luca; Hohmann, Sören; Adamy, Jürgen (2023) Hochschulschrift (1000156193)

KITopen-Download

Verlagsausgabe

DOI: 10.5445/IR/1000105799

Veröffentlicht am 24.03.2020

Externe Links

Originalveröffentlichung
DOI: 10.1002/acs.3106

Scopus
Zitationen: 16

Web of Science
Zitationen: 14

Dimensions
Zitationen: 17

Export

Statistiken

Seitenaufrufe: 359
seit 15.02.2020

Downloads: 214
seit 26.03.2020

Repository KITopen

Adaptive Dynamic Programming for Model-free Tracking of Trajectories with Time-varying Parameters

Abstract:

Abstract (englisch):