
Deep Decentralized Reinforcement Learning for Cooperative Control

Köpf, Florian; Tesfazgi, Samuel; Flad, Michael; Hohmann, Sören

Abstract:

In order to collaborate efficiently with unknown partners in cooperative control settings, adaptation to the partners based on online experience is required. This rather general and widely applicable control setting, in which each cooperation partner might strive for individual goals while the control laws and objectives of the partners are unknown, entails various challenges such as the non-stationarity of the environment, the multi-agent credit assignment problem, the alter-exploration problem and the coordination problem. We propose new, modular deep decentralized Multi-Agent Reinforcement Learning mechanisms to account for these challenges. To this end, our method uses a time-dependent prioritization of samples, incorporates a model of the system dynamics and utilizes variable, accountability-driven learning rates as well as simulated, artificial experiences in order to guide the learning process. The effectiveness of our method is demonstrated by means of a simulated, nonlinear cooperative control task.
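
To make the idea of a time-dependent prioritization of samples more concrete, the minimal Python sketch below shows a replay buffer that weights newer transitions more heavily when sampling. The class name, the exponential age-based decay rule and the `decay` parameter are illustrative assumptions; they are not taken from the paper and do not reproduce its exact prioritization scheme.

```python
import numpy as np


class RecencyPrioritizedBuffer:
    """Replay buffer that favors recent transitions when sampling.

    Hypothetical sketch: the exponential age-based weighting below is an
    assumption for illustration, not the prioritization rule from the paper.
    """

    def __init__(self, capacity=10_000, decay=0.999):
        self.capacity = capacity      # maximum number of stored transitions
        self.decay = decay            # assumed per-step recency decay factor
        self.storage = []             # transitions (state, action, reward, next_state)
        self.insert_step = []         # global step at which each transition was added
        self.step = 0

    def add(self, transition):
        """Store a transition, evicting the oldest one if the buffer is full."""
        if len(self.storage) >= self.capacity:
            self.storage.pop(0)
            self.insert_step.pop(0)
        self.storage.append(transition)
        self.insert_step.append(self.step)
        self.step += 1

    def sample(self, batch_size, rng=None):
        """Draw a batch in which newer transitions get exponentially larger weight."""
        rng = rng or np.random.default_rng()
        ages = self.step - np.asarray(self.insert_step)   # age in environment steps
        weights = self.decay ** ages                       # newer -> weight closer to 1
        probs = weights / weights.sum()
        idx = rng.choice(len(self.storage), size=batch_size, p=probs)
        return [self.storage[i] for i in idx]


# Minimal usage example with dummy scalar transitions
if __name__ == "__main__":
    buf = RecencyPrioritizedBuffer(capacity=100, decay=0.95)
    for t in range(200):
        buf.add((t, 0, 0.0, t + 1))
    print(buf.sample(8))
```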


DOI: 10.1016/j.ifacol.2020.12.2181
Citations (Dimensions): 2
Associated institution(s) at KIT: Institut für Regelungs- und Steuerungssysteme (IRS)
Publication type: Conference proceedings paper
Year of publication: 2020
Language: English
Identifier: ISSN 2405-8963
KITopen ID: 1000117640
Published in: 21st IFAC World Congress. Ed.: R. Findeisen
Event: 21st World Congress of the International Federation of Automatic Control (IFAC-V 2020), Online, July 11–17, 2020
Publisher: Elsevier
Pages: 1555–1562
Series: IFAC-PapersOnLine ; 53/2
Keywords: Reinforcement Learning, Deep Learning, Learning Control, Shared Control, Decentralized Control, Machine Learning, Non-stationary Systems, Nonlinear Control
Indexed in: Dimensions