Microsecond-Latency Feedback at a Particle Accelerator by Online Reinforcement Learning on Hardware

Scomparin, Luca 1; Caselle, Michele; Garcia, Andrea Santamaria 2; Xu, Chenran 3; Blomley, Edmund 3; Dritschler, Timo; Mochihashi, Akira 3; Schuh, Marcel 3; Steinmann, Johannes L. 3; Bründermann, Erik 3; Kopmann, Andreas 1; Becker, Juergen; Müller, Anke-Susanne 3; Weber, Marc
1 Institut für Prozessdatenverarbeitung und Elektronik (IPE), Karlsruher Institut für Technologie (KIT)
2 Laboratorium für Applikationen der Synchrotronstrahlung (LAS), Karlsruher Institut für Technologie (KIT)
3 Institut für Beschleunigerphysik und Technologie (IBPT), Karlsruher Institut für Technologie (KIT)


The commissioning and operation of future large-scale scientific experiments will challenge current tuning and control methods. Reinforcement learning (RL) algorithms are a promising solution thanks to their ability to tackle a control problem autonomously, based on a task parameterized by a reward function. Conventional machine learning (ML) libraries are not intended for microsecond-latency applications, as they mostly optimize for throughput. Conversely, most programmable-logic implementations are meant for computation acceleration and are not intended to work in a real-time environment. To overcome these limitations, RL needs to be deployed on the edge, i.e., on the device gathering the training data. In this paper we present the design and deployment of an experience accumulator system in a particle accelerator. In this system, deep-RL algorithms run using hardware acceleration and act within a few microseconds, enabling the use of RL for the control of ultra-fast phenomena. The training is performed offline to reduce the number of operations carried out on the acceleration hardware. ...

DOI: 10.5445/IR/1000174533
Published on: 26.09.2024
Affiliated institution(s) at KIT: Institut für Beschleunigerphysik und Technologie (IBPT)
Institut für Prozessdatenverarbeitung und Elektronik (IPE)
Laboratorium für Applikationen der Synchrotronstrahlung (LAS)
Publication type: Research report/Preprint
Publication date: 24.09.2024
Language: English
Identifier: KITopen-ID 1000174533
HGF program: 54.11.11 (POF IV, LK 01) Accelerator Operation, Research and Development
Publisher: arXiv
Extent: 12 pp.
Keywords: Accelerator Physics (physics.acc-ph), High Energy Physics - Experiment (hep-ex), Instrumentation and Detectors (physics.ins-det)
Indexed in: arXiv
