KIT | KIT-Bibliothek | Impressum | Datenschutz

Microsecond-latency feedback at a particle accelerator by online reinforcement learning on hardware

Scomparin, Luca ORCID iD icon 1; Caselle, Michele 1; Santamaria Garcia, Andrea ORCID iD icon 2; Xu, Chenran ORCID iD icon 3; Blomley, Edmund ORCID iD icon 3; Dritschler, Timo ORCID iD icon 1; Mochihashi, Akira 3; Schuh, Marcel ORCID iD icon 3; Steinmann, Johannes L. ORCID iD icon 3; Bründermann, Erik ORCID iD icon 3; Kopmann, Andreas ORCID iD icon 1; Becker, Jürgen; Müller, Anke-Susanne ORCID iD icon 3; Weber, Marc
1 Institut für Prozessdatenverarbeitung und Elektronik (IPE), Karlsruher Institut für Technologie (KIT)
2 Laboratorium für Applikationen der Synchrotronstrahlung (LAS), Karlsruher Institut für Technologie (KIT)
3 Institut für Beschleunigerphysik und Technologie (IBPT), Karlsruher Institut für Technologie (KIT)

Abstract:

The commissioning and operation of future large-scale scientific experiments will challenge current tuning and control methods. Reinforcement learning (RL) algorithms are a promising solution due to their ability to dynamically adapt to changing environments and consider delayed consequences. In many real-world applications, RL policies must produce actions in real time, often within microseconds to milliseconds, imposing significant constraints on system latency and computational overhead that conventional machine learning libraries are not designed to handle. To control phenomena in real time at these timescales, RL needs to be deployed on-the-edge, namely on dedicated hardware located near the system it controls, without relying on a host CPU or cloud-based inference. In this work we present the design and deployment of an experience accumulator system in a particle accelerator. In this system, deep-RL algorithms run using hardware acceleration and act within a few microseconds, enabling the use of RL for control of phenomena like beam instabilities. The training uses the collected data offline to reduce the number of operations carried out on the acceleration hardware. ... mehr


Verlagsausgabe §
DOI: 10.5445/IR/1000192342
Veröffentlicht am 17.04.2026
Originalveröffentlichung
DOI: 10.1088/2632-2153/ae5b20
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Beschleunigerphysik und Technologie (IBPT)
Institut für Produktentwicklung (IPEK)
Institut für Prozessdatenverarbeitung und Elektronik (IPE)
Institut für Technik der Informationsverarbeitung (ITIV)
Laboratorium für Applikationen der Synchrotronstrahlung (LAS)
Publikationstyp Zeitschriftenaufsatz
Publikationsdatum 01.04.2026
Sprache Englisch
Identifikator ISSN: 2632-2153
KITopen-ID: 1000192342
HGF-Programm 54.11.11 (POF IV, LK 01) Accelerator Operation, Research and Development
Weitere HGF-Programme 54.12.03 (POF IV, LK 01) Science Systems
Erschienen in Machine Learning: Science and Technology
Verlag Institute of Physics Publishing Ltd (IOP Publishing Ltd)
Band 7
Heft 2
Seiten Art.Nr: 025056
Nachgewiesen in OpenAlex
Globale Ziele für nachhaltige Entwicklung Ziel 4 – Hochwertige BildungZiel 7 – Bezahlbare und saubere EnergieZiel 9 – Industrie, Innovation und Infrastruktur
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page