
Pilot: Power-Aware Hybrid Fault Tolerance in Multi-Core Embedded Systems

Ansari, Amir Hossein; Esnaashari, Moein; Safari, Sepideh; Ansari, Mohsen; Ejlali, Alireza; Henkel, Jörg 1
1 Institut für Technische Informatik (ITEC), Karlsruher Institut für Technologie (KIT)

Abstract:

With technology scaling and the integration of multiple cores on a single chip, the probability of fault occurrence has increased. These faults can be transient or permanent, so techniques are needed to manage both types. Hybrid fault tolerance techniques have emerged as effective solutions for handling both. In this paper, we propose a power-aware hybrid fault tolerance technique (called Pilot). Our approach combines checkpointing with rollback-recovery and primary/backup techniques to tolerate both kinds of faults. Moreover, in real-time embedded systems, power consumption is a critical constraint that must be managed. To do this, we exploit the Thermal Safe Power (TSP) constraint for each processing core. Based on this constraint and the utilization of each core, tasks are mapped and scheduled while guaranteeing the timing constraints. Our experimental results demonstrate that our proposed methods can meet the reliability target by tolerating the optimal number of fault occurrences in each task while reducing power consumption. Our proposed methods are compared to state-of-the-art techniques in terms of schedulability, power consumption, Quality of Service (QoS), energy consumption, and reliability. ...


Affiliated institution(s) at KIT: Institut für Technische Informatik (ITEC)
Publication type: Journal article
Publication month/year: 03.2026
Language: English
Identifier: ISSN 1045-9219, 2161-9883, 1086-3702, 1558-2183
KITopen-ID: 1000190268
Published in: IEEE Transactions on Parallel and Distributed Systems
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Volume: 37
Issue: 3
Pages: 726–743
Prepublished online: 20.01.2026
Keywords: Power consumption, hybrid fault-tolerant technique, thermal safe power, scheduling, multi-core embedded systems
Indexed in: Scopus, OpenAlex, Dimensions