KIT | KIT-Bibliothek | Impressum | Datenschutz

Automated Search for Deep Neural Network Inference Partitioning on Embedded FPGA

Kreß, Fabian ORCID iD icon 1; Hoefer, Julian ORCID iD icon 1; Hotfilter, Tim ORCID iD icon 1; Walter, Iris ORCID iD icon 1; El Annabi, El Mahdi 1; Harbaum, Tanja 1; Becker, Jürgen 1
1 Institut für Technik der Informationsverarbeitung (ITIV), Karlsruher Institut für Technologie (KIT)

Abstract (englisch):

Deep Neural Networks (DNNs) are currently making their way into a broad range of applications. While until recently they were mainly executed on high-performance computers, they are now also increasingly found in hardware platforms of edge applications. In order to meet the constantly changing demands, deployment of embedded Field Programmable Gate Arrays (FPGAs) is particularly suitable. Despite the tremendous advantage of high flexibility, embedded FPGAs are usually resource-constrained as they require more area than comparable Application-Specific Integrated Circuits (ASICs). Consequently, co-execution of a DNN on multiple platforms with dedicated partitioning is beneficial. Typical systems consist of FPGAs and Graphics Processing Units (GPUs). Combining the advantages of these platforms while keeping the communication overhead low is a promising way to meet the increasing requirements.
In this paper, we present an automated approach to efficiently partition DNN inference between an embedded FPGA and a GPU-based central compute platform. Our toolchain focuses on the limited hardware resources available on the embedded FPGA and the link bandwidth required to send intermediate results to the GPU. ... mehr


Zugehörige Institution(en) am KIT Institut für Technik der Informationsverarbeitung (ITIV)
Publikationstyp Proceedingsbeitrag
Publikationsjahr 2023
Sprache Englisch
Identifikator ISBN: 978-3-031-23618-1
KITopen-ID: 1000155303
Erschienen in Machine Learning and Principles and Practice of Knowledge Discovery in Databases. Hrsg.: I. Koprinska. Pt. 1
Veranstaltung International Workshops of European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2022), Grenoble, Frankreich, 19.09.2022 – 23.09.2022
Verlag Springer International Publishing
Seiten 557–568
Vorab online veröffentlicht am 31.01.2023
Schlagwörter Distributed sensor systems; Neural network inference partitioning; Design space exploration; Embedded FPGA
Nachgewiesen in Dimensions
Scopus
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page