KIT | KIT-Bibliothek | Impressum | Datenschutz

Efficient Zeroth-Order Federated Finetuning of Language Models on Resource-Constrained Devices

Ahmed, Mohamed Aboelenien; Pfeiffer, Kilian ORCID iD icon 1; Khalili, Ramin; Khdr, Heba ORCID iD icon 1; Henkel, Jörg 1
1 Institut für Technische Informatik (ITEC), Karlsruher Institut für Technologie (KIT)

Abstract:

Federated Learning (FL) is a promising paradigm for finetuning Large Language Models (LLMs) across distributed data sources while preserving data privacy. However, finetuning such large models is challenging on edge devices due to its high resource demand. Zerothorder Optimization (ZO) estimates gradients through finite-difference approximations, which rely on function evaluations under random perturbations of the model parameters. Consequently, ZO with task alignment provides a potential solution, allowing finetuning using only forward passes with inference-level memory requirements and low communication overhead, but it suffers from slow convergence and higher computational demand. In this paper, we propose a new ZO-based method that applies a more efficient technique to reduce the computational demand associated with using a large number of perturbations while preserving their convergence benefits. This is achieved by splitting the model into consecutive blocks and allocating a higher number of perturbations to the second block, enabling efficient reuse of intermediate activations to update the full network with fewer forward evaluations. ... mehr


Zugehörige Institution(en) am KIT Institut für Technische Informatik (ITEC)
Publikationstyp Zeitschriftenaufsatz
Publikationsjahr 2026
Sprache Englisch
Identifikator ISSN: 2835-8856
KITopen-ID: 1000194672
Erschienen in Transactions on Machine Learning Research
Verlag OpenReview.net
Band 2026-June
Seiten 1
Externe Relationen Siehe auch
Nachgewiesen in Scopus
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page