Information-Theoretic Trust Regions for Stochastic Gradient-Based Optimization

Dahlinger, Philipp; Becker, Philipp 1; Maximilian, H.; Neumann, Gerhard 1
1 Institut für Anthropomatik und Robotik (IAR), Karlsruher Institut für Technologie (KIT)

Abstract (English):

Stochastic gradient-based optimization is crucial to optimize neural networks. While popular approaches heuristically adapt the step size and direction by rescaling gradients, a more principled approach to improve optimizers requires second-order information. Such methods precondition the gradient using the objective’s Hessian. Yet, computing the Hessian is usually expensive and effectively using second-order information in the stochastic gradient setting is non-trivial. We propose using Information-Theoretic Trust Region Optimization (arTuRO) for improved updates with uncertain second-order information. By modeling the network parameters as a Gaussian distribution and using a Kullback-Leibler divergence-based trust region, our approach takes bounded steps accounting for the objective’s curvature and uncertainty in the parameters. Before each update, it solves the trust region problem for an optimal step size, resulting in a more stable and faster optimization process. We approximate the diagonal elements of the Hessian from stochastic gradients using a simple recursive least squares approach, constructing a model of the expected Hessian over
...
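
The idea of a step bounded by a Kullback-Leibler trust region can be illustrated with a minimal sketch. It is not the arTuRO algorithm from the paper: it assumes a diagonal Gaussian over the parameters, updates only the mean while the variances stay fixed, and takes the diagonal Hessian estimate as given instead of fitting it by recursive least squares. The function names `kl_diag_gaussians` and `kl_bounded_step` and the bound `epsilon` are hypothetical. Under these assumptions the KL divergence of a mean shift `eta * d` is `0.5 * eta**2 * sum(d_i**2 / var_i)`, so the largest admissible step size has a closed form.

```python
import math


def kl_diag_gaussians(mu_p, var_p, mu_q, var_q):
    """KL(p || q) for two Gaussians with diagonal covariances (lists of floats)."""
    total = 0.0
    for mp, vp, mq, vq in zip(mu_p, var_p, mu_q, var_q):
        total += math.log(vq / vp) + (vp + (mp - mq) ** 2) / vq - 1.0
    return 0.5 * total


def kl_bounded_step(mu, var, grad, hess_diag, epsilon):
    """Illustrative mean-only update with a KL bound (assumption, not the paper's method).

    The direction is a diagonally preconditioned gradient step. The step size
    is the largest eta <= 1 such that
    KL(N(mu + eta * d, var) || N(mu, var)) <= epsilon.
    """
    # Use |h| with a small floor so the preconditioner stays positive.
    direction = [-g / max(abs(h), 1e-8) for g, h in zip(grad, hess_diag)]
    # For equal covariances the KL reduces to 0.5 * eta^2 * sum(d_i^2 / var_i).
    quad = sum(d * d / v for d, v in zip(direction, var))
    if quad == 0.0:
        return list(mu), 0.0
    eta = min(1.0, math.sqrt(2.0 * epsilon / quad))
    new_mu = [m + eta * d for m, d in zip(mu, direction)]
    return new_mu, eta


# Example: a large gradient is scaled back until the KL bound is met exactly.
mu, var = [0.0, 0.0], [1.0, 1.0]
new_mu, eta = kl_bounded_step(mu, var, grad=[2.0, -4.0], hess_diag=[1.0, 2.0], epsilon=0.5)
kl = kl_diag_gaussians(new_mu, var, mu, var)
```

In this sketch a larger parameter variance permits a longer step in that coordinate, which is one simple way to read "accounting for uncertainty in the parameters"; how the paper itself treats the covariance is not specified in the truncated abstract.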


Preprint
DOI: 10.5445/IR/1000168996
Published on 01.03.2024
Affiliated institution(s) at KIT: Fakultät für Informatik (INFORMATIK)
Publication type: Proceedings contribution
Publication year: 2023
Language: English
Identifier: KITopen-ID 1000168996
Published in: OPT2023: 15th Annual Workshop on Optimization for Machine Learning
Event: 37th Conference on Neural Information Processing Systems (NeurIPS 2023), New Orleans, LA, USA, 10.12.2023 – 16.12.2023
Publication note: in press