May 18 – 22, 2026
Virginia Tech
America/New_York timezone

Understanding and Leveraging Adaptive Algorithms' Sensitivity to Change-of-Basis

May 18, 2026, 11:00 AM
25m
McBryde Hall 113 (Virginia Tech)

McBryde Hall 113

Virginia Tech

Minisymposium Talk Numerical Linear Algebra in Machine Learning Numerical Linear Algebra in Machine Learning

Speaker

Adela DePavia (The University of Chicago)

Description

Adaptive gradient optimization algorithms—including Adam, Adagrad, and their variants—have found widespread use in machine learning, signal processing, and many other settings. However many algorithms in this family are not rotationally equivariant: in this talk we examine how a simple change-of-basis in either parameter space or data space can drastically impact both the convergence rates and the generalization of these algorithms. We begin by studying reparameterizations in parameter space, and describe a data-driven method proposed in our recent work which produces a “favorable” basis for adaptive algorithms. Our method is an orthonormal transformation based on the expected gradient outer product (EGOP) matrix. We present theoretical results and empirical evidence that reparameterizations based on the EGOP eigenbasis can improve convergence of adaptive gradient methods, even when these leading eigenspaces are approximated using randomized numerical linear algebra methods. We show that for a broad class of functions, the sensitivity of adaptive algorithms to choice-of-basis is influenced by the decay of the EGOP matrix spectrum. We illustrate the potential impact of EGOP reparameterization by presenting empirical evidence and theoretical arguments that common machine learning tasks with ``natural'' data exhibit EGOP spectral decay.

Authors

Adela DePavia (The University of Chicago) Prof. Rebecca Willett (The University of Chicago) Prof. Vasileios Charisopoulos (The University of Washington)

Presentation materials

There are no materials yet.