Nonlinearly Preconditioned Gradient Methods: Momentum and Stochastic AnalysisKonstantinos Oikonomidis, Jan Quan, Panagiotis Patrinoshttps://arxiv.org/abs/2510.11312 https://…
Nonlinearly Preconditioned Gradient Methods: Momentum and Stochastic AnalysisWe study nonlinearly preconditioned gradient methods for smooth nonconvex optimization problems, focusing on sigmoid preconditioners that inherently perform a form of gradient clipping akin to the widely used gradient clipping technique. Building upon this idea, we introduce a novel heavy ball-type algorithm and provide convergence guarantees under a generalized smoothness condition that is less restrictive than traditional Lipschitz smoothness, thus covering a broader class of functions. Addit…