2024-05-10 08:46:36
This https://arxiv.org/abs/2211.07861 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2211.07861 has been replaced.
link: https://scholar.google.com/scholar?q=a
A global Barzilai and Borwein's gradient normalization descent method for multiobjective optimization
Yingxue Yang
https://arxiv.org/abs/2403.05070 htt…
Symmetry-guided gradient descent for quantum neural networks
Kaiming Bian, Shitao Zhang, Fei Meng, Wen Zhang, Oscar Dahlsten
https://arxiv.org/abs/2404.06108
Learning of deep convolutional network image classifiers via stochastic gradient descent and over-parametrization
Michael Kohler, Adam Krzyzak, Alisha S\"anger
https://arxiv.org/abs/2404.07128
Symmetry-guided gradient descent for quantum neural networks
Kaiming Bian, Shitao Zhang, Fei Meng, Wen Zhang, Oscar Dahlsten
https://arxiv.org/abs/2404.06108
This https://arxiv.org/abs/2402.02731 has been replaced.
link: https://scholar.google.com/scholar?q=a
Gradient Descent is Pareto-Optimal in the Oracle Complexity and Memory Tradeoff for Feasibility Problems
Moise Blanchard
https://arxiv.org/abs/2404.06720 h…
This https://arxiv.org/abs/2209.05564 has been replaced.
link: https://scholar.google.com/scholar?q=a
Generalized Gradient Descent is a Hypergraph Functor
Tyler Hanks, Matthew Klawonn, James Fairbanks
https://arxiv.org/abs/2403.19845 https://
Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models
Frederik Kunstner, Robin Yadav, Alan Milligan, Mark Schmidt, Alberto Bietti
https://arxiv.org/abs/2402.19449
This https://arxiv.org/abs/2309.05955 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csRO_…
This https://arxiv.org/abs/1901.09057 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2308.02958 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
Finite Sample Analysis and Bounds of Generalization Error of Gradient Descent in In-Context Linear Regression
Karthik Duraisamy
https://arxiv.org/abs/2405.02462
Generalized Gradient Descent is a Hypergraph Functor
Tyler Hanks, Matthew Klawonn, James Fairbanks
https://arxiv.org/abs/2403.19845 https://
On the Origins of Linear Representations in Large Language Models
Yibo Jiang, Goutham Rajendran, Pradeep Ravikumar, Bryon Aragam, Victor Veitch
https://arxiv.org/abs/2403.03867
This https://arxiv.org/abs/2309.05955 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csRO_…
Projected gradient descent algorithm for $\textit{ab initio}$ crystal structure relaxation under a fixed unit cell volume
Yukuan Hu, Junlei Yin, Xingyu Gao, Xin Liu, Haifeng Song
https://arxiv.org/abs/2405.02934
This https://arxiv.org/abs/1901.09057 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2405.03073 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
This https://arxiv.org/abs/2401.02565 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
SOFIM: Stochastic Optimization Using Regularized Fisher Information Matrix
Gayathri C, Mrinmay Sen, A. K. Qin, Raghu Kishore N, Yen-Wei Chen, Balasubramanian Raman
https://arxiv.org/abs/2403.02833
This https://arxiv.org/abs/2107.12416 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2404.19027 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_qu…
This https://arxiv.org/abs/2210.15531 has been replaced.
link: https://scholar.google.com/scholar?q=a
JaxDecompiler: Redefining Gradient-Informed Software Design
Pierrick Pochelu
https://arxiv.org/abs/2403.10571 https://arxiv.org/pdf/2…
This https://arxiv.org/abs/2310.14085 has been replaced.
link: https://scholar.google.com/scholar?q=a
Projected Gradient Descent for Spectral Compressed Sensing via Symmetric Hankel Factorization
Jinsheng Li, Wei Cui, Xu Zhang
https://arxiv.org/abs/2403.09031
This https://arxiv.org/abs/2305.19394 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_qbi…
Wirtinger gradient descent methods for low-dose Poisson phase retrieval
Benedikt Diederichs, Frank Filbir, Patricia R\"omer
https://arxiv.org/abs/2403.18527
The Blind Normalized Stein Variational Gradient Descent-Based Detection for Intelligent Massive Random Access
Xin Zhu, Ahmet Enis Cetin
https://arxiv.org/abs/2403.18846
This https://arxiv.org/abs/1905.12948 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2401.11176 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
This https://arxiv.org/abs/2304.00707 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
Optimisation challenge for superconducting adiabatic neural network implementing XOR and OR boolean functions
D. S. Pashin, M. V. Bastrakova, D. A. Rybin, I. I. Soloviev, A. E. Schegolev, N. V. Klenov
https://arxiv.org/abs/2405.03521
Fast Quantum Process Tomography via Riemannian Gradient Descent
Daniel Volya, Andrey Nikitin, Prabhat Mishra
https://arxiv.org/abs/2404.18840 https://
This https://arxiv.org/abs/2211.08212 has been replaced.
link: https://scholar.google.com/scholar?q=a
How Transformers Learn Causal Structure with Gradient Descent
Eshaan Nichani, Alex Damian, Jason D. Lee
https://arxiv.org/abs/2402.14735 https://
This https://arxiv.org/abs/2211.17157 has been replaced.
link: https://scholar.google.com/scholar?q=a
Noise misleads rotation invariant algorithms on sparse targets
Manfred K. WarmuthGoogle Inc, Wojciech Kot{\l}owskiInstitute of Computing Science, Poznan University of Technology, Poznan, Poland, Matt JonesUniversity of Colorado Boulder, Colorado, USA, Ehsan AmidGoogle Inc
https://arxiv.org/abs/2403.02697
On a Family of Relaxed Gradient Descent Methods for Quadratic Minimization
Liam MacDonald, Rua Murray, Rachael Tappenden
https://arxiv.org/abs/2404.19255 h…
Efficient simulations of Hartree--Fock equations by an accelerated gradient descent method
Y. Ohno, A. Del Maestro, T. I. Lakoba
https://arxiv.org/abs/2402.17843
This https://arxiv.org/abs/2403.14045 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
Noise misleads rotation invariant algorithms on sparse targets
Manfred K. WarmuthGoogle Inc, Wojciech Kot{\l}owskiInstitute of Computing Science, Poznan University of Technology, Poznan, Poland, Matt JonesUniversity of Colorado Boulder, Colorado, USA, Ehsan AmidGoogle Inc
https://arxiv.org/abs/2403.02697
Subdifferentially polynomially bounded functions and Gaussian smoothing-based zeroth-order optimization
Ming Lei, Ting Kei Pong, Shuqin Sun, Man-Chung Yue
https://arxiv.org/abs/2405.04150
Failures and Successes of Cross-Validation for Early-Stopped Gradient Descent
Pratik Patil, Yuchen Wu, Ryan J. Tibshirani
https://arxiv.org/abs/2402.16793 …
This https://arxiv.org/abs/2211.10777 has been replaced.
link: https://scholar.google.com/scholar?q=a
Attacking Large Language Models with Projected Gradient Descent
Simon Geisler, Tom Wollschl\"ager, M. H. I. Abdalla, Johannes Gasteiger, Stephan G\"unnemann
https://arxiv.org/abs/2402.09154
Shuffling Momentum Gradient Algorithm for Convex Optimization
Trang H. Tran, Quoc Tran-Dinh, Lam M. Nguyen
https://arxiv.org/abs/2403.03180 https://…
MegaParticles: Range-based 6-DoF Monte Carlo Localization with GPU-Accelerated Stein Particle Filter
Kenji Koide, Shuji Oishi, Masashi Yokozuka, Atsuhiko Banno
https://arxiv.org/abs/2404.16370
This https://arxiv.org/abs/2303.10599 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
This https://arxiv.org/abs/2403.14045 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
Organizing Physics with Open Energy-Driven Systems
Matteo Capucci, Owen Lynch, David I. Spivak
https://arxiv.org/abs/2404.16140 https://
This https://arxiv.org/abs/2403.03473 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…
Faster Convergence of Stochastic Accelerated Gradient Descent under Interpolation
Aaron Mishkin, Mert Pilanci, Mark Schmidt
https://arxiv.org/abs/2404.02378
MegaParticles: Range-based 6-DoF Monte Carlo Localization with GPU-Accelerated Stein Particle Filter
Kenji Koide, Shuji Oishi, Masashi Yokozuka, Atsuhiko Banno
https://arxiv.org/abs/2404.16370
This https://arxiv.org/abs/2311.00944 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
This https://arxiv.org/abs/2310.12140 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
Projected Block Coordinate Descent for sparse spike estimation
Pierre-Jean B\'enardIMB, Yann TraonmilinIMB, Jean Fran\c{c}ois AujolIMB
https://arxiv.org/abs/2402.12021
This https://arxiv.org/abs/2311.00944 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
This https://arxiv.org/abs/2311.00521 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
This https://arxiv.org/abs/2311.00521 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
Analysing heavy-tail properties of Stochastic Gradient Descent by means of Stochastic Recurrence Equations
Ewa Damek, Sebastian Mentemeier
https://arxiv.org/abs/2403.13868
This https://arxiv.org/abs/2306.11246 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2306.05896 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
This https://arxiv.org/abs/2006.07769 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2402.11858 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
This https://arxiv.org/abs/2311.18426 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
This https://arxiv.org/abs/2402.14423 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_qu…
On the Convergence Rate of the Stochastic Gradient Descent (SGD) and application to a modified policy gradient for the Multi Armed Bandit
Stefana Anita, Gabriel Turinici
https://arxiv.org/abs/2402.06388
This https://arxiv.org/abs/2305.17224 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
Faster Convergence for Transformer Fine-tuning with Line Search Methods
Philip Kenneweg, Leonardo Galli, Tristan Kenneweg, Barbara Hammer
https://arxiv.org/abs/2403.18506
This https://arxiv.org/abs/2306.10529 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
Convergence and Complexity Guarantee for Inexact First-order Riemannian Optimization Algorithms
Yuchen Li, Laura Balzano, Deanna Needell, Hanbaek Lyu
https://arxiv.org/abs/2405.03073
Improving Line Search Methods for Large Scale Neural Network Training
Philip Kenneweg, Tristan Kenneweg, Barbara Hammer
https://arxiv.org/abs/2403.18519 ht…
This https://arxiv.org/abs/2302.05797 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
This https://arxiv.org/abs/2306.10529 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
This https://arxiv.org/abs/2305.18502 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
This https://arxiv.org/abs/2403.16829 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…
A Penalty-Based Guardrail Algorithm for Non-Decreasing Optimization with Inequality Constraints
Ksenija Stepanovic, Wendelin B\"ohmer, Mathijs de Weerdt
https://arxiv.org/abs/2405.01984
This https://arxiv.org/abs/2402.11858 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
Generative Adversarial Network with Soft-Dynamic Time Warping and Parallel Reconstruction for Energy Time Series Anomaly Detection
Hardik Prabhu, Jayaraman Valadi, Pandarasamy Arjunan
https://arxiv.org/abs/2402.14384
This https://arxiv.org/abs/2402.12493 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
This https://arxiv.org/abs/2402.11858 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
This https://arxiv.org/abs/2310.07922 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
This https://arxiv.org/abs/2403.14045 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
Online Policy Learning and Inference by Matrix Completion
Congyuan Duan, Jingyang Li, Dong Xia
https://arxiv.org/abs/2404.17398 https://
This https://arxiv.org/abs/2403.14045 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
Convergence and Trade-Offs in Riemannian Gradient Descent and Riemannian Proximal Point
David Mart\'inez-Rubio, Christophe Roux, Sebastian Pokutta
https://arxiv.org/abs/2403.10429
This https://arxiv.org/abs/2402.04691 has been replaced.
link: https://scholar.google.com/scholar?q=a
Gauss-Newton Natural Gradient Descent for Physics-Informed Computational Fluid Dynamics
Anas Jnini, Flavio Vella, Marius Zeinhofer
https://arxiv.org/abs/2402.10680
This https://arxiv.org/abs/2401.00691 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
Dissipative Gradient Descent Ascent Method: A Control Theory Inspired Algorithm for Min-max Optimization
Tianqi Zheng, Nicolas Loizou, Pengcheng You, Enrique Mallada
https://arxiv.org/abs/2403.09090
This https://arxiv.org/abs/2305.12568 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
This https://arxiv.org/abs/2403.05070 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
This https://arxiv.org/abs/2307.12441 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
This https://arxiv.org/abs/2308.10547 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
This https://arxiv.org/abs/2403.02967 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
This https://arxiv.org/abs/2402.04689 has been replaced.
link: https://scholar.google.com/scholar?q=a
Riemannian Optimization and the Hartree-Fock Method
Caio O. da Silva
https://arxiv.org/abs/2403.15024 https://arxiv.org/pdf/2403.1502…