Natural Gradient Descent


Spike-timing-dependent Hebbian learning as noisy gradient descent

May 15, 2025

Negative Stepsizes Make Gradient-Descent-Ascent Converge

May 02, 2025

Online Functional Principal Component Analysis on a Multidimensional Domain

May 04, 2025

Convergence Properties of Natural Gradient Descent for Minimizing KL Divergence

Apr 27, 2025

NANO-SLAM: Natural Gradient Gaussian Approximation for Vehicle SLAM

Apr 27, 2025

How Transformers Learn Regular Language Recognition: A Theoretical Study on Training Dynamics and Implicit Bias

May 02, 2025

A mean teacher algorithm for unlearning of language models

Apr 18, 2025

Exact Learning Dynamics of In-Context Learning in Linear Transformers and Its Application to Non-Linear Transformers

Apr 17, 2025

Provable Failure of Language Models in Learning Majority Boolean Logic via Gradient Descent

Apr 07, 2025

Deep Sturm–Liouville: From Sample-Based to 1D Regularization with Learnable Orthogonal Basis Functions

Apr 09, 2025