Picture for Ruoyu Sun

Ruoyu Sun

Xi'an Jiaotong-Liverpool University, School of Mathematics and Physics, Department of Financial and Actuarial Mathematics

Adam-mini: Use Fewer Learning Rates To Gain More

Add code
Jun 26, 2024
Viaarxiv icon

Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization

Add code
Jun 08, 2024
Viaarxiv icon

PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear Programming

Add code
Jun 04, 2024
Viaarxiv icon

On the Convergence of Adam under Non-uniform Smoothness: Separability from SGDM and Beyond

Add code
Mar 22, 2024
Viaarxiv icon

Why Transformers Need Adam: A Hessian Perspective

Add code
Feb 26, 2024
Figure 1 for Why Transformers Need Adam: A Hessian Perspective
Figure 2 for Why Transformers Need Adam: A Hessian Perspective
Figure 3 for Why Transformers Need Adam: A Hessian Perspective
Figure 4 for Why Transformers Need Adam: A Hessian Perspective
Viaarxiv icon

Combining Transformer based Deep Reinforcement Learning with Black-Litterman Model for Portfolio Optimization

Add code
Feb 23, 2024
Figure 1 for Combining Transformer based Deep Reinforcement Learning with Black-Litterman Model for Portfolio Optimization
Figure 2 for Combining Transformer based Deep Reinforcement Learning with Black-Litterman Model for Portfolio Optimization
Figure 3 for Combining Transformer based Deep Reinforcement Learning with Black-Litterman Model for Portfolio Optimization
Figure 4 for Combining Transformer based Deep Reinforcement Learning with Black-Litterman Model for Portfolio Optimization
Viaarxiv icon

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

Add code
Oct 17, 2023
Viaarxiv icon

LEMON: Lossless model expansion

Add code
Oct 12, 2023
Figure 1 for LEMON: Lossless model expansion
Figure 2 for LEMON: Lossless model expansion
Figure 3 for LEMON: Lossless model expansion
Figure 4 for LEMON: Lossless model expansion
Viaarxiv icon

PAC-Bayesian Spectrally-Normalized Bounds for Adversarially Robust Generalization

Add code
Oct 09, 2023
Figure 1 for PAC-Bayesian Spectrally-Normalized Bounds for Adversarially Robust Generalization
Figure 2 for PAC-Bayesian Spectrally-Normalized Bounds for Adversarially Robust Generalization
Figure 3 for PAC-Bayesian Spectrally-Normalized Bounds for Adversarially Robust Generalization
Viaarxiv icon

How Graph Neural Networks Learn: Lessons from Training Dynamics in Function Space

Add code
Oct 08, 2023
Figure 1 for How Graph Neural Networks Learn: Lessons from Training Dynamics in Function Space
Figure 2 for How Graph Neural Networks Learn: Lessons from Training Dynamics in Function Space
Figure 3 for How Graph Neural Networks Learn: Lessons from Training Dynamics in Function Space
Figure 4 for How Graph Neural Networks Learn: Lessons from Training Dynamics in Function Space
Viaarxiv icon