Ruoyu Sun

Xi'an Jiaotong-Liverpool University, School of Mathematics and Physics, Department of Financial and Actuarial Mathematics

MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning

Jul 31, 2024
Via arXiv

Adam-mini: Use Fewer Learning Rates To Gain More

Jun 26, 2024
Via arXiv

Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization

Jun 08, 2024
Via arXiv

PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear Programming

Jun 04, 2024
Via arXiv

On the Convergence of Adam under Non-uniform Smoothness: Separability from SGDM and Beyond

Mar 22, 2024
Via arXiv

Why Transformers Need Adam: A Hessian Perspective

Feb 26, 2024
Via arXiv

Combining Transformer based Deep Reinforcement Learning with Black-Litterman Model for Portfolio Optimization

Feb 23, 2024
Via arXiv

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

Oct 17, 2023
Via arXiv

LEMON: Lossless model expansion

Oct 12, 2023
Via arXiv

PAC-Bayesian Spectrally-Normalized Bounds for Adversarially Robust Generalization

Oct 09, 2023
Via arXiv