Picture for Zaiwen Wen

Zaiwen Wen

A Memory Efficient Randomized Subspace Optimization Method for Training Large Language Models

Add code
Feb 11, 2025
Viaarxiv icon

Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures

Add code
Oct 10, 2024
Figure 1 for Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures
Figure 2 for Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures
Figure 3 for Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures
Figure 4 for Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures
Viaarxiv icon

ODE-based Learning to Optimize

Add code
Jun 04, 2024
Figure 1 for ODE-based Learning to Optimize
Figure 2 for ODE-based Learning to Optimize
Figure 3 for ODE-based Learning to Optimize
Figure 4 for ODE-based Learning to Optimize
Viaarxiv icon

An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks

Add code
May 07, 2024
Figure 1 for An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks
Figure 2 for An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks
Viaarxiv icon

Monte Carlo Policy Gradient Method for Binary Optimization

Add code
Jul 03, 2023
Figure 1 for Monte Carlo Policy Gradient Method for Binary Optimization
Figure 2 for Monte Carlo Policy Gradient Method for Binary Optimization
Figure 3 for Monte Carlo Policy Gradient Method for Binary Optimization
Figure 4 for Monte Carlo Policy Gradient Method for Binary Optimization
Viaarxiv icon

Provable Convergence of Variational Monte Carlo Methods

Add code
Mar 19, 2023
Figure 1 for Provable Convergence of Variational Monte Carlo Methods
Viaarxiv icon

Provably Efficient Gauss-Newton Temporal Difference Learning Method with Function Approximation

Add code
Feb 25, 2023
Figure 1 for Provably Efficient Gauss-Newton Temporal Difference Learning Method with Function Approximation
Figure 2 for Provably Efficient Gauss-Newton Temporal Difference Learning Method with Function Approximation
Figure 3 for Provably Efficient Gauss-Newton Temporal Difference Learning Method with Function Approximation
Figure 4 for Provably Efficient Gauss-Newton Temporal Difference Learning Method with Function Approximation
Viaarxiv icon

Riemannian Natural Gradient Methods

Add code
Jul 15, 2022
Figure 1 for Riemannian Natural Gradient Methods
Figure 2 for Riemannian Natural Gradient Methods
Figure 3 for Riemannian Natural Gradient Methods
Viaarxiv icon

A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP

Add code
Jul 13, 2022
Figure 1 for A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP
Viaarxiv icon

NG+ : A Multi-Step Matrix-Product Natural Gradient Method for Deep Learning

Add code
Jun 14, 2021
Figure 1 for NG+ : A Multi-Step Matrix-Product Natural Gradient Method for Deep Learning
Figure 2 for NG+ : A Multi-Step Matrix-Product Natural Gradient Method for Deep Learning
Figure 3 for NG+ : A Multi-Step Matrix-Product Natural Gradient Method for Deep Learning
Figure 4 for NG+ : A Multi-Step Matrix-Product Natural Gradient Method for Deep Learning
Viaarxiv icon