Picture for Jason D. Lee

Jason D. Lee

Reward Collapse in Aligning Large Language Models

Add code
May 28, 2023
Viaarxiv icon

Fine-Tuning Language Models with Just Forward Passes

Add code
May 27, 2023
Figure 1 for Fine-Tuning Language Models with Just Forward Passes
Figure 2 for Fine-Tuning Language Models with Just Forward Passes
Figure 3 for Fine-Tuning Language Models with Just Forward Passes
Figure 4 for Fine-Tuning Language Models with Just Forward Passes
Viaarxiv icon

Provable Offline Reinforcement Learning with Human Feedback

Add code
May 24, 2023
Viaarxiv icon

Implicit Bias of Gradient Descent for Logistic Regression at the Edge of Stability

Add code
May 19, 2023
Viaarxiv icon

Smoothing the Landscape Boosts the Signal for SGD: Optimal Sample Complexity for Learning Single Index Models

Add code
May 18, 2023
Figure 1 for Smoothing the Landscape Boosts the Signal for SGD: Optimal Sample Complexity for Learning Single Index Models
Figure 2 for Smoothing the Landscape Boosts the Signal for SGD: Optimal Sample Complexity for Learning Single Index Models
Viaarxiv icon

Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning

Add code
May 17, 2023
Viaarxiv icon

Provable Guarantees for Nonlinear Feature Learning in Three-Layer Neural Networks

Add code
May 11, 2023
Viaarxiv icon

Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning

Add code
May 08, 2023
Figure 1 for Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
Viaarxiv icon

Can We Find Nash Equilibria at a Linear Rate in Markov Games?

Add code
Mar 03, 2023
Viaarxiv icon

Provably Efficient Reinforcement Learning via Surprise Bound

Add code
Feb 22, 2023
Viaarxiv icon