Picture for Quanquan Gu

Quanquan Gu

Corruption-Robust Offline Reinforcement Learning with General Function Approximation

Add code
Oct 23, 2023
Viaarxiv icon

Pure Exploration in Asynchronous Federated Bandits

Add code
Oct 17, 2023
Viaarxiv icon

How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression?

Add code
Oct 12, 2023
Viaarxiv icon

Why Does Sharpness-Aware Minimization Generalize Better Than SGD?

Add code
Oct 11, 2023
Figure 1 for Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
Figure 2 for Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
Figure 3 for Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
Figure 4 for Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
Viaarxiv icon

Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning

Add code
Oct 02, 2023
Figure 1 for Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning
Viaarxiv icon

Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits

Add code
Oct 02, 2023
Viaarxiv icon

Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP

Add code
Oct 02, 2023
Viaarxiv icon

Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning

Add code
Aug 25, 2023
Viaarxiv icon

The Implicit Bias of Batch Normalization in Linear Models and Two-layer Linear Convolutional Neural Networks

Add code
Jul 11, 2023
Viaarxiv icon

Robust Learning with Progressive Data Expansion Against Spurious Correlation

Add code
Jun 08, 2023
Viaarxiv icon