Picture for Quanquan Gu

Quanquan Gu

Borda Regret Minimization for Generalized Linear Dueling Bandits

Add code
Mar 15, 2023
Viaarxiv icon

The Benefits of Mixup for Feature Learning

Add code
Mar 15, 2023
Viaarxiv icon

Benign Overfitting for Two-layer ReLU Networks

Add code
Mar 07, 2023
Viaarxiv icon

Learning High-Dimensional Single-Neuron ReLU Networks with Finite Samples

Add code
Mar 03, 2023
Viaarxiv icon

Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency

Add code
Feb 21, 2023
Figure 1 for Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency
Figure 2 for Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency
Viaarxiv icon

Structure-informed Language Models Are Protein Designers

Add code
Feb 09, 2023
Viaarxiv icon

Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes

Add code
Dec 12, 2022
Viaarxiv icon

Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes

Add code
Dec 12, 2022
Figure 1 for Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes
Viaarxiv icon

A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning

Add code
Sep 30, 2022
Figure 1 for A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning
Figure 2 for A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning
Viaarxiv icon

Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium

Add code
Aug 10, 2022
Viaarxiv icon