Picture for Quanquan Gu

Quanquan Gu

Batched Neural Bandits

Add code
Feb 25, 2021
Figure 1 for Batched Neural Bandits
Figure 2 for Batched Neural Bandits
Figure 3 for Batched Neural Bandits
Figure 4 for Batched Neural Bandits
Viaarxiv icon

Nearly Optimal Regret for Learning Adversarial MDPs with Linear Function Approximation

Add code
Feb 17, 2021
Figure 1 for Nearly Optimal Regret for Learning Adversarial MDPs with Linear Function Approximation
Viaarxiv icon

Almost Optimal Algorithms for Two-player Markov Games with Linear Function Approximation

Add code
Feb 15, 2021
Viaarxiv icon

Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation

Add code
Feb 15, 2021
Figure 1 for Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
Viaarxiv icon

Provable Generalization of SGD-trained Neural Networks of Any Width in the Presence of Adversarial Label Noise

Add code
Jan 14, 2021
Figure 1 for Provable Generalization of SGD-trained Neural Networks of Any Width in the Presence of Adversarial Label Noise
Figure 2 for Provable Generalization of SGD-trained Neural Networks of Any Width in the Presence of Adversarial Label Noise
Figure 3 for Provable Generalization of SGD-trained Neural Networks of Any Width in the Presence of Adversarial Label Noise
Figure 4 for Provable Generalization of SGD-trained Neural Networks of Any Width in the Presence of Adversarial Label Noise
Viaarxiv icon

Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes

Add code
Jan 07, 2021
Figure 1 for Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes
Viaarxiv icon

Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints

Add code
Jan 06, 2021
Viaarxiv icon

Neural Contextual Bandits with Deep Representation and Shallow Exploration

Add code
Dec 03, 2020
Figure 1 for Neural Contextual Bandits with Deep Representation and Shallow Exploration
Figure 2 for Neural Contextual Bandits with Deep Representation and Shallow Exploration
Viaarxiv icon

Logarithmic Regret for Reinforcement Learning with Linear Function Approximation

Add code
Nov 23, 2020
Viaarxiv icon

Provable Multi-Objective Reinforcement Learning with Generative Models

Add code
Nov 19, 2020
Viaarxiv icon