Alert button
Picture for Quanquan Gu

Quanquan Gu

Alert button

Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency

Feb 21, 2023
Heyang Zhao, Jiafan He, Dongruo Zhou, Tong Zhang, Quanquan Gu

Figure 1 for Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency
Figure 2 for Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency
Viaarxiv icon

Structure-informed Language Models Are Protein Designers

Feb 09, 2023
Zaixiang Zheng, Yifan Deng, Dongyu Xue, Yi Zhou, Fei YE, Quanquan Gu

Figure 1 for Structure-informed Language Models Are Protein Designers
Figure 2 for Structure-informed Language Models Are Protein Designers
Figure 3 for Structure-informed Language Models Are Protein Designers
Figure 4 for Structure-informed Language Models Are Protein Designers
Viaarxiv icon

Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes

Dec 12, 2022
Jiafan He, Heyang Zhao, Dongruo Zhou, Quanquan Gu

Figure 1 for Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes
Viaarxiv icon

Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes

Dec 12, 2022
Chenlu Ye, Wei Xiong, Quanquan Gu, Tong Zhang

Viaarxiv icon

A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning

Sep 30, 2022
Zixiang Chen, Chris Junchi Li, Angela Yuan, Quanquan Gu, Michael I. Jordan

Figure 1 for A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning
Figure 2 for A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning
Viaarxiv icon

Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium

Aug 10, 2022
Chris Junchi Li, Dongruo Zhou, Quanquan Gu, Michael I. Jordan

Viaarxiv icon

Towards Understanding Mixture of Experts in Deep Learning

Aug 04, 2022
Zixiang Chen, Yihe Deng, Yue Wu, Quanquan Gu, Yuanzhi Li

Figure 1 for Towards Understanding Mixture of Experts in Deep Learning
Figure 2 for Towards Understanding Mixture of Experts in Deep Learning
Figure 3 for Towards Understanding Mixture of Experts in Deep Learning
Figure 4 for Towards Understanding Mixture of Experts in Deep Learning
Viaarxiv icon

The Power and Limitation of Pretraining-Finetuning for Linear Regression under Covariate Shift

Aug 03, 2022
Jingfeng Wu, Difan Zou, Vladimir Braverman, Quanquan Gu, Sham M. Kakade

Figure 1 for The Power and Limitation of Pretraining-Finetuning for Linear Regression under Covariate Shift
Figure 2 for The Power and Limitation of Pretraining-Finetuning for Linear Regression under Covariate Shift
Viaarxiv icon

A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits

Jul 07, 2022
Jiafan He, Tianhao Wang, Yifei Min, Quanquan Gu

Figure 1 for A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits
Figure 2 for A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits
Figure 3 for A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits
Viaarxiv icon