Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency


Feb 21, 2023
Heyang Zhao, Jiafan He, Dongruo Zhou, Tong Zhang, Quanquan Gu

Add code

* 43 pages, 2 tables 

   Access Paper or Ask Questions

Structure-informed Language Models Are Protein Designers


Feb 09, 2023
Zaixiang Zheng, Yifan Deng, Dongyu Xue, Yi Zhou, Fei YE, Quanquan Gu

Add code

* 10 pages; ver.2 update: added image credit to RFdiffusion (Watson et al., 2022) in Fig. 1F, and fixed some small presentation errors 

   Access Paper or Ask Questions

Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes


Dec 12, 2022
Jiafan He, Heyang Zhao, Dongruo Zhou, Quanquan Gu

Add code

* 44 pages, 1 table 

   Access Paper or Ask Questions

Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes


Dec 12, 2022
Chenlu Ye, Wei Xiong, Quanquan Gu, Tong Zhang

Add code

* We study the corruption-robust MDPs and contextual bandits with general function approximation 

   Access Paper or Ask Questions

A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning


Sep 30, 2022
Zixiang Chen, Chris Junchi Li, Angela Yuan, Quanquan Gu, Michael I. Jordan

Add code


   Access Paper or Ask Questions

Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium


Aug 10, 2022
Chris Junchi Li, Dongruo Zhou, Quanquan Gu, Michael I. Jordan

Add code

* 42 pages 

   Access Paper or Ask Questions

Towards Understanding Mixture of Experts in Deep Learning


Aug 04, 2022
Zixiang Chen, Yihe Deng, Yue Wu, Quanquan Gu, Yuanzhi Li

Add code

* 53 pages, 8 figures, 11 tables 

   Access Paper or Ask Questions

The Power and Limitation of Pretraining-Finetuning for Linear Regression under Covariate Shift


Aug 03, 2022
Jingfeng Wu, Difan Zou, Vladimir Braverman, Quanquan Gu, Sham M. Kakade

Add code

* 32 pages, 1 figure, 1 table 

   Access Paper or Ask Questions

A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits


Jul 07, 2022
Jiafan He, Tianhao Wang, Yifei Min, Quanquan Gu

Add code

* 25 pages, 1 figure, 2 tables 

   Access Paper or Ask Questions

<<
1
2
3
4
5
6
7
8
>>