Alert button
Picture for Dongruo Zhou

Dongruo Zhou

Alert button

Batched Neural Bandits

Add code
Bookmark button
Alert button
Feb 25, 2021
Quanquan Gu, Amin Karbasi, Khashayar Khosravi, Vahab Mirrokni, Dongruo Zhou

Figure 1 for Batched Neural Bandits
Figure 2 for Batched Neural Bandits
Figure 3 for Batched Neural Bandits
Figure 4 for Batched Neural Bandits
Viaarxiv icon

Nearly Optimal Regret for Learning Adversarial MDPs with Linear Function Approximation

Add code
Bookmark button
Alert button
Feb 17, 2021
Jiafan He, Dongruo Zhou, Quanquan Gu

Figure 1 for Nearly Optimal Regret for Learning Adversarial MDPs with Linear Function Approximation
Viaarxiv icon

Almost Optimal Algorithms for Two-player Markov Games with Linear Function Approximation

Add code
Bookmark button
Alert button
Feb 15, 2021
Zixiang Chen, Dongruo Zhou, Quanquan Gu

Viaarxiv icon

Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation

Add code
Bookmark button
Alert button
Feb 15, 2021
Yue Wu, Dongruo Zhou, Quanquan Gu

Figure 1 for Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
Viaarxiv icon

Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes

Add code
Bookmark button
Alert button
Jan 07, 2021
Dongruo Zhou, Quanquan Gu, Csaba Szepesvari

Figure 1 for Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes
Viaarxiv icon

Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints

Add code
Bookmark button
Alert button
Jan 06, 2021
Tianhao Wang, Dongruo Zhou, Quanquan Gu

Figure 1 for Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints
Figure 2 for Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints
Figure 3 for Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints
Viaarxiv icon

Logarithmic Regret for Reinforcement Learning with Linear Function Approximation

Add code
Bookmark button
Alert button
Nov 23, 2020
Jiafan He, Dongruo Zhou, Quanquan Gu

Viaarxiv icon

Provable Multi-Objective Reinforcement Learning with Generative Models

Add code
Bookmark button
Alert button
Nov 19, 2020
Dongruo Zhou, Jiahao Chen, Quanquan Gu

Viaarxiv icon

Neural Thompson Sampling

Add code
Bookmark button
Alert button
Oct 02, 2020
Weitong Zhang, Dongruo Zhou, Lihong Li, Quanquan Gu

Figure 1 for Neural Thompson Sampling
Figure 2 for Neural Thompson Sampling
Figure 3 for Neural Thompson Sampling
Figure 4 for Neural Thompson Sampling
Viaarxiv icon