Alert button
Picture for Dongruo Zhou

Dongruo Zhou

Alert button

Learning Contextual Bandits Through Perturbed Rewards

Add code
Bookmark button
Alert button
Jan 24, 2022
Yiling Jia, Weitong Zhang, Dongruo Zhou, Quanquan Gu, Hongning Wang

Figure 1 for Learning Contextual Bandits Through Perturbed Rewards
Figure 2 for Learning Contextual Bandits Through Perturbed Rewards
Figure 3 for Learning Contextual Bandits Through Perturbed Rewards
Figure 4 for Learning Contextual Bandits Through Perturbed Rewards
Viaarxiv icon

Faster Perturbed Stochastic Gradient Methods for Finding Local Minima

Add code
Bookmark button
Alert button
Oct 25, 2021
Zixiang Chen, Dongruo Zhou, Quanquan Gu

Figure 1 for Faster Perturbed Stochastic Gradient Methods for Finding Local Minima
Figure 2 for Faster Perturbed Stochastic Gradient Methods for Finding Local Minima
Viaarxiv icon

Linear Contextual Bandits with Adversarial Corruptions

Add code
Bookmark button
Alert button
Oct 25, 2021
Heyang Zhao, Dongruo Zhou, Quanquan Gu

Figure 1 for Linear Contextual Bandits with Adversarial Corruptions
Viaarxiv icon

Iterative Teacher-Aware Learning

Add code
Bookmark button
Alert button
Oct 17, 2021
Luyao Yuan, Dongruo Zhou, Junhong Shen, Jingdong Gao, Jeffrey L. Chen, Quanquan Gu, Ying Nian Wu, Song-Chun Zhu

Figure 1 for Iterative Teacher-Aware Learning
Figure 2 for Iterative Teacher-Aware Learning
Figure 3 for Iterative Teacher-Aware Learning
Figure 4 for Iterative Teacher-Aware Learning
Viaarxiv icon

Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation

Add code
Bookmark button
Alert button
Oct 12, 2021
Weitong Zhang, Dongruo Zhou, Quanquan Gu

Figure 1 for Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation
Figure 2 for Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation
Viaarxiv icon

Pure Exploration in Kernel and Neural Bandits

Add code
Bookmark button
Alert button
Jun 22, 2021
Yinglun Zhu, Dongruo Zhou, Ruoxi Jiang, Quanquan Gu, Rebecca Willett, Robert Nowak

Figure 1 for Pure Exploration in Kernel and Neural Bandits
Figure 2 for Pure Exploration in Kernel and Neural Bandits
Figure 3 for Pure Exploration in Kernel and Neural Bandits
Viaarxiv icon

Variance-Aware Off-Policy Evaluation with Linear Function Approximation

Add code
Bookmark button
Alert button
Jun 22, 2021
Yifei Min, Tianhao Wang, Dongruo Zhou, Quanquan Gu

Figure 1 for Variance-Aware Off-Policy Evaluation with Linear Function Approximation
Figure 2 for Variance-Aware Off-Policy Evaluation with Linear Function Approximation
Figure 3 for Variance-Aware Off-Policy Evaluation with Linear Function Approximation
Figure 4 for Variance-Aware Off-Policy Evaluation with Linear Function Approximation
Viaarxiv icon

Provably Efficient Representation Learning in Low-rank Markov Decision Processes

Add code
Bookmark button
Alert button
Jun 22, 2021
Weitong Zhang, Jiafan He, Dongruo Zhou, Amy Zhang, Quanquan Gu

Figure 1 for Provably Efficient Representation Learning in Low-rank Markov Decision Processes
Figure 2 for Provably Efficient Representation Learning in Low-rank Markov Decision Processes
Figure 3 for Provably Efficient Representation Learning in Low-rank Markov Decision Processes
Figure 4 for Provably Efficient Representation Learning in Low-rank Markov Decision Processes
Viaarxiv icon

Uniform-PAC Bounds for Reinforcement Learning with Linear Function Approximation

Add code
Bookmark button
Alert button
Jun 22, 2021
Jiafan He, Dongruo Zhou, Quanquan Gu

Viaarxiv icon