Alert button
Picture for Simon S. Du

Simon S. Du

Alert button

Fine-Grained Gap-Dependent Bounds for Tabular MDPs via Adaptive Multi-Step Bootstrap

Add code
Bookmark button
Alert button
Feb 09, 2021
Haike Xu, Tengyu Ma, Simon S. Du

Figure 1 for Fine-Grained Gap-Dependent Bounds for Tabular MDPs via Adaptive Multi-Step Bootstrap
Figure 2 for Fine-Grained Gap-Dependent Bounds for Tabular MDPs via Adaptive Multi-Step Bootstrap
Viaarxiv icon

Variance-Aware Confidence Set: Variance-Dependent Bound for Linear Bandits and Horizon-Free Bound for Linear Mixture MDP

Add code
Bookmark button
Alert button
Jan 29, 2021
Zihan Zhang, Jiaqi Yang, Xiangyang Ji, Simon S. Du

Viaarxiv icon

A Provably Efficient Algorithm for Linear Markov Decision Process with Low Switching Cost

Add code
Bookmark button
Alert button
Jan 02, 2021
Minbo Gao, Tianle Xie, Simon S. Du, Lin F. Yang

Figure 1 for A Provably Efficient Algorithm for Linear Markov Decision Process with Low Switching Cost
Figure 2 for A Provably Efficient Algorithm for Linear Markov Decision Process with Low Switching Cost
Viaarxiv icon

Nearly Minimax Optimal Reward-free Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 23, 2020
Zihan Zhang, Simon S. Du, Xiangyang Ji

Figure 1 for Nearly Minimax Optimal Reward-free Reinforcement Learning
Viaarxiv icon

Provable Benefits of Representation Learning in Linear Bandits

Add code
Bookmark button
Alert button
Oct 13, 2020
Jiaqi Yang, Wei Hu, Jason D. Lee, Simon S. Du

Figure 1 for Provable Benefits of Representation Learning in Linear Bandits
Figure 2 for Provable Benefits of Representation Learning in Linear Bandits
Figure 3 for Provable Benefits of Representation Learning in Linear Bandits
Figure 4 for Provable Benefits of Representation Learning in Linear Bandits
Viaarxiv icon

How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks

Add code
Bookmark button
Alert button
Oct 01, 2020
Keyulu Xu, Jingling Li, Mozhi Zhang, Simon S. Du, Ken-ichi Kawarabayashi, Stefanie Jegelka

Figure 1 for How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks
Figure 2 for How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks
Figure 3 for How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks
Figure 4 for How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks
Viaarxiv icon

Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon

Add code
Bookmark button
Alert button
Sep 28, 2020
Zihan Zhang, Xiangyang Ji, Simon S. Du

Figure 1 for Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon
Viaarxiv icon

On Reward-Free Reinforcement Learning with Linear Function Approximation

Add code
Bookmark button
Alert button
Jun 19, 2020
Ruosong Wang, Simon S. Du, Lin F. Yang, Ruslan Salakhutdinov

Figure 1 for On Reward-Free Reinforcement Learning with Linear Function Approximation
Viaarxiv icon

$Q$-learning with Logarithmic Regret

Add code
Bookmark button
Alert button
Jun 16, 2020
Kunhe Yang, Lin F. Yang, Simon S. Du

Viaarxiv icon