Alert button
Picture for Simon S. Du

Simon S. Du

Alert button

Frank

Optimal Extragradient-Based Bilinearly-Coupled Saddle-Point Optimization

Jun 17, 2022
Simon S. Du, Gauthier Gidel, Michael I. Jordan, Chris Junchi Li

Figure 1 for Optimal Extragradient-Based Bilinearly-Coupled Saddle-Point Optimization
Viaarxiv icon

Learning in Congestion Games with Bandit Feedback

Jun 04, 2022
Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon S. Du

Figure 1 for Learning in Congestion Games with Bandit Feedback
Viaarxiv icon

On Gap-dependent Bounds for Offline Reinforcement Learning

Jun 01, 2022
Xinqi Wang, Qiwen Cui, Simon S. Du

Figure 1 for On Gap-dependent Bounds for Offline Reinforcement Learning
Figure 2 for On Gap-dependent Bounds for Offline Reinforcement Learning
Viaarxiv icon

Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus

Jun 01, 2022
Qiwen Cui, Simon S. Du

Viaarxiv icon

Provable General Function Class Representation Learning in Multitask Bandits and MDPs

May 31, 2022
Rui Lu, Andrew Zhao, Simon S. Du, Gao Huang

Figure 1 for Provable General Function Class Representation Learning in Multitask Bandits and MDPs
Figure 2 for Provable General Function Class Representation Learning in Multitask Bandits and MDPs
Figure 3 for Provable General Function Class Representation Learning in Multitask Bandits and MDPs
Viaarxiv icon

Variance-Aware Sparse Linear Bandits

May 26, 2022
Yan Dai, Ruosong Wang, Simon S. Du

Figure 1 for Variance-Aware Sparse Linear Bandits
Viaarxiv icon

Nearly Minimax Algorithms for Linear Bandits with Shared Representation

Mar 29, 2022
Jiaqi Yang, Qi Lei, Jason D. Lee, Simon S. Du

Figure 1 for Nearly Minimax Algorithms for Linear Bandits with Shared Representation
Figure 2 for Nearly Minimax Algorithms for Linear Bandits with Shared Representation
Figure 3 for Nearly Minimax Algorithms for Linear Bandits with Shared Representation
Viaarxiv icon

Horizon-Free Reinforcement Learning in Polynomial Time: the Power of Stationary Policies

Mar 24, 2022
Zihan Zhang, Xiangyang Ji, Simon S. Du

Figure 1 for Horizon-Free Reinforcement Learning in Polynomial Time: the Power of Stationary Policies
Viaarxiv icon

Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems

Feb 11, 2022
Runlong Zhou, Yuandong Tian, Yi Wu, Simon S. Du

Figure 1 for Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems
Figure 2 for Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems
Figure 3 for Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems
Figure 4 for Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems
Viaarxiv icon

TransFollower: Long-Sequence Car-Following Trajectory Prediction through Transformer

Feb 04, 2022
Meixin Zhu, Simon S. Du, Xuesong Wang, Hao, Yang, Ziyuan Pu, Yinhai Wang

Viaarxiv icon