Picture for Simon S. Du

Simon S. Du

Frank

Optimal Extragradient-Based Bilinearly-Coupled Saddle-Point Optimization

Add code
Jun 17, 2022
Figure 1 for Optimal Extragradient-Based Bilinearly-Coupled Saddle-Point Optimization
Viaarxiv icon

Learning in Congestion Games with Bandit Feedback

Add code
Jun 04, 2022
Figure 1 for Learning in Congestion Games with Bandit Feedback
Viaarxiv icon

On Gap-dependent Bounds for Offline Reinforcement Learning

Add code
Jun 01, 2022
Figure 1 for On Gap-dependent Bounds for Offline Reinforcement Learning
Figure 2 for On Gap-dependent Bounds for Offline Reinforcement Learning
Viaarxiv icon

Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus

Add code
Jun 01, 2022
Viaarxiv icon

Provable General Function Class Representation Learning in Multitask Bandits and MDPs

Add code
May 31, 2022
Figure 1 for Provable General Function Class Representation Learning in Multitask Bandits and MDPs
Figure 2 for Provable General Function Class Representation Learning in Multitask Bandits and MDPs
Figure 3 for Provable General Function Class Representation Learning in Multitask Bandits and MDPs
Viaarxiv icon

Variance-Aware Sparse Linear Bandits

Add code
May 26, 2022
Figure 1 for Variance-Aware Sparse Linear Bandits
Viaarxiv icon

Nearly Minimax Algorithms for Linear Bandits with Shared Representation

Add code
Mar 29, 2022
Figure 1 for Nearly Minimax Algorithms for Linear Bandits with Shared Representation
Figure 2 for Nearly Minimax Algorithms for Linear Bandits with Shared Representation
Figure 3 for Nearly Minimax Algorithms for Linear Bandits with Shared Representation
Viaarxiv icon

Horizon-Free Reinforcement Learning in Polynomial Time: the Power of Stationary Policies

Add code
Mar 24, 2022
Figure 1 for Horizon-Free Reinforcement Learning in Polynomial Time: the Power of Stationary Policies
Viaarxiv icon

Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems

Add code
Feb 11, 2022
Figure 1 for Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems
Figure 2 for Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems
Figure 3 for Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems
Figure 4 for Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems
Viaarxiv icon

TransFollower: Long-Sequence Car-Following Trajectory Prediction through Transformer

Add code
Feb 04, 2022
Viaarxiv icon