Alert button
Picture for Simon S. Du

Simon S. Du

Alert button

Horizon-Free Regret for Linear Markov Decision Processes

Add code
Bookmark button
Alert button
Mar 15, 2024
Zihan Zhang, Jason D. Lee, Yuxin Chen, Simon S. Du

Viaarxiv icon

Transferable Reinforcement Learning via Generalized Occupancy Models

Add code
Bookmark button
Alert button
Mar 10, 2024
Chuning Zhu, Xinqi Wang, Tyler Han, Simon S. Du, Abhishek Gupta

Figure 1 for Transferable Reinforcement Learning via Generalized Occupancy Models
Figure 2 for Transferable Reinforcement Learning via Generalized Occupancy Models
Figure 3 for Transferable Reinforcement Learning via Generalized Occupancy Models
Figure 4 for Transferable Reinforcement Learning via Generalized Occupancy Models
Viaarxiv icon

Reflect-RL: Two-Player Online RL Fine-Tuning for LMs

Add code
Bookmark button
Alert button
Feb 20, 2024
Runlong Zhou, Simon S. Du, Beibin Li

Viaarxiv icon

Learning Optimal Tax Design in Nonatomic Congestion Games

Add code
Bookmark button
Alert button
Feb 12, 2024
Qiwen Cui, Maryam Fazel, Simon S. Du

Viaarxiv icon

Refined Sample Complexity for Markov Games with Independent Linear Function Approximation

Add code
Bookmark button
Alert button
Feb 11, 2024
Yan Dai, Qiwen Cui, Simon S. Du

Viaarxiv icon

An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models

Add code
Bookmark button
Alert button
Jan 12, 2024
Gantavya Bhatt, Yifang Chen, Arnav M. Das, Jifan Zhang, Sang T. Truong, Stephen Mussmann, Yinglun Zhu, Jeffrey Bilmes, Simon S. Du, Kevin Jamieson, Jordan T. Ash, Robert D. Nowak

Viaarxiv icon

Optimal Multi-Distribution Learning

Add code
Bookmark button
Alert button
Dec 08, 2023
Zihan Zhang, Wenhao Zhan, Yuxin Chen, Simon S. Du, Jason D. Lee

Viaarxiv icon

Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking

Add code
Bookmark button
Alert button
Nov 30, 2023
Kaifeng Lyu, Jikai Jin, Zhiyuan Li, Simon S. Du, Jason D. Lee, Wei Hu

Viaarxiv icon

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 07, 2023
Ruizhe Shi, Yuyao Liu, Yanjie Ze, Simon S. Du, Huazhe Xu

Viaarxiv icon