Qiwen Cui

$\mathbf{(N,K)}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model

Mar 11, 2024
Yufeng Zhang, Liyu Chen, Boyi Liu, Yingxiang Yang, Qiwen Cui, Yunzhe Tao, Hongxia Yang

Learning Optimal Tax Design in Nonatomic Congestion Games

Feb 12, 2024
Qiwen Cui, Maryam Fazel, Simon S. Du

Refined Sample Complexity for Markov Games with Independent Linear Function Approximation

Feb 11, 2024
Yan Dai, Qiwen Cui, Simon S. Du

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning

Oct 30, 2023
Zhaoyi Zhou, Chuning Zhu, Runlong Zhou, Qiwen Cui, Abhishek Gupta, Simon Shaolei Du

A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning

Jun 12, 2023
Haozhe Jiang, Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon S. Du

Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation

Mar 02, 2023
Qiwen Cui, Kaiqing Zhang, Simon S. Du

Offline congestion games: How feedback type affects data coverage requirement

Oct 24, 2022
Haozhe Jiang, Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon S. Du

Learning in Congestion Games with Bandit Feedback

Jun 04, 2022
Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon S. Du

On Gap-dependent Bounds for Offline Reinforcement Learning

Jun 01, 2022
Xinqi Wang, Qiwen Cui, Simon S. Du
