Picture for Qiwen Cui

Qiwen Cui

$\mathbf{(N,K)}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model

Mar 11, 2024
Figure 1 for $\mathbf{(N,K)}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Figure 2 for $\mathbf{(N,K)}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Figure 3 for $\mathbf{(N,K)}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Figure 4 for $\mathbf{(N,K)}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Viaarxiv icon

Learning Optimal Tax Design in Nonatomic Congestion Games

Feb 12, 2024
Viaarxiv icon

Refined Sample Complexity for Markov Games with Independent Linear Function Approximation

Feb 11, 2024
Viaarxiv icon

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning

Add code
Oct 30, 2023
Viaarxiv icon

A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning

Jun 12, 2023
Figure 1 for A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning
Figure 2 for A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning
Figure 3 for A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning
Figure 4 for A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning
Viaarxiv icon

Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation

Mar 02, 2023
Figure 1 for Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation
Figure 2 for Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation
Viaarxiv icon

Offline congestion games: How feedback type affects data coverage requirement

Oct 24, 2022
Figure 1 for Offline congestion games: How feedback type affects data coverage requirement
Figure 2 for Offline congestion games: How feedback type affects data coverage requirement
Figure 3 for Offline congestion games: How feedback type affects data coverage requirement
Figure 4 for Offline congestion games: How feedback type affects data coverage requirement
Viaarxiv icon

Learning in Congestion Games with Bandit Feedback

Jun 04, 2022
Figure 1 for Learning in Congestion Games with Bandit Feedback
Viaarxiv icon

On Gap-dependent Bounds for Offline Reinforcement Learning

Jun 01, 2022
Figure 1 for On Gap-dependent Bounds for Offline Reinforcement Learning
Figure 2 for On Gap-dependent Bounds for Offline Reinforcement Learning
Viaarxiv icon

Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus

Jun 01, 2022
Viaarxiv icon