Picture for Chen-Yu Wei

Chen-Yu Wei

Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data

Add code
Mar 25, 2024
Figure 1 for Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data
Figure 2 for Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data
Figure 3 for Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data
Figure 4 for Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data
Viaarxiv icon

Tractable Local Equilibria in Non-Concave Games

Add code
Mar 13, 2024
Figure 1 for Tractable Local Equilibria in Non-Concave Games
Figure 2 for Tractable Local Equilibria in Non-Concave Games
Figure 3 for Tractable Local Equilibria in Non-Concave Games
Figure 4 for Tractable Local Equilibria in Non-Concave Games
Viaarxiv icon

Near-Optimal Policy Optimization for Correlated Equilibrium in General-Sum Markov Games

Add code
Jan 26, 2024
Viaarxiv icon

Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback

Add code
Oct 17, 2023
Viaarxiv icon

Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits

Add code
Sep 02, 2023
Figure 1 for Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits
Viaarxiv icon

Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs

Add code
Jun 20, 2023
Figure 1 for Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Figure 2 for Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Figure 3 for Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Figure 4 for Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Viaarxiv icon

No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions

Add code
May 30, 2023
Viaarxiv icon

First- and Second-Order Bounds for Adversarial Linear Contextual Bandits

Add code
May 01, 2023
Viaarxiv icon

Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games

Add code
Mar 05, 2023
Figure 1 for Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games
Viaarxiv icon

A Blackbox Approach to Best of Both Worlds in Bandits and Beyond

Add code
Feb 20, 2023
Figure 1 for A Blackbox Approach to Best of Both Worlds in Bandits and Beyond
Viaarxiv icon