Picture for Yuanhao Wang

Yuanhao Wang

Towards Principled Superhuman AI for Multiplayer Symmetric Games

Add code
Jun 06, 2024
Figure 1 for Towards Principled Superhuman AI for Multiplayer Symmetric Games
Figure 2 for Towards Principled Superhuman AI for Multiplayer Symmetric Games
Viaarxiv icon

Directional Smoothness and Gradient Methods: Convergence and Adaptivity

Add code
Mar 06, 2024
Figure 1 for Directional Smoothness and Gradient Methods: Convergence and Adaptivity
Figure 2 for Directional Smoothness and Gradient Methods: Convergence and Adaptivity
Figure 3 for Directional Smoothness and Gradient Methods: Convergence and Adaptivity
Figure 4 for Directional Smoothness and Gradient Methods: Convergence and Adaptivity
Viaarxiv icon

Is RLHF More Difficult than Standard RL?

Add code
Jun 25, 2023
Figure 1 for Is RLHF More Difficult than Standard RL?
Viaarxiv icon

Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function Approximation

Add code
Mar 02, 2023
Viaarxiv icon

Learning Rationalizable Equilibria in Multiplayer Games

Add code
Oct 20, 2022
Figure 1 for Learning Rationalizable Equilibria in Multiplayer Games
Viaarxiv icon

Neural Adaptive SCEne Tracing

Add code
Mar 16, 2022
Figure 1 for Neural Adaptive SCEne Tracing
Figure 2 for Neural Adaptive SCEne Tracing
Figure 3 for Neural Adaptive SCEne Tracing
Figure 4 for Neural Adaptive SCEne Tracing
Viaarxiv icon

Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits

Add code
Mar 14, 2022
Figure 1 for Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits
Figure 2 for Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits
Figure 3 for Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits
Viaarxiv icon

NeAT: Neural Adaptive Tomography

Add code
Feb 04, 2022
Figure 1 for NeAT: Neural Adaptive Tomography
Figure 2 for NeAT: Neural Adaptive Tomography
Figure 3 for NeAT: Neural Adaptive Tomography
Figure 4 for NeAT: Neural Adaptive Tomography
Viaarxiv icon

V-Learning -- A Simple, Efficient, Decentralized Algorithm for Multiagent RL

Add code
Oct 27, 2021
Figure 1 for V-Learning -- A Simple, Efficient, Decentralized Algorithm for Multiagent RL
Viaarxiv icon

An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap

Add code
Mar 23, 2021
Figure 1 for An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap
Figure 2 for An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap
Viaarxiv icon