Picture for Guannan Qu

Guannan Qu

A Theoretical Study of (Hyper) Self-Attention through the Lens of Interactions: Representation, Training, Generalization

Add code
Jun 06, 2025
Viaarxiv icon

Thinking Beyond Visibility: A Near-Optimal Policy Framework for Locally Interdependent Multi-Agent MDPs

Add code
Jun 04, 2025
Viaarxiv icon

Natural Policy Gradient for Average Reward Non-Stationary RL

Add code
Apr 23, 2025
Figure 1 for Natural Policy Gradient for Average Reward Non-Stationary RL
Figure 2 for Natural Policy Gradient for Average Reward Non-Stationary RL
Figure 3 for Natural Policy Gradient for Average Reward Non-Stationary RL
Figure 4 for Natural Policy Gradient for Average Reward Non-Stationary RL
Viaarxiv icon

Whole-Body Model-Predictive Control of Legged Robots with MuJoCo

Add code
Mar 06, 2025
Viaarxiv icon

ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills

Add code
Feb 03, 2025
Viaarxiv icon

Mean-Field Sampling for Cooperative Multi-Agent Reinforcement Learning

Add code
Dec 01, 2024
Viaarxiv icon

Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information

Add code
Sep 13, 2024
Figure 1 for Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information
Figure 2 for Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information
Figure 3 for Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information
Viaarxiv icon

Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies

Add code
Jun 10, 2024
Figure 1 for Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies
Figure 2 for Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies
Figure 3 for Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies
Figure 4 for Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies
Viaarxiv icon

Learning to Stabilize Unknown LTI Systems on a Single Trajectory under Stochastic Noise

Add code
May 31, 2024
Figure 1 for Learning to Stabilize Unknown LTI Systems on a Single Trajectory under Stochastic Noise
Figure 2 for Learning to Stabilize Unknown LTI Systems on a Single Trajectory under Stochastic Noise
Figure 3 for Learning to Stabilize Unknown LTI Systems on a Single Trajectory under Stochastic Noise
Viaarxiv icon

Efficient Reinforcement Learning for Global Decision Making in the Presence of Local Agents at Scale

Add code
Mar 01, 2024
Figure 1 for Efficient Reinforcement Learning for Global Decision Making in the Presence of Local Agents at Scale
Figure 2 for Efficient Reinforcement Learning for Global Decision Making in the Presence of Local Agents at Scale
Figure 3 for Efficient Reinforcement Learning for Global Decision Making in the Presence of Local Agents at Scale
Viaarxiv icon