Pan Xu

Localized Dynamics-Aware Domain Adaption for Off-Dynamics Offline Reinforcement Learning

Feb 24, 2026
Via arXiv

Policy Regularized Distributionally Robust Markov Decision Processes with Linear Function Approximation

Oct 16, 2025
Via arXiv

Rethinking Langevin Thompson Sampling from A Stochastic Approximation Perspective

Oct 06, 2025
Via arXiv

How to Provably Improve Return Conditioned Supervised Learning?

Jun 10, 2025
Via arXiv

MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning

Jun 10, 2025
Via arXiv

Linear Mixture Distributionally Robust Markov Decision Processes

May 23, 2025
Via arXiv

Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation

Nov 15, 2024
Via arXiv

Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning

Oct 30, 2024
Via arXiv

Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning

Sep 30, 2024
Via arXiv

Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer

Aug 02, 2024
Via arXiv