Xiaoteng Ma

SEABO: A Simple Search-Based Method for Offline Imitation Learning

Feb 06, 2024
Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Xiu Li, Zongqing Lu

What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?

Jun 02, 2023
Rui Yang, Yong Lin, Xiaoteng Ma, Hao Hu, Chongjie Zhang, Tong Zhang

Cross-Domain Policy Adaptation via Value-Guided Data Filtering

May 28, 2023
Kang Xu, Chenjia Bai, Xiaoteng Ma, Dong Wang, Bin Zhao, Zhen Wang, Xuelong Li, Wei Li

Learning Diverse Risk Preferences in Population-based Self-play

May 19, 2023
Yuhua Jiang, Qihan Liu, Xiaoteng Ma, Chenghao Li, Yiqin Yang, Jun Yang, Bin Liang, Qianchuan Zhao

Uncertainty-driven Trajectory Truncation for Model-based Offline Reinforcement Learning

Apr 10, 2023
Junjie Zhang, Jiafei Lyu, Xiaoteng Ma, Jiangpeng Yan, Jun Yang, Le Wan, Xiu Li

Single-Trajectory Distributionally Robust Reinforcement Learning

Jan 27, 2023
Zhipeng Liang, Xiaoteng Ma, Jose Blanchet, Jiheng Zhang, Zhengyuan Zhou

Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation

Sep 29, 2022
Xiaoteng Ma, Zhipeng Liang, Jose Blanchet, Mingwen Liu, Li Xia, Jiheng Zhang, Qianchuan Zhao, Zhengyuan Zhou

Exploiting Reward Shifting in Value-Based Deep RL

Sep 15, 2022
Hao Sun, Lei Han, Rui Yang, Xiaoteng Ma, Jian Guo, Bolei Zhou

Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning

Jun 15, 2022
Xiaoteng Ma, Shuai Ma, Li Xia, Qianchuan Zhao

Mildly Conservative Q-Learning for Offline Reinforcement Learning

Jun 09, 2022
Jiafei Lyu, Xiaoteng Ma, Xiu Li, Zongqing Lu
