Alert button
Picture for Xiao Ma

Xiao Ma

Alert button

Let's Do a Thought Experiment: Using Counterfactuals to Improve Moral Reasoning

Add code
Bookmark button
Alert button
Jun 25, 2023
Xiao Ma, Swaroop Mishra, Ahmad Beirami, Alex Beutel, Jilin Chen

Figure 1 for Let's Do a Thought Experiment: Using Counterfactuals to Improve Moral Reasoning
Figure 2 for Let's Do a Thought Experiment: Using Counterfactuals to Improve Moral Reasoning
Viaarxiv icon

Offline Prioritized Experience Replay

Add code
Bookmark button
Alert button
Jun 08, 2023
Yang Yue, Bingyi Kang, Xiao Ma, Gao Huang, Shiji Song, Shuicheng Yan

Figure 1 for Offline Prioritized Experience Replay
Figure 2 for Offline Prioritized Experience Replay
Figure 3 for Offline Prioritized Experience Replay
Figure 4 for Offline Prioritized Experience Replay
Viaarxiv icon

Improving and Benchmarking Offline Reinforcement Learning Algorithms

Add code
Bookmark button
Alert button
Jun 01, 2023
Bingyi Kang, Xiao Ma, Yirui Wang, Yang Yue, Shuicheng Yan

Figure 1 for Improving and Benchmarking Offline Reinforcement Learning Algorithms
Figure 2 for Improving and Benchmarking Offline Reinforcement Learning Algorithms
Figure 3 for Improving and Benchmarking Offline Reinforcement Learning Algorithms
Figure 4 for Improving and Benchmarking Offline Reinforcement Learning Algorithms
Viaarxiv icon

Efficient Diffusion Policies for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
May 31, 2023
Bingyi Kang, Xiao Ma, Chao Du, Tianyu Pang, Shuicheng Yan

Figure 1 for Efficient Diffusion Policies for Offline Reinforcement Learning
Figure 2 for Efficient Diffusion Policies for Offline Reinforcement Learning
Figure 3 for Efficient Diffusion Policies for Offline Reinforcement Learning
Figure 4 for Efficient Diffusion Policies for Offline Reinforcement Learning
Viaarxiv icon

DiffMimic: Efficient Motion Mimicking with Differentiable Physics

Add code
Bookmark button
Alert button
Apr 26, 2023
Jiawei Ren, Cunjun Yu, Siwei Chen, Xiao Ma, Liang Pan, Ziwei Liu

Figure 1 for DiffMimic: Efficient Motion Mimicking with Differentiable Physics
Figure 2 for DiffMimic: Efficient Motion Mimicking with Differentiable Physics
Figure 3 for DiffMimic: Efficient Motion Mimicking with Differentiable Physics
Figure 4 for DiffMimic: Efficient Motion Mimicking with Differentiable Physics
Viaarxiv icon

Visualising Personal Data Flows: Insights from a Case Study of Booking.com

Add code
Bookmark button
Alert button
Apr 20, 2023
Haiyue Yuan, Matthew Boakes, Xiao Ma, Dongmei Cao, Shujun Li

Figure 1 for Visualising Personal Data Flows: Insights from a Case Study of Booking.com
Viaarxiv icon

Hi Sheldon! Creating Deep Personalized Characters from TV Shows

Add code
Bookmark button
Alert button
Apr 09, 2023
Meidai Xuanyuan, Yuwang Wang, Honglei Guo, Xiao Ma, Yuchen Guo, Tao Yu, Qionghai Dai

Figure 1 for Hi Sheldon! Creating Deep Personalized Characters from TV Shows
Figure 2 for Hi Sheldon! Creating Deep Personalized Characters from TV Shows
Figure 3 for Hi Sheldon! Creating Deep Personalized Characters from TV Shows
Figure 4 for Hi Sheldon! Creating Deep Personalized Characters from TV Shows
Viaarxiv icon

Benchmarking Deformable Object Manipulation with Differentiable Physics

Add code
Bookmark button
Alert button
Oct 24, 2022
Siwei Chen, Cunjun Yu, Yiqing Xu, Linfeng Li, Xiao Ma, Zhongwen Xu, David Hsu

Figure 1 for Benchmarking Deformable Object Manipulation with Differentiable Physics
Figure 2 for Benchmarking Deformable Object Manipulation with Differentiable Physics
Figure 3 for Benchmarking Deformable Object Manipulation with Differentiable Physics
Figure 4 for Benchmarking Deformable Object Manipulation with Differentiable Physics
Viaarxiv icon

RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 18, 2022
Wei Qiu, Xiao Ma, Bo An, Svetlana Obraztsova, Shuicheng Yan, Zhongwen Xu

Figure 1 for RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Figure 2 for RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Figure 3 for RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Figure 4 for RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Viaarxiv icon