Alert button
Picture for Fan-Ming Luo

Fan-Ming Luo

Alert button

Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 09, 2023
Fan-Ming Luo, Tian Xu, Xingchen Cao, Yang Yu

Figure 1 for Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Figure 2 for Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Figure 3 for Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Figure 4 for Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Viaarxiv icon

Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games

Add code
Bookmark button
Alert button
Aug 19, 2022
Rong-Jun Qin, Fan-Ming Luo, Hong Qian, Yang Yu

Figure 1 for Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games
Figure 2 for Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games
Figure 3 for Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games
Figure 4 for Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games
Viaarxiv icon

A Survey on Model-based Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 19, 2022
Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu

Figure 1 for A Survey on Model-based Reinforcement Learning
Viaarxiv icon

Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble

Add code
Bookmark button
Alert button
Jun 01, 2022
Fan-Ming Luo, Xingchen Cao, Yang Yu

Figure 1 for Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble
Figure 2 for Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble
Figure 3 for Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble
Figure 4 for Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble
Viaarxiv icon