Alert button
Picture for Yujing Hu

Yujing Hu

Alert button

XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques

Add code
Bookmark button
Alert button
Feb 20, 2024
Yu Xiong, Zhipeng Hu, Ye Huang, Runze Wu, Kai Guan, Xingchen Fang, Ji Jiang, Tianze Zhou, Yujing Hu, Haoyu Liu, Tangjie Lyu, Changjie Fan

Viaarxiv icon

AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model

Add code
Bookmark button
Alert button
Oct 03, 2023
Zibin Dong, Yifu Yuan, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Tangjie Lv, Changjie Fan, Zhipeng Hu

Figure 1 for AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Figure 2 for AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Figure 3 for AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Figure 4 for AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Viaarxiv icon

Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 27, 2023
Jinyi Liu, Yi Ma, Jianye Hao, Yujing Hu, Yan Zheng, Tangjie Lv, Changjie Fan

Figure 1 for Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
Figure 2 for Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
Figure 3 for Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
Figure 4 for Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
Viaarxiv icon

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition

Add code
Bookmark button
Alert button
Mar 23, 2023
Stephanie Milani, Anssi Kanervisto, Karolis Ramanauskas, Sander Schulhoff, Brandon Houghton, Sharada Mohanty, Byron Galbraith, Ke Chen, Yan Song, Tianze Zhou, Bingquan Yu, He Liu, Kai Guan, Yujing Hu, Tangjie Lv, Federico Malato, Florian Leopold, Amogh Raut, Ville Hautamäki, Andrew Melnik, Shu Ishida, João F. Henriques, Robert Klassert, Walter Laurito, Ellen Novoseller, Vinicius G. Goecks, Nicholas Waytowich, David Watkins, Josh Miller, Rohin Shah

Figure 1 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Figure 2 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Figure 3 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Figure 4 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Viaarxiv icon

Adaptive Value Decomposition with Greedy Marginal Contribution Computation for Cooperative Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 14, 2023
Shanqi Liu, Yujing Hu, Runze Wu, Dong Xing, Yu Xiong, Changjie Fan, Kun Kuang, Yong Liu

Figure 1 for Adaptive Value Decomposition with Greedy Marginal Contribution Computation for Cooperative Multi-Agent Reinforcement Learning
Figure 2 for Adaptive Value Decomposition with Greedy Marginal Contribution Computation for Cooperative Multi-Agent Reinforcement Learning
Figure 3 for Adaptive Value Decomposition with Greedy Marginal Contribution Computation for Cooperative Multi-Agent Reinforcement Learning
Figure 4 for Adaptive Value Decomposition with Greedy Marginal Contribution Computation for Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon

Towards Skilled Population Curriculum for Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 07, 2023
Rundong Wang, Longtao Zheng, Wei Qiu, Bowei He, Bo An, Zinovi Rabinovich, Yujing Hu, Yingfeng Chen, Tangjie Lv, Changjie Fan

Figure 1 for Towards Skilled Population Curriculum for Multi-Agent Reinforcement Learning
Figure 2 for Towards Skilled Population Curriculum for Multi-Agent Reinforcement Learning
Figure 3 for Towards Skilled Population Curriculum for Multi-Agent Reinforcement Learning
Figure 4 for Towards Skilled Population Curriculum for Multi-Agent Reinforcement Learning
Viaarxiv icon

Neural Episodic Control with State Abstraction

Add code
Bookmark button
Alert button
Jan 27, 2023
Zhuo Li, Derui Zhu, Yujing Hu, Xiaofei Xie, Lei Ma, Yan Zheng, Yan Song, Yingfeng Chen, Jianjun Zhao

Figure 1 for Neural Episodic Control with State Abstraction
Figure 2 for Neural Episodic Control with State Abstraction
Figure 3 for Neural Episodic Control with State Abstraction
Figure 4 for Neural Episodic Control with State Abstraction
Viaarxiv icon

EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model

Add code
Bookmark button
Alert button
Oct 02, 2022
Yifu Yuan, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Jinyi Liu, Yingfeng Chen, Changjie Fan

Figure 1 for EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
Figure 2 for EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
Figure 3 for EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
Figure 4 for EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
Viaarxiv icon

Distributed Multi-Robot Obstacle Avoidance via Logarithmic Map-based Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Sep 14, 2022
Jiafeng Ma, Guangda chen, Yingfeng Chen, Yujing Hu, Changjie Fan, Jianming Zhang

Figure 1 for Distributed Multi-Robot Obstacle Avoidance via Logarithmic Map-based Deep Reinforcement Learning
Figure 2 for Distributed Multi-Robot Obstacle Avoidance via Logarithmic Map-based Deep Reinforcement Learning
Figure 3 for Distributed Multi-Robot Obstacle Avoidance via Logarithmic Map-based Deep Reinforcement Learning
Figure 4 for Distributed Multi-Robot Obstacle Avoidance via Logarithmic Map-based Deep Reinforcement Learning
Viaarxiv icon

Automatic Reward Design via Learning Motivation-Consistent Intrinsic Rewards

Add code
Bookmark button
Alert button
Jul 29, 2022
Yixiang Wang, Yujing Hu, Feng Wu, Yingfeng Chen

Figure 1 for Automatic Reward Design via Learning Motivation-Consistent Intrinsic Rewards
Figure 2 for Automatic Reward Design via Learning Motivation-Consistent Intrinsic Rewards
Figure 3 for Automatic Reward Design via Learning Motivation-Consistent Intrinsic Rewards
Figure 4 for Automatic Reward Design via Learning Motivation-Consistent Intrinsic Rewards
Viaarxiv icon