Alert button
Picture for Yun Hua

Yun Hua

Alert button

Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym

Add code
Bookmark button
Alert button
Dec 06, 2023
Junjie Sheng, Zixiao Huang, Chuyun Shen, Wenhao Li, Yun Hua, Bo Jin, Hongyuan Zha, Xiangfeng Wang

Viaarxiv icon

VMAgent: Scheduling Simulator for Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 09, 2021
Junjie Sheng, Shengliang Cai, Haochuan Cui, Wenhao Li, Yun Hua, Bo Jin, Wenli Zhou, Yiqiu Hu, Lei Zhu, Qian Peng, Hongyuan Zha, Xiangfeng Wang

Figure 1 for VMAgent: Scheduling Simulator for Reinforcement Learning
Figure 2 for VMAgent: Scheduling Simulator for Reinforcement Learning
Viaarxiv icon

Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning

Add code
Bookmark button
Alert button
Feb 09, 2021
Wenhao Li, Xiangfeng Wang, Bo Jin, Junjie Sheng, Yun Hua, Hongyuan Zha

Figure 1 for Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning
Figure 2 for Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning
Figure 3 for Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning
Figure 4 for Structured Diversification Emergence via Reinforced Organization Control and Hierarchical Consensus Learning
Viaarxiv icon

Hyper-Meta Reinforcement Learning with Sparse Reward

Add code
Bookmark button
Alert button
Feb 11, 2020
Yun Hua, Xiangfeng Wang, Bo Jin, Wenhao Li, Junchi Yan, Xiaofeng He, Hongyuan Zha

Figure 1 for Hyper-Meta Reinforcement Learning with Sparse Reward
Figure 2 for Hyper-Meta Reinforcement Learning with Sparse Reward
Figure 3 for Hyper-Meta Reinforcement Learning with Sparse Reward
Figure 4 for Hyper-Meta Reinforcement Learning with Sparse Reward
Viaarxiv icon