Alert button
Picture for Beining Han

Beining Han

Alert button

Infinite Photorealistic Worlds using Procedural Generation

Add code
Bookmark button
Alert button
Jun 26, 2023
Alexander Raistrick, Lahav Lipson, Zeyu Ma, Lingjie Mei, Mingzhe Wang, Yiming Zuo, Karhan Kayan, Hongyu Wen, Beining Han, Yihan Wang, Alejandro Newell, Hei Law, Ankit Goyal, Kaiyu Yang, Jia Deng

Figure 1 for Infinite Photorealistic Worlds using Procedural Generation
Figure 2 for Infinite Photorealistic Worlds using Procedural Generation
Figure 3 for Infinite Photorealistic Worlds using Procedural Generation
Figure 4 for Infinite Photorealistic Worlds using Procedural Generation
Viaarxiv icon

Learning Domain Invariant Representations in Goal-conditioned Block MDPs

Add code
Bookmark button
Alert button
Oct 28, 2021
Beining Han, Chongyi Zheng, Harris Chan, Keiran Paster, Michael R. Zhang, Jimmy Ba

Figure 1 for Learning Domain Invariant Representations in Goal-conditioned Block MDPs
Figure 2 for Learning Domain Invariant Representations in Goal-conditioned Block MDPs
Figure 3 for Learning Domain Invariant Representations in Goal-conditioned Block MDPs
Figure 4 for Learning Domain Invariant Representations in Goal-conditioned Block MDPs
Viaarxiv icon

On the Estimation Bias in Double Q-Learning

Add code
Bookmark button
Alert button
Sep 29, 2021
Zhizhou Ren, Guangxiang Zhu, Hao Hu, Beining Han, Jianglun Chen, Chongjie Zhang

Figure 1 for On the Estimation Bias in Double Q-Learning
Figure 2 for On the Estimation Bias in Double Q-Learning
Figure 3 for On the Estimation Bias in Double Q-Learning
Figure 4 for On the Estimation Bias in Double Q-Learning
Viaarxiv icon

Off-Policy Reinforcement Learning with Delayed Rewards

Add code
Bookmark button
Alert button
Jun 22, 2021
Beining Han, Zhizhou Ren, Zuofan Wu, Yuan Zhou, Jian Peng

Figure 1 for Off-Policy Reinforcement Learning with Delayed Rewards
Figure 2 for Off-Policy Reinforcement Learning with Delayed Rewards
Figure 3 for Off-Policy Reinforcement Learning with Delayed Rewards
Figure 4 for Off-Policy Reinforcement Learning with Delayed Rewards
Viaarxiv icon

Off-Policy Multi-Agent Decomposed Policy Gradients

Add code
Bookmark button
Alert button
Jul 24, 2020
Yihan Wang, Beining Han, Tonghan Wang, Heng Dong, Chongjie Zhang

Figure 1 for Off-Policy Multi-Agent Decomposed Policy Gradients
Figure 2 for Off-Policy Multi-Agent Decomposed Policy Gradients
Figure 3 for Off-Policy Multi-Agent Decomposed Policy Gradients
Viaarxiv icon

Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning

Add code
Bookmark button
Alert button
Jun 23, 2020
Jianhao Wang, Zhizhou Ren, Beining Han, Chongjie Zhang

Figure 1 for Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning
Figure 2 for Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning
Figure 3 for Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning
Viaarxiv icon