Yiming Lu

Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL

Feb 14, 2022
Rui Yang, Yiming Lu, Wenzhe Li, Hao Sun, Meng Fang, Yali Du, Xiu Li, Lei Han, Chongjie Zhang

SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition

Nov 17, 2021
Hangyu Mao, Chao Wang, Xiaotian Hao, Yihuan Mao, Yiming Lu, Chengjie Wu, Jianye Hao, Dong Li, Pingzhong Tang

Towards robust and domain agnostic reinforcement learning competitions

Jun 07, 2021
William Hebgen Guss, Stephanie Milani, Nicholay Topin, Brandon Houghton, Sharada Mohanty, Andrew Melnik, Augustin Harter, Benoit Buschmaas, Bjarne Jaster, Christoph Berganski, Dennis Heitkamp, Marko Henning, Helge Ritter, Chengjie Wu, Xiaotian Hao, Yiming Lu, Hangyu Mao, Yihuan Mao, Chao Wang, Michal Opanowicz, Anssi Kanervisto, Yanick Schraner, Christian Scheller, Xiren Zhou, Lu Liu, Daichi Nishio, Toi Tsuneda, Karolis Ramanauskas, Gabija Juceviciute
