Alert button
Picture for Li Zhao

Li Zhao

Alert button

An Adaptive Deep RL Method for Non-Stationary Environments with Piecewise Stable Context

Add code
Bookmark button
Alert button
Dec 24, 2022
Xiaoyu Chen, Xiangming Zhu, Yufeng Zheng, Pushi Zhang, Li Zhao, Wenxue Cheng, Peng Cheng, Yongqiang Xiong, Tao Qin, Jianyu Chen, Tie-Yan Liu

Figure 1 for An Adaptive Deep RL Method for Non-Stationary Environments with Piecewise Stable Context
Figure 2 for An Adaptive Deep RL Method for Non-Stationary Environments with Piecewise Stable Context
Figure 3 for An Adaptive Deep RL Method for Non-Stationary Environments with Piecewise Stable Context
Figure 4 for An Adaptive Deep RL Method for Non-Stationary Environments with Piecewise Stable Context
Viaarxiv icon

Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management

Add code
Bookmark button
Alert button
Dec 18, 2022
Yuandong Ding, Mingxiao Feng, Guozi Liu, Wei Jiang, Chuheng Zhang, Li Zhao, Lei Song, Houqiang Li, Yan Jin, Jiang Bian

Figure 1 for Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management
Figure 2 for Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management
Figure 3 for Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management
Figure 4 for Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management
Viaarxiv icon

TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets

Add code
Bookmark button
Alert button
Dec 05, 2022
Yuanying Cai, Chuheng Zhang, Li Zhao, Wei Shen, Xuyun Zhang, Lei Song, Jiang Bian, Tao Qin, Tieyan Liu

Figure 1 for TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets
Figure 2 for TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets
Figure 3 for TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets
Figure 4 for TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets
Viaarxiv icon

Inspector: Pixel-Based Automated Game Testing via Exploration, Detection, and Investigation

Add code
Bookmark button
Alert button
Jul 18, 2022
Guoqing Liu, Mengzhang Cai, Li Zhao, Tao Qin, Adrian Brown, Jimmy Bischoff, Tie-Yan Liu

Figure 1 for Inspector: Pixel-Based Automated Game Testing via Exploration, Detection, and Investigation
Figure 2 for Inspector: Pixel-Based Automated Game Testing via Exploration, Detection, and Investigation
Figure 3 for Inspector: Pixel-Based Automated Game Testing via Exploration, Detection, and Investigation
Figure 4 for Inspector: Pixel-Based Automated Game Testing via Exploration, Detection, and Investigation
Viaarxiv icon

Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret

Add code
Bookmark button
Alert button
May 25, 2022
Jiawei Huang, Li Zhao, Tao Qin, Wei Chen, Nan Jiang, Tie-Yan Liu

Figure 1 for Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret
Figure 2 for Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret
Viaarxiv icon

Fetal Brain Tissue Annotation and Segmentation Challenge Results

Add code
Bookmark button
Alert button
Apr 20, 2022
Kelly Payette, Hongwei Li, Priscille de Dumast, Roxane Licandro, Hui Ji, Md Mahfuzur Rahman Siddiquee, Daguang Xu, Andriy Myronenko, Hao Liu, Yuchen Pei, Lisheng Wang, Ying Peng, Juanying Xie, Huiquan Zhang, Guiming Dong, Hao Fu, Guotai Wang, ZunHyan Rieu, Donghyeon Kim, Hyun Gi Kim, Davood Karimi, Ali Gholipour, Helena R. Torres, Bruno Oliveira, João L. Vilaça, Yang Lin, Netanell Avisdris, Ori Ben-Zvi, Dafna Ben Bashat, Lucas Fidon, Michael Aertsen, Tom Vercauteren, Daniel Sobotka, Georg Langs, Mireia Alenyà, Maria Inmaculada Villanueva, Oscar Camara, Bella Specktor Fadida, Leo Joskowicz, Liao Weibin, Lv Yi, Li Xuesong, Moona Mazher, Abdul Qayyum, Domenec Puig, Hamza Kebiri, Zelin Zhang, Xinyi Xu, Dan Wu, KuanLun Liao, YiXuan Wu, JinTai Chen, Yunzhi Xu, Li Zhao, Lana Vasung, Bjoern Menze, Meritxell Bach Cuadra, Andras Jakab

Figure 1 for Fetal Brain Tissue Annotation and Segmentation Challenge Results
Figure 2 for Fetal Brain Tissue Annotation and Segmentation Challenge Results
Figure 3 for Fetal Brain Tissue Annotation and Segmentation Challenge Results
Figure 4 for Fetal Brain Tissue Annotation and Segmentation Challenge Results
Viaarxiv icon

Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality

Add code
Bookmark button
Alert button
Feb 14, 2022
Jiawei Huang, Jinglin Chen, Li Zhao, Tao Qin, Nan Jiang, Tie-Yan Liu

Figure 1 for Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality
Figure 2 for Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality
Viaarxiv icon

ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation

Add code
Bookmark button
Alert button
Dec 23, 2021
Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, Dianhai Yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang

Figure 1 for ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Figure 2 for ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Figure 3 for ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Figure 4 for ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Viaarxiv icon

Curriculum Offline Imitation Learning

Add code
Bookmark button
Alert button
Nov 03, 2021
Minghuan Liu, Hanye Zhao, Zhengyu Yang, Jian Shen, Weinan Zhang, Li Zhao, Tie-Yan Liu

Figure 1 for Curriculum Offline Imitation Learning
Figure 2 for Curriculum Offline Imitation Learning
Figure 3 for Curriculum Offline Imitation Learning
Figure 4 for Curriculum Offline Imitation Learning
Viaarxiv icon