Weinan Zhang

Model-based Policy Optimization with Unsupervised Model Adaptation

Oct 28, 2020
Jian Shen, Han Zhao, Weinan Zhang, Yong Yu

Efficient Projection-Free Algorithms for Saddle Point Problems

Oct 21, 2020
Cheng Chen, Luo Luo, Weinan Zhang, Yong Yu

SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving

Oct 19, 2020
Ming Zhou, Jun Luo, Julian Villela, Yaodong Yang, David Rusu, Jiayu Miao, Weinan Zhang, Montgomery Alban, Iman Fadakar, Zheng Chen, Aurora Chongxi Huang, Ying Wen, Kimia Hassanzadeh, Daniel Graves, Dong Chen, Zhengbang Zhu, Nhat Nguyen, Mohamed Elsayed, Kun Shao, Sanjeevan Ahilan, Baokuan Zhang, Jiannan Wu, Zhengang Fu, Kasra Rezaee, Peyman Yadmellat, Mohsen Rohani, Nicolas Perez Nieves, Yihan Ni, Seyedershad Banijamali, Alexander Cowen Rivers, Zheng Tian, Daniel Palenicek, Haitham bou Ammar, Hongbo Zhang, Wulong Liu, Jianye Hao, Jun Wang

A Compare Aggregate Transformer for Understanding Document-grounded Dialogue

Oct 01, 2020
Longxuan Ma, Weinan Zhang, Runxin Sun, Ting Liu

GeneraLight: Improving Environment Generalization of Traffic Signal Control via Meta Reinforcement Learning

Sep 17, 2020
Chang Liu, Huichu Zhang, Weinan Zhang, Guanjie Zheng, Yong Yu

GIKT: A Graph-based Interaction Model for Knowledge Tracing

Sep 13, 2020
Yang Yang, Jian Shen, Yanru Qu, Yunfei Liu, Kerong Wang, Yaoming Zhu, Weinan Zhang, Yong Yu

Learning to Infer User Hidden States for Online Sequential Advertising

Sep 03, 2020
Zhaoqing Peng, Junqi Jin, Lan Luo, Yaodong Yang, Rui Luo, Jun Wang, Weinan Zhang, Haiyang Xu, Miao Xu, Chuan Yu, Tiejian Luo, Han Li, Jian Xu, Kun Gai

Glancing Transformer for Non-Autoregressive Neural Machine Translation

Aug 18, 2020
Lihua Qian, Hao Zhou, Yu Bao, Mingxuan Wang, Lin Qiu, Weinan Zhang, Yong Yu, Lei Li

Bidirectional Model-based Policy Optimization

Jul 04, 2020
Hang Lai, Jian Shen, Weinan Zhang, Yong Yu
