Alert button
Picture for Mingfei Sun

Mingfei Sun

Alert button

Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers

Add code
Bookmark button
Alert button
Apr 28, 2022
Micah Carroll, Jessy Lin, Orr Paradise, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin

Figure 1 for Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers
Figure 2 for Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers
Figure 3 for Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers
Figure 4 for Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers
Viaarxiv icon

Generalization in Cooperative Multi-Agent Systems

Add code
Bookmark button
Alert button
Jan 31, 2022
Anuj Mahajan, Mikayel Samvelyan, Tarun Gupta, Benjamin Ellis, Mingfei Sun, Tim Rocktäschel, Shimon Whiteson

Figure 1 for Generalization in Cooperative Multi-Agent Systems
Figure 2 for Generalization in Cooperative Multi-Agent Systems
Figure 3 for Generalization in Cooperative Multi-Agent Systems
Figure 4 for Generalization in Cooperative Multi-Agent Systems
Viaarxiv icon

Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO

Add code
Bookmark button
Alert button
Jan 31, 2022
Mingfei Sun, Sam Devlin, Katja Hofmann, Shimon Whiteson

Figure 1 for Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO
Figure 2 for Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO
Figure 3 for Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO
Figure 4 for Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO
Viaarxiv icon

You May Not Need Ratio Clipping in PPO

Add code
Bookmark button
Alert button
Jan 31, 2022
Mingfei Sun, Vitaly Kurin, Guoqing Liu, Sam Devlin, Tao Qin, Katja Hofmann, Shimon Whiteson

Figure 1 for You May Not Need Ratio Clipping in PPO
Figure 2 for You May Not Need Ratio Clipping in PPO
Figure 3 for You May Not Need Ratio Clipping in PPO
Figure 4 for You May Not Need Ratio Clipping in PPO
Viaarxiv icon

Birds Eye View Social Distancing Analysis System

Add code
Bookmark button
Alert button
Dec 14, 2021
Zhengye Yang, Mingfei Sun, Hongzhe Ye, Zihao Xiong, Gil Zussman, Zoran Kostic

Figure 1 for Birds Eye View Social Distancing Analysis System
Figure 2 for Birds Eye View Social Distancing Analysis System
Figure 3 for Birds Eye View Social Distancing Analysis System
Figure 4 for Birds Eye View Social Distancing Analysis System
Viaarxiv icon

Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency

Add code
Bookmark button
Alert button
Dec 11, 2021
Mingfei Sun, Sam Devlin, Katja Hofmann, Shimon Whiteson

Figure 1 for Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency
Figure 2 for Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency
Figure 3 for Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency
Figure 4 for Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency
Viaarxiv icon

SoftDICE for Imitation Learning: Rethinking Off-policy Distribution Matching

Add code
Bookmark button
Alert button
Jun 06, 2021
Mingfei Sun, Anuj Mahajan, Katja Hofmann, Shimon Whiteson

Figure 1 for SoftDICE for Imitation Learning: Rethinking Off-policy Distribution Matching
Figure 2 for SoftDICE for Imitation Learning: Rethinking Off-policy Distribution Matching
Figure 3 for SoftDICE for Imitation Learning: Rethinking Off-policy Distribution Matching
Figure 4 for SoftDICE for Imitation Learning: Rethinking Off-policy Distribution Matching
Viaarxiv icon

Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings

Add code
Bookmark button
Alert button
Nov 25, 2020
Deheng Ye, Guibin Chen, Peilin Zhao, Fuhao Qiu, Bo Yuan, Wen Zhang, Sheng Chen, Mingfei Sun, Xiaoqian Li, Siqin Li, Jing Liang, Zhenjie Lian, Bei Shi, Liang Wang, Tengfei Shi, Qiang Fu, Wei Yang, Lanxiao Huang

Figure 1 for Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings
Figure 2 for Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings
Figure 3 for Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings
Figure 4 for Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings
Viaarxiv icon

Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?

Add code
Bookmark button
Alert button
Nov 18, 2020
Christian Schroeder de Witt, Tarun Gupta, Denys Makoviichuk, Viktor Makoviychuk, Philip H. S. Torr, Mingfei Sun, Shimon Whiteson

Figure 1 for Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?
Figure 2 for Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?
Figure 3 for Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?
Figure 4 for Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?
Viaarxiv icon

Investigating the Effects of Robot Engagement Communication on Learning from Demonstration

Add code
Bookmark button
Alert button
May 03, 2020
Mingfei Sun, Zhenhui Peng, Meng Xia, Xiaojuan Ma

Figure 1 for Investigating the Effects of Robot Engagement Communication on Learning from Demonstration
Figure 2 for Investigating the Effects of Robot Engagement Communication on Learning from Demonstration
Figure 3 for Investigating the Effects of Robot Engagement Communication on Learning from Demonstration
Figure 4 for Investigating the Effects of Robot Engagement Communication on Learning from Demonstration
Viaarxiv icon