Hangyu Mao

TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents

Aug 07, 2023
Jingqing Ruan, Yihong Chen, Bin Zhang, Zhiwei Xu, Tianpeng Bao, Guoqing Du, Shiwei Shi, Hangyu Mao, Xingyu Zeng, Rui Zhao

Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems

May 13, 2023
Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, Guoliang Fan

Transformer in Transformer as Backbone for Deep Reinforcement Learning

Jan 03, 2023
Hangyu Mao, Rui Zhao, Hao Chen, Jianye Hao, Yiqun Chen, Dong Li, Junge Zhang, Zhen Xiao

PTDE: Personalized Training with Distillated Execution for Multi-Agent Reinforcement Learning

Oct 17, 2022
Yiqun Chen, Hangyu Mao, Tianle Zhang, Shiguang Wu, Bin Zhang, Jianye Hao, Dong Li, Bin Wang, Hongxing Chang

API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks

Mar 10, 2022
Xiaotian Hao, Weixun Wang, Hangyu Mao, Yaodong Yang, Dong Li, Yan Zheng, Zhen Wang, Jianye Hao

SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition

Nov 17, 2021
Hangyu Mao, Chao Wang, Xiaotian Hao, Yihuan Mao, Yiming Lu, Chengjie Wu, Jianye Hao, Dong Li, Pingzhong Tang

Towards robust and domain agnostic reinforcement learning competitions

Jun 07, 2021
William Hebgen Guss, Stephanie Milani, Nicholay Topin, Brandon Houghton, Sharada Mohanty, Andrew Melnik, Augustin Harter, Benoit Buschmaas, Bjarne Jaster, Christoph Berganski, Dennis Heitkamp, Marko Henning, Helge Ritter, Chengjie Wu, Xiaotian Hao, Yiming Lu, Hangyu Mao, Yihuan Mao, Chao Wang, Michal Opanowicz, Anssi Kanervisto, Yanick Schraner, Christian Scheller, Xiren Zhou, Lu Liu, Daichi Nishio, Toi Tsuneda, Karolis Ramanauskas, Gabija Juceviciute

Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment

Jun 03, 2021
Tianze Zhou, Fubiao Zhang, Kun Shao, Kai Li, Wenhan Huang, Jun Luo, Weixun Wang, Yaodong Yang, Hangyu Mao, Bin Wang, Dong Li, Wulong Liu, Jianye Hao

Reward Design in Cooperative Multi-agent Reinforcement Learning for Packet Routing

Mar 05, 2020
Hangyu Mao, Zhibo Gong, Zhen Xiao
