Hangyu Mao

PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning

Dec 26, 2023
Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, Zhen Xiao, Junge Zhang, Jiangjin Yin

DuaLight: Enhancing Traffic Signal Control by Leveraging Scenario-Specific and Scenario-Shared Knowledge

Dec 22, 2023
Jiaming Lu, Jingqing Ruan, Haoyuan Jiang, Ziyue Li, Hangyu Mao, Rui Zhao

Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach

Nov 23, 2023
Bin Zhang, Hangyu Mao, Jingqing Ruan, Ying Wen, Yang Li, Shao Zhang, Zhiwei Xu, Dapeng Li, Ziyue Li, Rui Zhao, Lijuan Li, Guoliang Fan

TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems

Nov 19, 2023
Yilun Kong, Jingqing Ruan, Yihong Chen, Bin Zhang, Tianpeng Bao, Shiwei Shi, Guoqing Du, Xiaoru Hu, Hangyu Mao, Ziyue Li, Xingyu Zeng, Rui Zhao

Reboost Large Language Model-based Text-to-SQL, Text-to-Python, and Text-to-Function -- with Real Applications in Traffic Domain

Oct 31, 2023
Guanghu Sui, Zhishuai Li, Ziyue Li, Sun Yang, Jingqing Ruan, Hangyu Mao, Rui Zhao

TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents

Aug 07, 2023
Jingqing Ruan, Yihong Chen, Bin Zhang, Zhiwei Xu, Tianpeng Bao, Guoqing Du, Shiwei Shi, Hangyu Mao, Xingyu Zeng, Rui Zhao

Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems

May 13, 2023
Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, Guoliang Fan

Transformer in Transformer as Backbone for Deep Reinforcement Learning

Jan 03, 2023
Hangyu Mao, Rui Zhao, Hao Chen, Jianye Hao, Yiqun Chen, Dong Li, Junge Zhang, Zhen Xiao

PTDE: Personalized Training with Distillated Execution for Multi-Agent Reinforcement Learning

Oct 17, 2022
Yiqun Chen, Hangyu Mao, Tianle Zhang, Shiguang Wu, Bin Zhang, Jianye Hao, Dong Li, Bin Wang, Hongxing Chang
