Alert button
Picture for Deheng Ye

Deheng Ye

Alert button

Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination

Add code
Bookmark button
Alert button
Mar 05, 2024
Liangzhou Wang, Kaiwen Zhu, Fengming Zhu, Xinghu Yao, Shujie Zhang, Deheng Ye, Haobo Fu, Qiang Fu, Wei Yang

Figure 1 for Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
Figure 2 for Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
Figure 3 for Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
Figure 4 for Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
Viaarxiv icon

Affordable Generative Agents

Add code
Bookmark button
Alert button
Feb 03, 2024
Yangbin Yu, Qin Zhang, Junyou Li, Qiang Fu, Deheng Ye

Viaarxiv icon

More Agents Is All You Need

Add code
Bookmark button
Alert button
Feb 03, 2024
Junyou Li, Qin Zhang, Yangbin Yu, Qiang Fu, Deheng Ye

Viaarxiv icon

HGAttack: Transferable Heterogeneous Graph Adversarial Attack

Add code
Bookmark button
Alert button
Jan 18, 2024
He Zhao, Zhiwei Zeng, Yongwei Wang, Deheng Ye, Chunyan Miao

Viaarxiv icon

Replay-enhanced Continual Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 20, 2023
Tiantian Zhang, Kevin Zehua Shen, Zichuan Lin, Bo Yuan, Xueqian Wang, Xiu Li, Deheng Ye

Viaarxiv icon

LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay

Add code
Bookmark button
Alert button
Oct 23, 2023
Yihuai Lan, Zhiqiang Hu, Lei Wang, Yang Wang, Deheng Ye, Peilin Zhao, Ee-Peng Lim, Hui Xiong, Hao Wang

Figure 1 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Figure 2 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Figure 3 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Figure 4 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Viaarxiv icon

Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints

Add code
Bookmark button
Alert button
Aug 24, 2023
Hanchi Huang, Li Shen, Deheng Ye, Wei Liu

Figure 1 for Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints
Figure 2 for Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints
Figure 3 for Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints
Figure 4 for Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints
Viaarxiv icon

RLTF: Reinforcement Learning from Unit Test Feedback

Add code
Bookmark button
Alert button
Jul 10, 2023
Jiate Liu, Yiqin Zhu, Kaiwen Xiao, Qiang Fu, Xiao Han, Wei Yang, Deheng Ye

Figure 1 for RLTF: Reinforcement Learning from Unit Test Feedback
Figure 2 for RLTF: Reinforcement Learning from Unit Test Feedback
Figure 3 for RLTF: Reinforcement Learning from Unit Test Feedback
Figure 4 for RLTF: Reinforcement Learning from Unit Test Feedback
Viaarxiv icon

Future-conditioned Unsupervised Pretraining for Decision Transformer

Add code
Bookmark button
Alert button
May 26, 2023
Zhihui Xie, Zichuan Lin, Deheng Ye, Qiang Fu, Wei Yang, Shuai Li

Figure 1 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Figure 2 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Figure 3 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Figure 4 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Viaarxiv icon

Deploying Offline Reinforcement Learning with Human Feedback

Add code
Bookmark button
Alert button
Mar 13, 2023
Ziniu Li, Ke Xu, Liu Liu, Lanqing Li, Deheng Ye, Peilin Zhao

Figure 1 for Deploying Offline Reinforcement Learning with Human Feedback
Figure 2 for Deploying Offline Reinforcement Learning with Human Feedback
Figure 3 for Deploying Offline Reinforcement Learning with Human Feedback
Figure 4 for Deploying Offline Reinforcement Learning with Human Feedback
Viaarxiv icon