Deheng Ye

Tencent Inc

Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination

Mar 05, 2024
Via arXiv

More Agents Is All You Need

Feb 03, 2024
Via arXiv

Affordable Generative Agents

Feb 03, 2024
Via arXiv

HGAttack: Transferable Heterogeneous Graph Adversarial Attack

Jan 18, 2024
Via arXiv

Replay-enhanced Continual Reinforcement Learning

Nov 20, 2023
Via arXiv

LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay

Oct 23, 2023
Via arXiv

Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints

Aug 24, 2023
Via arXiv

RLTF: Reinforcement Learning from Unit Test Feedback

Jul 10, 2023
Via arXiv

Future-conditioned Unsupervised Pretraining for Decision Transformer

May 26, 2023
Via arXiv

Deploying Offline Reinforcement Learning with Human Feedback

Mar 13, 2023
Via arXiv