Picture for Deheng Ye

Deheng Ye

Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination

Mar 05, 2024
Figure 1 for Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
Figure 2 for Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
Figure 3 for Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
Figure 4 for Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
Viaarxiv icon

Affordable Generative Agents

Add code
Feb 03, 2024
Viaarxiv icon

More Agents Is All You Need

Add code
Feb 03, 2024
Viaarxiv icon

HGAttack: Transferable Heterogeneous Graph Adversarial Attack

Jan 18, 2024
Figure 1 for HGAttack: Transferable Heterogeneous Graph Adversarial Attack
Figure 2 for HGAttack: Transferable Heterogeneous Graph Adversarial Attack
Figure 3 for HGAttack: Transferable Heterogeneous Graph Adversarial Attack
Figure 4 for HGAttack: Transferable Heterogeneous Graph Adversarial Attack
Viaarxiv icon

Replay-enhanced Continual Reinforcement Learning

Nov 20, 2023
Viaarxiv icon

LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay

Oct 23, 2023
Figure 1 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Figure 2 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Figure 3 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Figure 4 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Viaarxiv icon

Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints

Add code
Aug 24, 2023
Figure 1 for Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints
Figure 2 for Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints
Figure 3 for Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints
Figure 4 for Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints
Viaarxiv icon

RLTF: Reinforcement Learning from Unit Test Feedback

Add code
Jul 10, 2023
Figure 1 for RLTF: Reinforcement Learning from Unit Test Feedback
Figure 2 for RLTF: Reinforcement Learning from Unit Test Feedback
Figure 3 for RLTF: Reinforcement Learning from Unit Test Feedback
Figure 4 for RLTF: Reinforcement Learning from Unit Test Feedback
Viaarxiv icon

Future-conditioned Unsupervised Pretraining for Decision Transformer

Add code
May 26, 2023
Figure 1 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Figure 2 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Figure 3 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Figure 4 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Viaarxiv icon

Deploying Offline Reinforcement Learning with Human Feedback

Add code
Mar 13, 2023
Figure 1 for Deploying Offline Reinforcement Learning with Human Feedback
Figure 2 for Deploying Offline Reinforcement Learning with Human Feedback
Figure 3 for Deploying Offline Reinforcement Learning with Human Feedback
Figure 4 for Deploying Offline Reinforcement Learning with Human Feedback
Viaarxiv icon