Picture for Deheng Ye

Deheng Ye

Tencent Inc

Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models

Add code
Jul 04, 2024
Viaarxiv icon

Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination

Add code
Mar 05, 2024
Figure 1 for Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
Figure 2 for Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
Figure 3 for Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
Figure 4 for Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
Viaarxiv icon

Affordable Generative Agents

Add code
Feb 03, 2024
Viaarxiv icon

More Agents Is All You Need

Add code
Feb 03, 2024
Viaarxiv icon

HGAttack: Transferable Heterogeneous Graph Adversarial Attack

Add code
Jan 18, 2024
Figure 1 for HGAttack: Transferable Heterogeneous Graph Adversarial Attack
Figure 2 for HGAttack: Transferable Heterogeneous Graph Adversarial Attack
Figure 3 for HGAttack: Transferable Heterogeneous Graph Adversarial Attack
Figure 4 for HGAttack: Transferable Heterogeneous Graph Adversarial Attack
Viaarxiv icon

Replay-enhanced Continual Reinforcement Learning

Add code
Nov 20, 2023
Figure 1 for Replay-enhanced Continual Reinforcement Learning
Figure 2 for Replay-enhanced Continual Reinforcement Learning
Figure 3 for Replay-enhanced Continual Reinforcement Learning
Figure 4 for Replay-enhanced Continual Reinforcement Learning
Viaarxiv icon

LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay

Add code
Oct 23, 2023
Figure 1 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Figure 2 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Figure 3 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Figure 4 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Viaarxiv icon

Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints

Add code
Aug 24, 2023
Figure 1 for Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints
Figure 2 for Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints
Figure 3 for Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints
Figure 4 for Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints
Viaarxiv icon

RLTF: Reinforcement Learning from Unit Test Feedback

Add code
Jul 10, 2023
Figure 1 for RLTF: Reinforcement Learning from Unit Test Feedback
Figure 2 for RLTF: Reinforcement Learning from Unit Test Feedback
Figure 3 for RLTF: Reinforcement Learning from Unit Test Feedback
Figure 4 for RLTF: Reinforcement Learning from Unit Test Feedback
Viaarxiv icon

Future-conditioned Unsupervised Pretraining for Decision Transformer

Add code
May 26, 2023
Figure 1 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Figure 2 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Figure 3 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Figure 4 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Viaarxiv icon