Picture for Deheng Ye

Deheng Ye

Tencent Inc

Affordable Generative Agents

Add code
Feb 03, 2024
Figure 1 for Affordable Generative Agents
Figure 2 for Affordable Generative Agents
Figure 3 for Affordable Generative Agents
Figure 4 for Affordable Generative Agents
Viaarxiv icon

HGAttack: Transferable Heterogeneous Graph Adversarial Attack

Add code
Jan 18, 2024
Figure 1 for HGAttack: Transferable Heterogeneous Graph Adversarial Attack
Figure 2 for HGAttack: Transferable Heterogeneous Graph Adversarial Attack
Figure 3 for HGAttack: Transferable Heterogeneous Graph Adversarial Attack
Figure 4 for HGAttack: Transferable Heterogeneous Graph Adversarial Attack
Viaarxiv icon

Replay-enhanced Continual Reinforcement Learning

Add code
Nov 20, 2023
Figure 1 for Replay-enhanced Continual Reinforcement Learning
Figure 2 for Replay-enhanced Continual Reinforcement Learning
Figure 3 for Replay-enhanced Continual Reinforcement Learning
Figure 4 for Replay-enhanced Continual Reinforcement Learning
Viaarxiv icon

LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay

Add code
Oct 23, 2023
Figure 1 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Figure 2 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Figure 3 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Figure 4 for LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay
Viaarxiv icon

Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints

Add code
Aug 24, 2023
Viaarxiv icon

RLTF: Reinforcement Learning from Unit Test Feedback

Add code
Jul 10, 2023
Figure 1 for RLTF: Reinforcement Learning from Unit Test Feedback
Figure 2 for RLTF: Reinforcement Learning from Unit Test Feedback
Figure 3 for RLTF: Reinforcement Learning from Unit Test Feedback
Figure 4 for RLTF: Reinforcement Learning from Unit Test Feedback
Viaarxiv icon

Future-conditioned Unsupervised Pretraining for Decision Transformer

Add code
May 26, 2023
Figure 1 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Figure 2 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Figure 3 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Figure 4 for Future-conditioned Unsupervised Pretraining for Decision Transformer
Viaarxiv icon

Deploying Offline Reinforcement Learning with Human Feedback

Add code
Mar 13, 2023
Figure 1 for Deploying Offline Reinforcement Learning with Human Feedback
Figure 2 for Deploying Offline Reinforcement Learning with Human Feedback
Figure 3 for Deploying Offline Reinforcement Learning with Human Feedback
Figure 4 for Deploying Offline Reinforcement Learning with Human Feedback
Viaarxiv icon

Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization

Add code
Feb 05, 2023
Viaarxiv icon

Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning

Add code
Jan 20, 2023
Viaarxiv icon