Picture for Lei Yuan

Lei Yuan

Speedup Patch: Learning a Plug-and-Play Policy to Accelerate Embodied Manipulation

Add code
Mar 21, 2026
Viaarxiv icon

MBD: A Model-Based Debiasing Framework Across User, Content, and Model Dimensions

Add code
Mar 15, 2026
Viaarxiv icon

Multi-agent In-context Coordination via Decentralized Memory Retrieval

Add code
Nov 13, 2025
Figure 1 for Multi-agent In-context Coordination via Decentralized Memory Retrieval
Figure 2 for Multi-agent In-context Coordination via Decentralized Memory Retrieval
Figure 3 for Multi-agent In-context Coordination via Decentralized Memory Retrieval
Figure 4 for Multi-agent In-context Coordination via Decentralized Memory Retrieval
Viaarxiv icon

Multi-agent Embodied AI: Advances and Future Directions

Add code
May 08, 2025
Viaarxiv icon

HyperZero: A Customized End-to-End Auto-Tuning System for Recommendation with Hourly Feedback

Add code
Jan 30, 2025
Figure 1 for HyperZero: A Customized End-to-End Auto-Tuning System for Recommendation with Hourly Feedback
Figure 2 for HyperZero: A Customized End-to-End Auto-Tuning System for Recommendation with Hourly Feedback
Figure 3 for HyperZero: A Customized End-to-End Auto-Tuning System for Recommendation with Hourly Feedback
Figure 4 for HyperZero: A Customized End-to-End Auto-Tuning System for Recommendation with Hourly Feedback
Viaarxiv icon

SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks

Add code
Nov 19, 2024
Figure 1 for SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks
Figure 2 for SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks
Figure 3 for SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks
Figure 4 for SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks
Viaarxiv icon

Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay

Add code
Nov 16, 2024
Figure 1 for Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay
Figure 2 for Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay
Figure 3 for Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay
Figure 4 for Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay
Viaarxiv icon

Q-Adapter: Training Your LLM Adapter as a Residual Q-Function

Add code
Jul 04, 2024
Viaarxiv icon

Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation

Add code
Mar 12, 2024
Figure 1 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Figure 2 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Figure 3 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Figure 4 for Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Viaarxiv icon

Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics

Add code
Feb 17, 2024
Figure 1 for Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Figure 2 for Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Figure 3 for Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Figure 4 for Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Viaarxiv icon