Xianyuan Zhan

Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies

Jun 26, 2024

Instruction-Guided Visual Masking

May 30, 2024

OMPO: A Unified Framework for RL under Policy and Dynamics Shifts

May 29, 2024

Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL

May 28, 2024

Policy Bifurcation in Safe Reinforcement Learning

Mar 28, 2024

DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning

Feb 28, 2024

A Comprehensive Survey of Cross-Domain Policy Transfer for Embodied Agents

Feb 07, 2024

ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update

Feb 01, 2024

Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model

Jan 19, 2024

FlexSSL: A Generic and Efficient Framework for Semi-Supervised Learning

Dec 28, 2023