Zifeng Zhuang

MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation

Mar 27, 2026

NFPO: Stabilized Policy Optimization of Normalizing Flow for Robotic Policy Learning

Mar 12, 2026

HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models

Dec 10, 2025

Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only

May 22, 2025

VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL

May 21, 2025

ReinboT: Amplifying Robot Visual-Language Manipulation with Reinforcement Learning

May 12, 2025

TDMPBC: Self-Imitative Reinforcement Learning for Humanoid Robot Control

Feb 24, 2025

Q-WSL: Leveraging Dynamic Programming for Weighted Supervised Learning in Goal-conditioned RL

Oct 10, 2024

ADR-BC: Adversarial Density Weighted Regression Behavior Cloning

May 28, 2024

DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation

May 23, 2024