Picture for Zifeng Zhuang

Zifeng Zhuang

Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only

Add code
May 22, 2025
Viaarxiv icon

VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL

Add code
May 21, 2025
Viaarxiv icon

ReinboT: Amplifying Robot Visual-Language Manipulation with Reinforcement Learning

Add code
May 12, 2025
Viaarxiv icon

TDMPBC: Self-Imitative Reinforcement Learning for Humanoid Robot Control

Add code
Feb 24, 2025
Viaarxiv icon

Q-WSL:Leveraging Dynamic Programming for Weighted Supervised Learning in Goal-conditioned RL

Add code
Oct 10, 2024
Viaarxiv icon

ADR-BC: Adversarial Density Weighted Regression Behavior Cloning

Add code
May 28, 2024
Figure 1 for ADR-BC: Adversarial Density Weighted Regression Behavior Cloning
Figure 2 for ADR-BC: Adversarial Density Weighted Regression Behavior Cloning
Figure 3 for ADR-BC: Adversarial Density Weighted Regression Behavior Cloning
Figure 4 for ADR-BC: Adversarial Density Weighted Regression Behavior Cloning
Viaarxiv icon

DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation

Add code
May 23, 2024
Viaarxiv icon

Reinformer: Max-Return Sequence Modeling for offline RL

Add code
May 14, 2024
Viaarxiv icon

Context-Former: Stitching via Latent Conditioned Sequence Modeling

Add code
Feb 03, 2024
Viaarxiv icon

Adaptive Proximal Policy Optimization with Upper Confidence Bound

Add code
Dec 12, 2023
Viaarxiv icon