Picture for Zidan Wang

Zidan Wang

Separators in Enhancing Autoregressive Pretraining for Vision Mamba

Add code
Mar 04, 2026
Viaarxiv icon

ITO: Images and Texts as One via Synergizing Multiple Alignment and Training-Time Fusion

Add code
Mar 04, 2026
Viaarxiv icon

iGVLM: Dynamic Instruction-Guided Vision Encoding for Question-Aware Multimodal Understanding

Add code
Mar 03, 2026
Viaarxiv icon

Solving Robotics Problems in Zero-Shot with Vision-Language Models

Add code
Jul 26, 2024
Viaarxiv icon

Cold Diffusion on the Replay Buffer: Learning to Plan from Known Good States

Add code
Oct 21, 2023
Figure 1 for Cold Diffusion on the Replay Buffer: Learning to Plan from Known Good States
Figure 2 for Cold Diffusion on the Replay Buffer: Learning to Plan from Known Good States
Figure 3 for Cold Diffusion on the Replay Buffer: Learning to Plan from Known Good States
Figure 4 for Cold Diffusion on the Replay Buffer: Learning to Plan from Known Good States
Viaarxiv icon