Picture for Sicheng Gao

Sicheng Gao

VLA-Thinker: Boosting Vision-Language-Action Models through Thinking-with-Image Reasoning

Add code
Mar 15, 2026
Viaarxiv icon

e5-omni: Explicit Cross-modal Alignment for Omni-modal Embeddings

Add code
Jan 07, 2026
Viaarxiv icon

AdaTooler-V: Adaptive Tool-Use for Images and Videos

Add code
Dec 19, 2025
Figure 1 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Figure 2 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Figure 3 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Figure 4 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Viaarxiv icon

DiTVR: Zero-Shot Diffusion Transformer for Video Restoration

Add code
Aug 11, 2025
Figure 1 for DiTVR: Zero-Shot Diffusion Transformer for Video Restoration
Figure 2 for DiTVR: Zero-Shot Diffusion Transformer for Video Restoration
Figure 3 for DiTVR: Zero-Shot Diffusion Transformer for Video Restoration
Figure 4 for DiTVR: Zero-Shot Diffusion Transformer for Video Restoration
Viaarxiv icon

ZONE: Zero-Shot Instruction-Guided Local Editing

Add code
Dec 28, 2023
Figure 1 for ZONE: Zero-Shot Instruction-Guided Local Editing
Figure 2 for ZONE: Zero-Shot Instruction-Guided Local Editing
Figure 3 for ZONE: Zero-Shot Instruction-Guided Local Editing
Figure 4 for ZONE: Zero-Shot Instruction-Guided Local Editing
Viaarxiv icon

IPDreamer: Appearance-Controllable 3D Object Generation with Image Prompts

Add code
Oct 09, 2023
Figure 1 for IPDreamer: Appearance-Controllable 3D Object Generation with Image Prompts
Figure 2 for IPDreamer: Appearance-Controllable 3D Object Generation with Image Prompts
Figure 3 for IPDreamer: Appearance-Controllable 3D Object Generation with Image Prompts
Figure 4 for IPDreamer: Appearance-Controllable 3D Object Generation with Image Prompts
Viaarxiv icon

Controllable Mind Visual Diffusion Model

Add code
May 18, 2023
Viaarxiv icon

Face Animation with an Attribute-Guided Diffusion Model

Add code
Apr 06, 2023
Figure 1 for Face Animation with an Attribute-Guided Diffusion Model
Figure 2 for Face Animation with an Attribute-Guided Diffusion Model
Figure 3 for Face Animation with an Attribute-Guided Diffusion Model
Figure 4 for Face Animation with an Attribute-Guided Diffusion Model
Viaarxiv icon

Implicit Diffusion Models for Continuous Super-Resolution

Add code
Mar 29, 2023
Figure 1 for Implicit Diffusion Models for Continuous Super-Resolution
Figure 2 for Implicit Diffusion Models for Continuous Super-Resolution
Figure 3 for Implicit Diffusion Models for Continuous Super-Resolution
Figure 4 for Implicit Diffusion Models for Continuous Super-Resolution
Viaarxiv icon