Saining Zhang

LIBERO-Safety: A Comprehensive Benchmark for Physical and Semantic Safety in Vision-Language-Action Models

Jun 22, 2026

Imagine Before You Draw: Visual Prompt Engineering for Image Generation

Jun 03, 2026

Feedforward 3D Editing Learns from Semantic-Part Transformation

May 27, 2026

Dexora: Open-source VLA for High-DoF Bimanual Dexterity

May 18, 2026

ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors

Mar 04, 2026

Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation

Dec 29, 2025

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

Nov 14, 2025

GaussianArt: Unified Modeling of Geometry and Motion for Articulated Objects

Aug 20, 2025

Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting

Jun 06, 2025

Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

May 18, 2025