Picture for Zhibo Chen

Zhibo Chen

Training-Free Sparse Attention for Fast Video Generation via Offline Layer-Wise Sparsity Profiling and Online Bidirectional Co-Clustering

Add code
Mar 19, 2026
Viaarxiv icon

PhysVideo: Physically Plausible Video Generation with Cross-View Geometry Guidance

Add code
Mar 19, 2026
Viaarxiv icon

VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model

Add code
Feb 14, 2026
Viaarxiv icon

WorldArena: A Unified Benchmark for Evaluating Perception and Functional Utility of Embodied World Models

Add code
Feb 09, 2026
Viaarxiv icon

Back to Physics: Operator-Guided Generative Paths for SMS MRI Reconstruction

Add code
Feb 08, 2026
Viaarxiv icon

TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking

Add code
Oct 08, 2025
Viaarxiv icon

Comp-X: On Defining an Interactive Learned Image Compression Paradigm With Expert-driven LLM Agent

Add code
Aug 21, 2025
Viaarxiv icon

Structure-preserving Feature Alignment for Old Photo Colorization

Add code
Aug 18, 2025
Viaarxiv icon

LiftVSR: Lifting Image Diffusion to Video Super-Resolution via Hybrid Temporal Modeling with Only 4$\times$RTX 4090s

Add code
Jun 10, 2025
Figure 1 for LiftVSR: Lifting Image Diffusion to Video Super-Resolution via Hybrid Temporal Modeling with Only 4$\times$RTX 4090s
Figure 2 for LiftVSR: Lifting Image Diffusion to Video Super-Resolution via Hybrid Temporal Modeling with Only 4$\times$RTX 4090s
Figure 3 for LiftVSR: Lifting Image Diffusion to Video Super-Resolution via Hybrid Temporal Modeling with Only 4$\times$RTX 4090s
Figure 4 for LiftVSR: Lifting Image Diffusion to Video Super-Resolution via Hybrid Temporal Modeling with Only 4$\times$RTX 4090s
Viaarxiv icon

Why Compress What You Can Generate? When GPT-4o Generation Ushers in Image Compression Fields

Add code
Apr 30, 2025
Viaarxiv icon