Picture for Feng Zhao

Feng Zhao

Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis

Add code
Jul 31, 2025
Viaarxiv icon

InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation

Add code
Jul 23, 2025
Viaarxiv icon

CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation

Add code
Jun 24, 2025
Viaarxiv icon

VideoMAR: Autoregressive Video Generatio with Continuous Tokens

Add code
Jun 18, 2025
Viaarxiv icon

DualFast: Dual-Speedup Framework for Fast Sampling of Diffusion Models

Add code
Jun 16, 2025
Viaarxiv icon

Adaptive Dropout: Unleashing Dropout across Layers for Generalizable Image Super-Resolution

Add code
Jun 15, 2025
Viaarxiv icon

AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views

Add code
May 29, 2025
Viaarxiv icon

VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning

Add code
May 28, 2025
Viaarxiv icon

FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis

Add code
May 02, 2025
Figure 1 for FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis
Figure 2 for FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis
Figure 3 for FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis
Figure 4 for FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis
Viaarxiv icon

AB-Cache: Training-Free Acceleration of Diffusion Models via Adams-Bashforth Cached Feature Reuse

Add code
Apr 13, 2025
Viaarxiv icon