Picture for Xiawu Zheng

Xiawu Zheng

HASTE: Training-Free Video Diffusion Acceleration via Head-Wise Adaptive Sparse Attention

Add code
May 14, 2026
Viaarxiv icon

ALGOGEN: Tool-Generated Verifiable Traces for Reliable Algorithm Visualization

Add code
May 12, 2026
Viaarxiv icon

Motion-Aware Caching for Efficient Autoregressive Video Generation

Add code
May 03, 2026
Viaarxiv icon

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Add code
Apr 06, 2026
Viaarxiv icon

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Add code
Mar 24, 2026
Viaarxiv icon

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Add code
Mar 17, 2026
Viaarxiv icon

Event-Anchored Frame Selection for Effective Long-Video Understanding

Add code
Mar 01, 2026
Viaarxiv icon

Wavelet-based Frame Selection by Detecting Semantic Boundary for Long Video Understanding

Add code
Feb 28, 2026
Viaarxiv icon

Flow caching for autoregressive video generation

Add code
Feb 11, 2026
Viaarxiv icon

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Add code
Nov 19, 2025
Figure 1 for Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks
Figure 2 for Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks
Figure 3 for Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks
Figure 4 for Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks
Viaarxiv icon