Picture for Xiuyu Li

Xiuyu Li

Michael Pokorny

OFA-Diffusion Compression: Compressing Diffusion Model in One-Shot Manner

Add code
Apr 14, 2026
Viaarxiv icon

$V_1$: Unifying Generation and Self-Verification for Parallel Reasoners

Add code
Mar 04, 2026
Viaarxiv icon

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

Add code
Feb 03, 2026
Viaarxiv icon

Reasoning and Tool-use Compete in Agentic RL:From Quantifying Interference to Disentangled Tuning

Add code
Feb 01, 2026
Viaarxiv icon

ETS: Energy-Guided Test-Time Scaling for Training-Free RL Alignment

Add code
Jan 29, 2026
Viaarxiv icon

Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning

Add code
Jan 16, 2026
Viaarxiv icon

StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation

Add code
Nov 10, 2025
Figure 1 for StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
Figure 2 for StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
Figure 3 for StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
Figure 4 for StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
Viaarxiv icon

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

Add code
May 24, 2025
Viaarxiv icon

Learning Adaptive Parallel Reasoning with Language Models

Add code
Apr 21, 2025
Viaarxiv icon

Token-Efficient Long Video Understanding for Multimodal LLMs

Add code
Mar 06, 2025
Figure 1 for Token-Efficient Long Video Understanding for Multimodal LLMs
Figure 2 for Token-Efficient Long Video Understanding for Multimodal LLMs
Figure 3 for Token-Efficient Long Video Understanding for Multimodal LLMs
Figure 4 for Token-Efficient Long Video Understanding for Multimodal LLMs
Viaarxiv icon