Jintao Zhang

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

Mar 19, 2026
Via arXiv

SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing

Mar 09, 2026
Via arXiv

HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration

Mar 08, 2026
Via arXiv

SageBwd: A Trainable Low-bit Attention

Mar 02, 2026
Via arXiv

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Feb 13, 2026
Via arXiv

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Feb 13, 2026
Via arXiv

Geometry-Aware Rotary Position Embedding for Consistent Video World Model

Feb 08, 2026
Via arXiv

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

Feb 03, 2026
Via arXiv

Residual Context Diffusion Language Models

Jan 30, 2026
Via arXiv

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Dec 18, 2025
Via arXiv