Jintao Zhang

KernelBench-X: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

May 06, 2026, via arXiv

Speculative Decoding for Autoregressive Video Generation

Apr 19, 2026, via arXiv

SHIELD: A Segmented Hierarchical Memory Architecture for Energy-Efficient LLM Inference on Edge NPUs

Apr 08, 2026, via arXiv

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

Mar 19, 2026, via arXiv

SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing

Mar 09, 2026, via arXiv

HybridStitch: Pixel and Timestep Level Model Stitching for Diffusion Acceleration

Mar 08, 2026, via arXiv

SageBwd: A Trainable Low-bit Attention

Mar 02, 2026, via arXiv

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Feb 13, 2026, via arXiv

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Feb 13, 2026, via arXiv

Geometry-Aware Rotary Position Embedding for Consistent Video World Model

Feb 08, 2026, via arXiv