Yuezhou Hu

Speculative Decoding for Autoregressive Video Generation

Apr 19, 2026

LoSA: Locality Aware Sparse Attention for Block-Wise Diffusion Language Models

Apr 13, 2026

Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution

Apr 09, 2026

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Feb 13, 2026

Residual Context Diffusion Language Models

Jan 30, 2026

ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs

Oct 06, 2025

S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training

Sep 13, 2024

Pruning Large Language Models with Semi-Structural Adaptive Sparse Training

Jul 30, 2024

Accelerating Transformer Pre-Training with 2:4 Sparsity

Apr 02, 2024