Picture for Jianfei Chen

Jianfei Chen

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Add code
Feb 13, 2026
Viaarxiv icon

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Add code
Feb 13, 2026
Viaarxiv icon

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Add code
Dec 18, 2025
Viaarxiv icon

TetraJet-v2: Accurate NVFP4 Training for Large Language Models with Oscillation Suppression and Outlier Control

Add code
Oct 31, 2025
Viaarxiv icon

Task-Specific Zero-shot Quantization-Aware Training for Object Detection

Add code
Jul 22, 2025
Viaarxiv icon

SageAttention2++: A More Efficient Implementation of SageAttention2

Add code
May 28, 2025
Figure 1 for SageAttention2++: A More Efficient Implementation of SageAttention2
Figure 2 for SageAttention2++: A More Efficient Implementation of SageAttention2
Figure 3 for SageAttention2++: A More Efficient Implementation of SageAttention2
Figure 4 for SageAttention2++: A More Efficient Implementation of SageAttention2
Viaarxiv icon

LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models

Add code
May 25, 2025
Figure 1 for LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models
Figure 2 for LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models
Figure 3 for LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models
Figure 4 for LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models
Viaarxiv icon

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

Add code
May 24, 2025
Viaarxiv icon

SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

Add code
May 16, 2025
Viaarxiv icon

Accurate INT8 Training Through Dynamic Block-Level Fallback

Add code
Mar 11, 2025
Figure 1 for Accurate INT8 Training Through Dynamic Block-Level Fallback
Figure 2 for Accurate INT8 Training Through Dynamic Block-Level Fallback
Figure 3 for Accurate INT8 Training Through Dynamic Block-Level Fallback
Figure 4 for Accurate INT8 Training Through Dynamic Block-Level Fallback
Viaarxiv icon