Picture for Jianfei Chen

Jianfei Chen

SageAttention2++: A More Efficient Implementation of SageAttention2

Add code
May 28, 2025
Viaarxiv icon

LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models

Add code
May 25, 2025
Viaarxiv icon

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

Add code
May 24, 2025
Viaarxiv icon

SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

Add code
May 16, 2025
Viaarxiv icon

Accurate INT8 Training Through Dynamic Block-Level Fallback

Add code
Mar 11, 2025
Viaarxiv icon

Oscillation-Reduced MXFP4 Training for Vision Transformers

Add code
Feb 28, 2025
Viaarxiv icon

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Add code
Feb 25, 2025
Viaarxiv icon

Elucidating the Preconditioning in Consistency Distillation

Add code
Feb 05, 2025
Viaarxiv icon

Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity

Add code
Feb 03, 2025
Figure 1 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Figure 2 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Figure 3 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Figure 4 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Viaarxiv icon

Visual Generation Without Guidance

Add code
Jan 26, 2025
Figure 1 for Visual Generation Without Guidance
Figure 2 for Visual Generation Without Guidance
Figure 3 for Visual Generation Without Guidance
Figure 4 for Visual Generation Without Guidance
Viaarxiv icon