Picture for Jun Zhu

Jun Zhu

Tsinghua University

Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

Add code
May 23, 2025
Viaarxiv icon

Scaling Diffusion Transformers Efficiently via $μ$P

Add code
May 21, 2025
Viaarxiv icon

SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

Add code
May 16, 2025
Viaarxiv icon

Video4DGen: Enhancing Video and 4D Generation through Mutual Optimization

Add code
Apr 05, 2025
Viaarxiv icon

DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning

Add code
Mar 19, 2025
Figure 1 for DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
Figure 2 for DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
Figure 3 for DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
Figure 4 for DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
Viaarxiv icon

DiffGAP: A Lightweight Diffusion Module in Contrastive Space for Bridging Cross-Model Gap

Add code
Mar 15, 2025
Figure 1 for DiffGAP: A Lightweight Diffusion Module in Contrastive Space for Bridging Cross-Model Gap
Figure 2 for DiffGAP: A Lightweight Diffusion Module in Contrastive Space for Bridging Cross-Model Gap
Figure 3 for DiffGAP: A Lightweight Diffusion Module in Contrastive Space for Bridging Cross-Model Gap
Figure 4 for DiffGAP: A Lightweight Diffusion Module in Contrastive Space for Bridging Cross-Model Gap
Viaarxiv icon

Accurate INT8 Training Through Dynamic Block-Level Fallback

Add code
Mar 11, 2025
Figure 1 for Accurate INT8 Training Through Dynamic Block-Level Fallback
Figure 2 for Accurate INT8 Training Through Dynamic Block-Level Fallback
Figure 3 for Accurate INT8 Training Through Dynamic Block-Level Fallback
Figure 4 for Accurate INT8 Training Through Dynamic Block-Level Fallback
Viaarxiv icon

UAR-NVC: A Unified AutoRegressive Framework for Memory-Efficient Neural Video Compression

Add code
Mar 04, 2025
Viaarxiv icon

Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator

Add code
Mar 03, 2025
Viaarxiv icon

Oscillation-Reduced MXFP4 Training for Vision Transformers

Add code
Feb 28, 2025
Figure 1 for Oscillation-Reduced MXFP4 Training for Vision Transformers
Figure 2 for Oscillation-Reduced MXFP4 Training for Vision Transformers
Figure 3 for Oscillation-Reduced MXFP4 Training for Vision Transformers
Figure 4 for Oscillation-Reduced MXFP4 Training for Vision Transformers
Viaarxiv icon