Yaoyuan Wang

and Other Contributors

Accelerating Disaggregated RL for Visual Generative LLMs with Diffusion-Based Parallelism and Trainer-Assisted Generation

Jun 24, 2026
Via arXiv

GNMR: Runtime Stability Control for Low-Precision Large Language Model Training

May 30, 2026
Via arXiv

TurboGR: An Accelerated Training System for Large-Scale Generative Recommendation

May 13, 2026
Via arXiv

HiFloat4 Format for Language Model Pre-training on Ascend NPUs

Apr 09, 2026
Via arXiv

AgentCollab: A Self-Evaluation-Driven Collaboration Paradigm for Efficient LLM Agents

Mar 27, 2026
Via arXiv

RelayGR: Scaling Long-Sequence Generative Recommendation via Cross-Stage Relay-Race Inference

Jan 05, 2026
Via arXiv

Towards Universal Offline Black-Box Optimization via Learning Language Model Embeddings

Jun 08, 2025
Via arXiv

Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity

May 28, 2025
Via arXiv

TrimR: Verifier-based Training-Free Thinking Compression for Efficient Test-Time Scaling

May 22, 2025
Via arXiv

Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs

May 07, 2025
Via arXiv