Picture for Miao Rang

Miao Rang

Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation

Add code
Sep 30, 2025
Viaarxiv icon

Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs

Add code
May 07, 2025
Figure 1 for Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs
Figure 2 for Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs
Figure 3 for Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs
Figure 4 for Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs
Viaarxiv icon

Eve: Efficient Multimodal Vision Language Models with Elastic Visual Experts

Add code
Jan 08, 2025
Figure 1 for Eve: Efficient Multimodal Vision Language Models with Elastic Visual Experts
Figure 2 for Eve: Efficient Multimodal Vision Language Models with Elastic Visual Experts
Figure 3 for Eve: Efficient Multimodal Vision Language Models with Elastic Visual Experts
Figure 4 for Eve: Efficient Multimodal Vision Language Models with Elastic Visual Experts
Viaarxiv icon

Large OCR Model:An Empirical Study of Scaling Law for OCR

Add code
Jan 02, 2024
Viaarxiv icon