Picture for Hunter He

Hunter He

Layer-adaptive Expert Pruning for Pre-Training of Mixture-of-Experts Large Language Models

Add code
Jan 20, 2026
Viaarxiv icon

Yuan3.0 Flash: An Open Multimodal Large Language Model for Enterprise Applications

Add code
Jan 05, 2026
Viaarxiv icon