Picture for Shiyi Zhan

Shiyi Zhan

SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning

Add code
Feb 02, 2026
Viaarxiv icon

Model Merging in Pre-training of Large Language Models

Add code
May 17, 2025
Viaarxiv icon