Fangcheng Shi

Decouple Searching from Training: Scaling Data Mixing via Model Merging for Large Language Model Pre-training

Jan 31, 2026
Via arXiv

RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services

Nov 10, 2025
Via arXiv