Xuemiao Zhang

Enhancing LLMs via High-Knowledge Data Selection

May 20, 2025
Via arXiv

FRAMES: Boosting LLMs with A Four-Quadrant Multi-Stage Pretraining Strategy

Feb 08, 2025
Via arXiv

FIRE: Flexible Integration of Data Quality Ratings for Effective Pre-Training

Feb 02, 2025
Via arXiv